Introduction
|
|
Organising data in spreadsheets
|
Never modify your raw data. Always make a copy before making any changes.
Keep track of all of the steps you take to clean your data in a plain text file.
Organise your data according to tidy data principles.
Record metadata in a separate plain text file (such as README.txt) in your project root folder or in the folder that holds your data.
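The first two points above can be sketched in a few lines of Python. This is only an illustration, not a prescribed workflow: the function names make_working_copy and log_step, and any file or folder names you pass to them, are hypothetical.

```python
import shutil
from datetime import date
from pathlib import Path


def make_working_copy(raw_path, work_dir):
    """Copy the raw file into work_dir and return the path of the copy."""
    raw_path = Path(raw_path)
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    working_path = work_dir / raw_path.name
    # Only the copy is meant to be edited; the raw file is read, never written.
    shutil.copy2(raw_path, working_path)
    return working_path


def log_step(log_path, message):
    """Append one dated line describing a cleaning step to a plain text log."""
    with open(log_path, "a", encoding="utf-8") as log:
        log.write(f"{date.today().isoformat()}  {message}\n")
```

For example, make_working_copy("data_raw/survey.csv", "data_working") followed by log_step("cleaning_log.txt", "Copied raw survey file before editing") leaves the raw file untouched and starts the record of cleaning steps.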
|
Common spreadsheet errors
|
Include only one piece of information in a cell.
Avoid using multiple tables or spreading data across multiple tabs within one spreadsheet.
Record zeros as zeros.
Avoid spaces, numbers and special characters in column headers.
Avoid special characters in your data.
Use an appropriate null value to record missing data.
Record units in column headers.
Place comments in a separate column.
Do not use formatting to convey information.
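The advice on column headers can be checked automatically. The sketch below assumes one possible convention (letters and underscores only, starting with a letter); the pattern, the function name problem_headers and the example headers are illustrative assumptions, not part of the lesson.

```python
import re

# Assumed convention: a header starts with a letter and contains
# only letters and underscores (no spaces, digits or special characters).
HEADER_PATTERN = re.compile(r"[A-Za-z][A-Za-z_]*")


def problem_headers(headers):
    """Return the headers that break the assumed naming convention."""
    return [h for h in headers if not HEADER_PATTERN.fullmatch(h)]


# Made-up headers for illustration; "weight_kg" records the unit in the header.
headers = ["respondent_id", "Weight (kg)", "2016_income", "village name", "weight_kg"]
flagged = problem_headers(headers)
print(flagged)
```

Here "Weight (kg)", "2016_income" and "village name" are flagged, while "respondent_id" and "weight_kg" pass.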
|
Dates as data
|
|
Quality assurance and control
|
|
Exporting data
|
Data stored in common spreadsheet formats will often not be read correctly into data analysis software, introducing errors into your data.
Exporting data from spreadsheets to formats like CSV or TSV puts it in a format that can be used consistently by most programs.
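To show what these plain text formats look like, here is a minimal sketch using Python's standard csv module. The rows are invented for illustration, and to_delimited and from_delimited are hypothetical helper names. In practice the export itself would usually be done from the spreadsheet program.

```python
import csv
import io

# Invented example rows; note the comma inside the comments value.
FIELDS = ["respondent_id", "village", "comments"]
ROWS = [
    {"respondent_id": "1", "village": "A", "comments": "roof repaired, 2016"},
    {"respondent_id": "2", "village": "B", "comments": "NA"},
]


def to_delimited(rows, fieldnames, delimiter):
    """Write rows as delimited text (',' for CSV, a tab for TSV)."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, delimiter=delimiter, lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def from_delimited(text, delimiter):
    """Read delimited text back into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text, newline=""), delimiter=delimiter))


csv_text = to_delimited(ROWS, FIELDS, ",")
tsv_text = to_delimited(ROWS, FIELDS, "\t")
print(csv_text)
```

In the CSV output, the value containing a comma is wrapped in double quotes so that it still reads back as a single field. In the TSV output no quoting is needed for that value, because the separator is a tab.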
|
Survey
|
|