Understand the available file types
Datasets are generally provided in file types that are optimized for data analysis: most commonly .csv (which opens in Excel), as well as file types specific to statistical software such as SPSS, Stata, SAS and R.
- FYI: File types such as PDFs or image files (e.g., JPEG) will not allow you to analyze, clean, sort, filter or otherwise manipulate the data for your purposes. If the data are essentially "read only", can you actually make any use of them?
- Data are occasionally provided in document files (e.g., Word tables), which would require you to copy and paste them into a data-analysis-friendly file format before you can undertake your own analysis.
- Be aware of the additional time required to move the data into a new program, and of the potential for errors to arise during the process.
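One common transfer error is a cell that gets lost or split when a table is copied out of a document, leaving a row with the wrong number of columns. The Python sketch below shows one possible way to check for this after saving the data as .csv. The function name `find_ragged_rows` and the sample data are hypothetical, made up purely for illustration, and the check only catches rows whose width differs from the header row.

```python
import csv
import io


def find_ragged_rows(csv_text):
    """Return (row_number, field_count) for each row whose number of
    fields differs from the header row. Row numbers start at 1 (the header)."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, None)
    if header is None:
        return []  # empty input: nothing to check
    problems = []
    for row_number, row in enumerate(reader, start=2):
        if len(row) != len(header):
            problems.append((row_number, len(row)))
    return problems


# Hypothetical data pasted from a document table; the "South" row lost a cell.
sample = "region,year,count\nNorth,2020,15\nSouth,2021\nEast,2022,9\n"
print(find_ragged_rows(sample))
```

For a real file, the text could be read with `open(path, newline="").read()` before being passed to the function. A clean result does not prove the data transferred correctly; it only rules out one kind of error.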
Software Considerations
- Do you have access to a spreadsheet or statistical software program, such as Excel, SPSS, Stata, SAS or R?
- Do you know how to use the relevant program?
- Do you have the time or funds to undertake training in the relevant program?
For some guidance on statistical programs / data analysis see: