Research data are too valuable to risk them becoming inaccessible or unusable due to avoidable issues such as software obsolescence or failing to include information about the codes and metadata used to describe your data as well as the scripts and programs used to process/clean your data. |
If it's not possible to use an open file format and/or software program carefully consider your options for moving your data onto a new program if the original becomes obsolete/inaccessible.
The Research Data Management Workbook - Briney, K. Caltech Libraries [Workbook]
a collection of exercises for researchers to improve their data management. The Workbook contains exercises across the data lifecycle, including writing project-level READMe files & data dictionaries, setting up file organization/naming systems, choosing storage and backup solutions and much more.
|
Document your folder names & structure in a ReadMe file. Outline the content and organization of each folder, who the file authors/contributors are, any changes you make as you make them along with anything else a 3rd party needs to know to understand what each version of your data represents.
Make a copy of your original/raw dataset and save it to a read-only, password protected file to prevent anyone from accidentally overwriting it, deleting it etc. - and to provide the clearest possible version history for your data as you work through all the data processing and analysis stages.
Note that files in the Microsoft Cloud platform are auto-saved every few seconds. For this reason, it’s best to click the editing drop-down menu and then “Open in Desktop App” before beginning any work on your files. Depending on the version you have, the desktop versions of Word, Excel and PowerPoint either do not save your edits until you click save or save as OR have a clearly labelled "Auto-save - ON/OFF" button in the upper left of the application – making it much easier to |
Be consistent with your file name elements, such as:
Don't use spaces or special characters in your file names, e.g., & ? # . which could interfere with / be misunderstood by computer operating systems - leading to errors.
Periods should not be part of your file name other than where added automatically by your software application to indicate the file format extension, e.g., .jpg, .csv etc.
The Research Data Management Workbook - Briney, K. Caltech Libraries [Workbook]
a collection of exercises for researchers to improve their data management. The Workbook contains exercises across the data lifecycle, including writing project-level READMe files & data dictionaries, setting up file organization/naming systems, choosing storage and backup solutions and much more.