Documenting your data

How do I document my data?

Documentation can take many forms. It can be written in free text, such as a readme file, or the metadata can be captured in a structured, machine readable file, encoded using an xml format.

Structured, discipline specific metadata is preferable, but if no standard exists, writing “readme” style files are the most simple way of recording metadata.

Readme files

A readme file provides information about a data file. It allows yourself and others to understand and reuse the data at a later date.

Best practices:

Follow the Cornell guide to writing "readme" files.

  • Start writing the readme files at the beginning of the research project.
  • Record the information in a text file (.txt)
  • Use a template to help guide you, but tailor it to the needs of the project and kind of data that is being documented. Template examples:
  • Update the file as the research progresses.
  • When the research is complete and ready to be shared, deposit the readme file alongside the data in a repository.

Data dictionaries & codebooks

Data dictionaries and codebooks provide variable-level metadata. These two types of documents may provide overlapping information.


Lab notebooks

Lab notebooks (print or online) are also a great way to document your research. They include methodology, results, calculations, etc. They are helpful for publishing, sharing, or reproducing your research.


Metadata standards

Find out if your discipline uses a metadata standard to describe data. In fact, specific disciplinary data repositories may require a formal standard. These metadata files are often saved in a machine readable format, such as xml. There are tools that can help with the creation of these metadata files. See the Tools section for more information.

To find an appropriate metadata standard for your discipline, consult the following resources:


 
Back to top arrow up, go to top of page