Where can I deposit my data?
On this page
Where can I deposit my data?
Towards the end of the research process, researchers should deposit their data along with the appropriate documentation and metadata that will allow the interpretation and reuse of the dataset–subject to the data-owner approval and under conditions that respect research ethic standards such as the protection of human respondents’ anonymity. Listed below are the main repository options for Concordia’s researchers.
These accept dataset related to either a specific discipline (e.g. genomics) or a broad subject-area (e.g. social sciences). Some repositories allow for self-archival and will provide limited or no curation service; others, like ICPSR, will provide in-depth curation services to subscribing institutions (Concordia is an ICPSR member) provided that the data fits within their collection development policy.
Examples of discipline-specific repositories
A small but growing number of journals are requiring that researchers make the data associated with their papers publicly available to facilitate verification and replication of results. The data can be deposited in any repository, but services like Dryad (http://datadryad.org/) collaborate with journal publishers to coordinate the deposit of manuscripts with the submission of data. The cost of depositing data is either covered by the submitter or by a sponsoring organization. See this Dryad information page for details.
In a parallel development, a new type of publications called data journals or metadata journals has recently emerged. The data papers published in these journals describe publicly available datasets with the objectives of explaining the methodology or instrumentation used in the creation of datasets, and increasing their visibility and reuse.
There is an increasing number of affordable or even free options for researchers who wish to deposit and share their datasets themselves. These are suitable for small or medium-sized datasets that do not require specialized curation. The following are some of the main repositories available:
Dataverse is an open source platform created by Harvard’s Institute for Quantitative Social Sciences that allows institutions, research teams and individual researchers to archive and share their datasets. Scholars Portal, a technological infrastructure provided by the Ontario Council of University Libraries (OCUL), is hosting an instance of Dataverse. Creating an account on the Scholars Portal Dataverse is free and allows researchers to publish their datasets with accompanying documentation.
Zenodo is a multidisciplinary platform hosted by CERN. Accepts "all research outputs from all fields of science [...]. In the upload form you can choose between types of files: publications (book, book section, conference paper, journal article, patent, preprint, report, thesis, technical note, working paper, etc.), posters, presentations, datasets, images (figures, plots, drawings, diagrams, photos), software, videos/audio and interactive materials such as lessons."
Open ICPSR accepts social and behavioural science research data. Different levels of curation services (from none to complete) are offered at varying prices.
Figshare allows researchers to publicly share their data (unlimited storage for public data) or to securely archive them (1GB of storage space for restricted data).
Concordia’s institutional repository
We encourage researchers to use a discipline-specific repository such as the ones listed in the previous sections as those platforms are generally better suited for data curation and dissemination in your research area. However, if there is no suitable discipline-specific repository for your dataset, you should consider using Spectrum, Concordia University's institutional repository.
Directory of data repositories
Registry of Data Repositories (http://www.re3data.org/)