Data repositories

Do you have datasets that you would like to share publicly but don't know where to store them?

Projects in the INCF Community

NITRC Image Repository

From nitrc.org: “NITRC Image Repository (NITRC-IR) does for data what NITRC-R does for tools. Search for and freely download publicly available data sets including thousands of DICOM and NIFTI normal subjects and those with diagnoses such as: schizophrenia, ADHD, autism, and Parkinson's.”

GIN

The G-Node Data Infrastructure (GIN) services provide a platform for comprehensive and reproducible management of scientific data. Building on well established versioning technology, GIN offers the power of a web based repository management service (inspired by GitHub) combined with a distributed file storage open to established cloud-based services. The service addresses the range of data workflows starting from your analysis scripts on the local workstation to remote collaboration and data publication.

DataLad

From datalad.org: “DataLad aims to provide access to scientific data available from various sources (e.g. lab or consortium web-sites such as Human connectome; data sharing portals such as NITRC, OpenFMRI and CRCNS) through a single convenient interface and integrated with your software package managers (such as APT in Debian). Although initially targeting neuroimaging and neuroscience data in general, it will not be limited by the domain and we would welcome a wide range of contributions.”

Public domain-specific and general data repositories

This list is a sub-list of Scientific Data's recommended data repositories

Neuroscience

NeuroMorpho.org
Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (FCP/INDI)
OpenfMRI

Functional genomics

ArrayExpress
Gene Expression Omnibus (GEO)
GenomeRNAi
dbGAP
The European Genome-phenome Archive (EGA)
Database of Interacting Proteins (DIP)
IntAct
Japanese Genotype-phenotype Archive (JGA)
Biological General Repository for Interaction Datasets *
NCBI PubChem BioAssay

Taxonomy & species diversity

Integrated Taxonomic Information System (ITIS)
KNB: The Knowledge Network for Biocomplexity
NCBI Taxonomy *
Global Biodiversity Information Facility (GBIF)
Morphobank.org

Mathematical & modelling resources

BioModels Database
Kinetic Models of Biological Systems (KiMoSys)

Organism-focused resources

Eukaryotic Pathogen Database Resources (EuPathDB)
FlyBase
Influenza Research Database
Mouse Genome Informatics (MGI)
Rat Genome Database (RGD)
VectorBase
Xenbase
Zebrafish Model Organism Database (ZFIN)

Generalist repositories

Dryad Digital Repository
figshare
Harvard Dataverse
Open Science Framework
Zenodo

*Curated resource which may not accept direct submission of data. Contact the database directly for further information.