Data repositories

Do you have datasets that you would like to share publicly but don't know where to store them?

Projects in the INCF Community

NITRC Image Repository

From the project's description: “NITRC Image Repository (NITRC-IR) does for data what NITRC-R does for tools. Search for and freely download publicly available data sets including thousands of DICOM and NIFTI normal subjects and those with diagnoses such as: schizophrenia, ADHD, autism, and Parkinson's.”


GIN

The G-Node Data Infrastructure (GIN) services provide a platform for comprehensive and reproducible management of scientific data. Building on well-established versioning technology, GIN offers the power of a web-based repository management service (inspired by GitHub) combined with distributed file storage that is open to established cloud-based services. The service covers the full range of data workflows, from analysis scripts on your local workstation to remote collaboration and data publication.


DataLad

From the project's description: “DataLad aims to provide access to scientific data available from various sources (e.g. lab or consortium web-sites such as Human connectome; data sharing portals such as NITRC, OpenFMRI and CRCNS) through a single convenient interface and integrated with your software package managers (such as APT in Debian). Although initially targeting neuroimaging and neuroscience data in general, it will not be limited by the domain and we would welcome a wide range of contributions.”

Public domain-specific and general data repositories

This list is a subset of Scientific Data's recommended data repositories.

Neuroscience

Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (FCP/INDI)

Functional genomics

Gene Expression Omnibus (GEO)
The European Genome-phenome Archive (EGA)
Database of Interacting Proteins (DIP)
Japanese Genotype-phenotype Archive (JGA)
Biological General Repository for Interaction Datasets (BioGRID) *
NCBI PubChem BioAssay

Taxonomy & species diversity

Integrated Taxonomic Information System (ITIS)
KNB: The Knowledge Network for Biocomplexity
NCBI Taxonomy *
Global Biodiversity Information Facility (GBIF)

Mathematical & modelling resources

BioModels Database
Kinetic Models of Biological Systems (KiMoSys)

Organism-focused resources

Eukaryotic Pathogen Database Resources (EuPathDB)
Influenza Research Database
Mouse Genome Informatics (MGI)
Rat Genome Database (RGD)
Zebrafish Model Organism Database (ZFIN)

Generalist repositories

Dryad Digital Repository
Harvard Dataverse
Open Science Framework

* Curated resource that may not accept direct submission of data. Contact the database directly for further information.