Open data repositories

It’s no secret that AI is driven by large datasets, especially in Life Sciences there is a great need for such repositories. Here I’d like to sum up just a few I know of. Let me know if you know more. Most of these allow researcher depositing their data to reference it through a DOI. Furthermore they have a long-term data preservation plan:
https://dataverse.harvard.edu/
http://datadryad.org/
https://zenodo.org/
https://www.eudat.eu/

https://www.mendeley.com/datasets

https://www.ebi.ac.uk/pdbe/emdb/empiar/
https://idr.openmicroscopy.org/
https://data.broadinstitute.org/bbbc/
https://www.ebi.ac.uk/biostudies/

Wow, this is amazing, Martin! Thanks! And indeed we should not forget the datasets aggregators like https://datasetsearch.research.google.com/ or https://www.kaggle.com/

Bringing several datasets from diverse source can challenge or improve generalisation of our models!

1 Like