Awesome Public Datasets

A topic-centric list of HQ open datasets.
Alternatives To Awesome Public Datasets
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Awesome Public Datasets57,021
2 months ago126mit
A topic-centric list of HQ open datasets.
Finmind1,994
a day ago129October 29, 202346apache-2.0Jupyter Notebook
Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/
Codesearchnet1,994
2 years ago7mitJupyter Notebook
Datasets, tools, and benchmarks for representation learning of code.
Fma1,773
a year ago10mitJupyter Notebook
FMA: A Dataset For Music Analysis
Open Data Registry1,255
2 days ago30apache-2.0Python
A registry of publicly available datasets on AWS
Qri1,05312 years ago271December 13, 2021220gpl-3.0Go
you're invited to a data party!
Uhttbarcodereference758
3 months ago8
Universe-HTT barcode reference
Data Juicer668
an hour ago3September 28, 202316apache-2.0Python
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Openml617
2 months ago362bsd-3-clausePHP
Open Machine Learning
Covid 19 Repo Data442
a year ago15cc0-1.0
Data archive of identifiable COVID-19 related public projects on GitHub
Alternatives To Awesome Public Datasets
Select To Compare


Alternative Project Comparisons
Readme

Awesome Public Datasets

Awesome

This is a list of topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. This project was incubated at OMNILab, Shanghai Jiao Tong University during Xiaming Chen's Ph.D. studies. OMNILab is now part of the BaiYuLan Open AI community. Other amazingly awesome lists can be found in sindresorhus's awesome list.

Special thanks to

BaiYuLanAI

NOTICE: This repo is automatically generated by apd-core. Please DO NOT modify this file directly. We have provided a new way to contribute to this repo. Join the slack community for an instant touch of HQ data updates.

  • OK_ICON I am well.
  • FIXME_ICON Please fix me.

Agriculture

Architecture

Biology

Chemistry

Climate+Weather

ComplexNetworks

ComputerNetworks

CyberSecurity

DataChallenges

EarthScience

Economics

Education

Energy

Entertainment

Finance

GIS

Government

Healthcare

ImageProcessing

MachineLearning

Museums

NaturalLanguage

Neuroscience

Physics

ProstateCancer

Psychology+Cognition

PublicDomains

SearchEngines

SocialNetworks

SocialSciences

Software

Sports

TimeSeries

Transportation

eSports

Complementary Collections

Popular Dataset Projects
Popular Open Data Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Dataset
Open Data