Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Awesome Public Datasets | 57,021 | 2 months ago | 126 | mit | ||||||
A topic-centric list of HQ open datasets. | ||||||||||
Finmind | 1,994 | a day ago | 129 | October 29, 2023 | 46 | apache-2.0 | Jupyter Notebook | |||
Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/ | ||||||||||
Codesearchnet | 1,994 | 2 years ago | 7 | mit | Jupyter Notebook | |||||
Datasets, tools, and benchmarks for representation learning of code. | ||||||||||
Fma | 1,773 | a year ago | 10 | mit | Jupyter Notebook | |||||
FMA: A Dataset For Music Analysis | ||||||||||
Open Data Registry | 1,255 | 2 days ago | 30 | apache-2.0 | Python | |||||
A registry of publicly available datasets on AWS | ||||||||||
Qri | 1,053 | 1 | 2 years ago | 271 | December 13, 2021 | 220 | gpl-3.0 | Go | ||
you're invited to a data party! | ||||||||||
Uhttbarcodereference | 758 | 3 months ago | 8 | |||||||
Universe-HTT barcode reference | ||||||||||
Data Juicer | 668 | an hour ago | 3 | September 28, 2023 | 16 | apache-2.0 | Python | |||
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据! | ||||||||||
Openml | 617 | 2 months ago | 362 | bsd-3-clause | PHP | |||||
Open Machine Learning | ||||||||||
Covid 19 Repo Data | 442 | a year ago | 15 | cc0-1.0 | ||||||
Data archive of identifiable COVID-19 related public projects on GitHub |
This is a list of topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. This project was incubated at OMNILab, Shanghai Jiao Tong University during Xiaming Chen's Ph.D. studies. OMNILab is now part of the BaiYuLan Open AI community. Other amazingly awesome lists can be found in sindresorhus's awesome list.
Special thanks to
NOTICE: This repo is automatically generated by apd-core. Please DO NOT modify this file directly. We have provided a new way to contribute to this repo. Join the slack community for an instant touch of HQ data updates.
Table of Contents