Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for dataset open data
dataset
x
open-data
x
43 search results found
Awesome Public Datasets
⭐
63,029
A topic-centric list of HQ open datasets.
Codesearchnet
⭐
2,298
Datasets, tools, and benchmarks for representation learning of code.
Fma
⭐
1,773
FMA: A Dataset For Music Analysis
Open Data Registry
⭐
1,271
A registry of publicly available datasets on AWS
Qri
⭐
1,053
you're invited to a data party!
Data Juicer
⭐
994
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Openml
⭐
689
Open Machine Learning
Covid 19 Repo Data
⭐
442
Data archive of identifiable COVID-19 related public projects on GitHub
Ucf Sst Citysim Dataset
⭐
283
Official github page of UCF SST CitySim Dataset
Rsocrata
⭐
227
Provides easier interaction with Socrata open data portals http://dev.socrata.com. Users can provide a 'Socrata' data set resource URL, or a 'Socrata' Open Data API (SoDA) web query, or a 'Socrata' "human-friendly" URL, returns an R data frame. Converts dates to 'POSIX' format. Manages throttling by 'Socrata'.
Pynasa
⭐
205
Who Owns What
⭐
171
Who owns what in nyc?
Adresse.data.gouv.fr
⭐
150
Le site officiel de l'Adresse
Public Datasets Pipelines
⭐
131
Cloud-native, data onboarding architecture for Google Cloud Datasets
Covid19
⭐
122
Novel Corona Virus - COVID-19 India Datasets by DataMeet
Cv Dataset
⭐
120
Metadata and versioning details for the Common Voice dataset
Covid19 Sir
⭐
118
COVID-19 SIR model estimation
Eurocrops
⭐
116
The official repository for the EuroCrops dataset.
Bimcv Covid 19
⭐
112
Valencia Region Image Bank (BIMCV) that combines data from the PadChest dataset with future datasets based on COVID-19 pathology to provide the open scientific community with data of clinical-scientific value that helps early detection of COVID-19
Crypto
⭐
104
Cryptocurrency Historical Market Data R Package
Transitland Datastore
⭐
103
Transitland v1 core components. Deprecated and only maintained occasionally. See Transitland v2.
Friendly Public Transport Format
⭐
97
A format for APIs, libraries and datasets containing and working with public transport data.
Openml R
⭐
90
R package to interface with OpenML
Sova Dataset
⭐
82
Json Stat
⭐
76
JSON-stat Toolkit version 0
Opendata
⭐
73
Datasets opened by Lithium Technologies | Klout
Od Weapondetection
⭐
62
Datasets for weapon detection based on image classification and object detection tasks
Open Data
⭐
61
Covid19 Vaccination Subnational
⭐
59
🌍💉 Global COVID-19 vaccination data at the regional level.
Soda.net
⭐
56
A Socrata Open Data API (SODA) client library for .NET
Data.world R
⭐
55
R library for data.world
Datasets
⭐
52
Datasets powering the Open Data API
Covid19ardata
⭐
51
Data COVID-19 Argentina actualizada y en formatos abiertos.
Acik Veri
⭐
37
Türkiye'nin açık veri kaynakları | Curated list of open data platforms of Turkiye
Italy
⭐
37
Free open public domain football data (football.db) for Italy / Europe - Serie A etc.
Job Titles
⭐
36
Normalized dataset of 70k job titles
Collection
⭐
33
Williams College Museum of Art (WCMA) collection data
Git Rdm
⭐
32
A research data management plugin for the Git version control system.
Digipathos
⭐
32
Brazilian Agricultural Research Corporation (EMBRAPA) fully annotated dataset for plant diseases. Plug and play installation over PiP.
Covid19 Datasets
⭐
31
A list of high quality open datasets for COVID-19 data analysis
Data Fair
⭐
31
Findable, Accessible, Interoperable and Reusable Data. A complete open-source solution for your open and private data needs. French only for the time being, internationalization coming soon.
Pyrdm
⭐
27
PyRDM is a Python-based library for research data management (RDM). It facilitates the automated publication of scientific software and associated input and output data.
Notebooks Collection Opendata
⭐
26
A set of simple notebooks using 8 TeV and 13 TeV ATLAS Open Data datasets
Restatapi
⭐
24
An R package to search and retrieve data from Eurostat database using SDMX
Awesome Datasets
⭐
23
Data Studio Connector
⭐
22
Google Data Studio connector for data.world
Describeml
⭐
22
DescribeML is a Visual Studio Code language plug-in to describe machine-learning datasets in a structured format. Build better data describing the composition, provenance and social concerns of your dataset.
Dataportals Registry
⭐
22
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
Berlin_corona_cases
⭐
20
Scraper for the official dashboard with current Corona case numbers, traffic light indicators ("Corona-Ampel") and vaccination situation for Berlin.
Coronavirus
⭐
20
2020 Poland coronavirus data (COVID-19 / 2019-nCoV)
Lod3 Road Space Models
⭐
20
LoD3 road space models in CityGML+SketchUp of the research project SAVe
Osdg Data
⭐
20
The OSDG Community Dataset (OSDG-CD) is a public dataset of thousands of text excerpts, validated by OSDG Community Platform (OSDG-CP) citizen scientists with respect to the Sustainable Development Goals (SDGs). The dataset is updated every quarter and published on Zenodo.
Econdata
⭐
19
R package containing a host of datasets useful for economic research. Complete with raw data and cleaning functions.
Ckanapi Exporter
⭐
18
Export dataset metadata from CKAN to Excel-compatible CSV
Ban Pl
⭐
18
Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service
Dw Jdbc
⭐
18
JDBC driver for data.world
Goodreadsbookdataset
⭐
18
Gathering dataset from Goodreads website
Fdnssearch
⭐
18
Swiftly search FDNS datasets from Rapid7 Open Data
Awesome Data Chile
⭐
17
Lista curada de datasets públicos sobre Chile.
Espana
⭐
16
Football data for España (Spain) incl. Primera División (La Liga), Segunda División etc.
Ukraine_twitter_data
⭐
16
Twitter data around the Ukraine Invasion in February 2022
Open Data On Github
⭐
16
Dataset files for the Open Data on GitHub paper
Transparency
⭐
15
Structured data files for topics covered by GitHub's Transparency Report
Toolkit
⭐
15
JSON-stat Javascript Toolkit version 1
Awesome Sweden Datasets
⭐
14
A curated list of awesome datasets to use when coding for the Swedish market.
Yalc
⭐
13
🕸 YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).
Open Traits Network.github.io
⭐
12
Open Traits Network Registry and Website
Data Publication
⭐
12
🔓 Open biodiversity data publication by the INBO
Publicsectornl
⭐
11
Open Source in the public sector in the Netherlands
Openfema Samples
⭐
11
Code, dataset, and analysis samples that utilize the OpenFEMA API.
Cache.soccerverse
⭐
11
Cache - Soccerverse
Czso
⭐
11
Use Open Data from the Czech Statistical Office in R
Open Data
⭐
11
Croatia open data repository (Open Data HR)
Odessa
⭐
10
Excel Add In
⭐
10
Excel add-in for data.world
Datasets
⭐
10
Automatically generated and up-to-date datasets for Cobalt.
Awesome Datatable References
⭐
8
Conjunto de dados tabulados com base nas referências cadastrais do IBGE, Dados abertos do GOV.BR, e iniciativas de curadoria de dados individuais
Sgraildata
⭐
8
Singapore Rail data
Data Portallist De
⭐
8
📚 list of all open data portals in Germany 🇩🇪
Uniwear Dataset
⭐
8
Tidy multi-material machine tool wear dataset for prognostics and health monitoring.
Open Traffic Datasets
⭐
8
open traffic datasets
Open Data Security
⭐
8
open-data-security description format is a simple JSON format to describe dataset released as open data by security researchers, security vendors or CSIRTs
Nrc Gamma
⭐
7
Large labelled dataset of real-life gas meter images — Vaste ensemble d'images réelles et étiquetées de compteurs de gaz.
Tableau Connector
⭐
7
Tableau connector for data.world
Awesome Italian Public Datasets
⭐
7
A selection of interesting Open dataset from the Italian Public Administration and Civic Data use cases
Govpack
⭐
6
Python package for easier access to Ukrainian open data
Data Pr Downloader
⭐
6
Dump of all datasets found in data.pr.gov
Hlidacr
⭐
6
Access Data from the Hlídač státu API in R
Ckanext Requestdata
⭐
6
📧 📬 CKAN extension for requesting new data 📧 📬
Awesome Vehicle Datasets
⭐
6
A topic-centric list of Vehicles datasets.
Euro
⭐
6
JSON-stat for Eurostat
Soilsamples
⭐
6
Soil Sample and Soil Profile Datasets: an Open Compilation
Fso Lod
⭐
6
Swiss Federal Statistical Office (FSO) datasets as Linked Data
Deutschland
⭐
6
Football data for Deutschland (Germany) incl. Bundesliga, 2. Bundesliga, etc.
Nasawakeupcalls.data
⭐
6
Data extracts and analysis of the Music to Wake-up by project from the NASA History Division
Inspire
⭐
5
INSPIRE harmonization projects with open data
Thermal Solar Plant Dataset
⭐
5
Realtime Thermal Solar Plant Dataset for Machine Learning
Offenes Datenportal
⭐
5
📄 Inoffizielles Datenportal für die Stadt Essen
Datasets Links Collection
⭐
5
A collection of links for various datasets
Lacuna Db
⭐
5
legal data in machine-readable form
Related Searches
Python Dataset (14,792)
Jupyter Notebook Dataset (6,824)
Deep Learning Dataset (2,364)
Machine Learning Dataset (2,279)
Dataset Pytorch (1,887)
Dataset Tensorflow (1,583)
Dataset Classification (1,516)
R Dataset (1,483)
Dataset Convolutional Neural Networks (1,264)
Dataset Paper (1,252)
1-43 of 43 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2025 Awesome Open Source. All rights reserved.