Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for database dataset
database
x
dataset
x
116 search results found
Dataset
⭐
4,623
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
Esproc
⭐
4,318
esProc SPL is a scripting language for data processing, with well-designed rich library functions and powerful syntax, which can be executed in a Java program through JDBC interface and computing independently.
Goqu
⭐
2,122
SQL builder and query library for golang
Db
⭐
1,847
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Wikisql
⭐
1,370
A large annotated semantic parsing corpus for developing natural language interfaces.
Datasets For Recommender Systems
⭐
821
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Anime Offline Database
⭐
770
Updated every week: A JSON based anime data set containing the most important meta data as well as cross references to various anime sites such as MAL, ANIDB, ANILIST, KITSU and more...
Db Gpt Hub
⭐
759
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Database Rider
⭐
585
Database testing made easy!
Game Datasets
⭐
584
🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games
World.db
⭐
573
Free open public domain world database 'n' schema for use in any (programming) language (e.g. uses plain text datasets)
Poldata
⭐
485
A dataset with political datasets
Text2sql Data
⭐
478
A collection of datasets that pair questions with SQL queries.
Ludicrousdb
⭐
467
LudicrousDB is an advanced database interface for WordPress that supports replication, failover, load balancing, & partitioning
Mquery
⭐
395
YARA malware query accelerator (web frontend)
Db To Api
⭐
356
Turns a Database into a Secure, RESTful API
Us Car Models Data
⭐
345
Introducing the most comprehensive and up-to-date open source dataset on US car models on Github. With over 15,000 entries covering car models manufactured between 1992 and 2023, this repository offers valuable information for anyone looking to incorporate car data into their applications. Best of all, it's completely free to use!
Graph Databases Use Cases
⭐
258
Example use cases from the O'Reilly Graph Databases book
Awesome Tensorlayer
⭐
212
A curated list of dedicated resources and applications
Ser Datasets
⭐
196
A collection of datasets for the purpose of emotion recognition/detection in speech.
Fakenewscorpus
⭐
184
A dataset of millions of news articles scraped from a curated list of data sources.
Dspp Keras
⭐
160
Protein order and disorder data for Keras, Tensor Flow and Edward frameworks with automated update cycle made for continuous learning applications.
Openbrewerydb
⭐
159
🍻 An open-source dataset of breweries, cideries, brewpubs, and bottleshops.
Dap
⭐
153
Data Analysis Pipeline
Azkar Db
⭐
133
Azkar Dataset مجموعة بيانات للأذكار والأدعية والرقية
Vfx Datasets
⭐
121
Jsonkv
⭐
113
Single file write-once database that is valid JSON with efficient random access on bigger datasets
Openml R
⭐
90
R package to interface with OpenML
Openlexicon
⭐
88
Access to lexical databases
Classix
⭐
85
Fast and explainable clustering in Python
Pyreports
⭐
85
pyreports is a python library that allows you to create complex report from various sources
Vietnamese Provinces Database
⭐
83
A complete SQL dataset of Vietnamese administrative units, includes Vietnamese provinces, districts and wards
Deepchange
⭐
83
official project page of the paper "DeepChange: A Long-term Person Re-identification Benchmark"
Dataengineeringpilipinas
⭐
80
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Conmask
⭐
77
ConMask model described in paper Open-world Knowledge Graph Completion.
Mmpd_rppg_dataset
⭐
75
MMPD: Multi-Domain Mobile Video Physiology Dataset(EMBC2023 Oral)
Splicer
⭐
70
Splicer - adds relation querying (SQL) to any python project
Open Mastr
⭐
69
A collaborative software to download the energy database Marktstammdatenregister (MaStR)
Document Image Binarization
⭐
69
A selectional auto-encoder approach for document image binarization
Mltoolkits
⭐
65
learningOrchestra is a distributed Machine Learning integration tool that facilitates and streamlines iterative processes in a Data Science project.
Scene Text Removal
⭐
63
EnsNet: Ensconce Text in the Wild
Biomart
⭐
57
Python biomart API
Quandl Ruby
⭐
56
Sqlitehelper
⭐
55
🗄 This project comes in handy when you want to write a sql statement easily and smarter.
Crema
⭐
53
Meta data server & client tools for game development
Geodatasets
⭐
51
Synthetic datasets for geoscience (geo)statistical modeling
Ved Explore
⭐
51
Exploration of the Vehicle Energy Dataset
Recommender System Datasets
⭐
50
A list of compatible datasets, noting other major repositories containing popular real-world datasets, along with sample code for a range of recommendation tasks.
Awesome Georgian Datasets
⭐
46
Useful datasets, specific to Georgia
Smart Vocoder
⭐
43
Airports
⭐
42
A complete list of IATA Airports including IATA code, ICAO code, Time zone, name, city code, two-letter ISO country code, URL, elevation above sea level in feet, coordinates in decimal degrees, geo encoded city, county and state.
Rdataretriever
⭐
40
R interface to the Data Retriever
Active Orient
⭐
39
Pure Ruby interface to OrientDB
Fifa Fut Data
⭐
39
Web-scraping script that writes the data of all players from FutHead and FutBin to a CSV file or a DB
Agnostos Wf
⭐
39
Database Web Api
⭐
37
Dynamically generate RESTful APIs from the contents of a database table. Provides JSON, XML, and HTML. Supports most popular databases
Postgap
⭐
36
Linking GWAS studies to genes through cis-regulatory datasets
Datasets
⭐
36
A bunch of some 200 datasets. You can call it mini-kaggle :)
Codexmicroorm
⭐
36
An alternative to ORM's such as Entity Framework, offers light-weight database mapping to your existing CLR objects. Visit "Design Goals" on GitHub to see more rationale and guidance.
Wdpar
⭐
35
Interface to the World Database on Protected Areas
Py Gtfs Mysql
⭐
35
Python scripts to import a GTFS dataset into a basic MySQL database.
Estimators
⭐
35
Machine Learning Versioning made Simple
Postgis Baselayers
⭐
35
Web application to download and import popular vector datasets (Natural Earth, GADM, Geonames, etc) into a PostGIS database with the click of a button.
Myanimelist Data Set Creator
⭐
35
Collection of some simple python scripts to create https://myanimelist.net/ anime and user data set.
Dataset
⭐
34
Data set is PHP package for importing & exporting data within CSV & Database with data manipulation
Bothub
⭐
33
Bothub is an open platform for predicting, training and sharing NLP datasets in multiple languages
Trough
⭐
33
Trough: Big data, small databases.
Sequel Combine
⭐
32
The Sequel extension adds the Sequel::Dataset#combine method, which returns object from database composed with childrens, parents or any object where exists any relationship. Now it is possible in one query!
Rdmp
⭐
32
Research Data Management Platform (RDMP) is an open source application for the loading,linking,anonymisation and extraction of datasets stored in relational databases.
Mlcomp
⭐
31
Website for standardized execution and evaluation of algorithms on datasets.
Covid19 Italy Integrated Surveillance Data
⭐
30
COVID-19 integrated surveillance data provided by the Italian Institute of Health and processed via UnrollingAverages.jl to deconvolve the weekly moving averages.
Guthriesolv
⭐
29
Experimental small molecule hydration free energy dataset
Hrtf Individualization
⭐
28
Head-related Transfer Function Customization Process through Slider using PCA and SH in Matlab
Deep_learning_projects
⭐
28
Cnn Filter Db
⭐
28
A database of over 1.4 billion 3x3 convolution filters extracted from hundreds of diverse CNN models with relevant meta information (CVPR 2022 ORAL)
Imdb
⭐
28
My own IMDb dataset importer - loads into a Marten DB document store.
Docker Dataset
⭐
27
Docker database images with pre-populated data for testing and/or practice.
Data Catalog
⭐
27
The NYU Data Catalog facilitates researchers’ access to large datasets available either publicly or through institutional or individual licensing. It also includes descriptions of internally-generated research datasets from NYU researchers.
Sql Dataset
⭐
26
Run SQL queries and send the results to Geckoboard Datasets
Levar
⭐
25
Machine learning evaluation database
Opencompare
⭐
25
Immunespacer
⭐
25
An R Interface to the ImmuneSpace database portal
Entityjustworks
⭐
24
Data first or code first ORM. Entity/object/class/poco to SQL repository mapping, and vice versa; data to C# class code or emit assembly. Can dynamically generate assemblies, C# code, and SQL scripts for CREATE TABLE, INSERT, & UPDATE. Uses: Reflection, Emit, DataTable and CodeDOM.
Datagrabber
⭐
24
A query tool
Movielens.sql
⭐
24
The MovieLens database in SQL
Restatapi
⭐
24
An R package to search and retrieve data from Eurostat database using SDMX
Cosore
⭐
22
Data, metadata, and software tools for the COSORE database of continuous soil respiration measurements
Dafter
⭐
21
📥 Command-line downloader for public datasets
Scut_foru_db_release
⭐
20
Flickr OCR Universal Database (SCUT_FORU_DB_Release)
Predicting Refactoring Ml
⭐
20
Refactoring recommendation via ML
Phash Graph Mvp
⭐
20
Graph a MVP tree from pHash
Aiwatch
⭐
19
Website to track people, organizations, and products (tools, websites, etc.) in AI safety
Inloc_dataset
⭐
19
Sqlserver
⭐
19
Aprender scripts de consulta e manipulação de dados no SQL Server
Db_text_minimal
⭐
19
[WIP] A Pytorch implementation of DB-Text - Real-time Scene Text Detection with Differentiable Binarization
Geocoder Nlp
⭐
18
Geocoder library based on libpostal normalization of libosmscout generated database
Datasets
⭐
16
Datasets is a Java library for conveniently working with machine learning datasets.
Dns Lots Of Lookups
⭐
16
dnslol is a command line tool for performing lots of DNS lookups.
Data Multi Subject
⭐
16
Multi-subject data for the Spine Generic project
Data Engineering Salaries
⭐
16
A Streamlit app to explore data engineering salary data.
Related Searches
Command Line Database (33,932)
Python Dataset (14,792)
Javascript Database (9,210)
Python Database (7,277)
Jupyter Notebook Dataset (6,824)
Database Mysql (6,245)
Php Database (5,990)
Database Postgresql (5,359)
Java Database (4,190)
Database Sql (3,816)
1-100 of 116 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.