Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for database data science
data-science
x
database
x
95 search results found
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Tad
⭐
2,939
A desktop application for viewing and analyzing tabular data
Data Science Best Resources
⭐
2,718
Carefully curated resource links for data science in one place
Data Diff
⭐
2,707
Compare tables within or across databases
Chdb
⭐
2,237
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Awesome Business Intelligence
⭐
1,862
Actively curated list of awesome BI tools. PRs welcome!
Arcticdb
⭐
1,614
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
Supabase Py
⭐
1,246
Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.
Vectordb
⭐
829
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Bayeslite
⭐
828
BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
Datasets For Recommender Systems
⭐
821
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Moviegeek
⭐
730
A django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
Preql
⭐
612
An interpreted relational query language that compiles to SQL.
Turbodbc
⭐
596
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.
Datacleaner
⭐
557
The premier open source Data Quality solution
Lantern
⭐
530
PostgreSQL vector database extension for building AI applications
Holoclean
⭐
485
A Machine Learning System for Data Enrichment.
Versatile Data Kit
⭐
389
One framework to develop, deploy and operate data workflows with Python and SQL.
Workshops
⭐
362
Workshops organized to introduce students to security, AI, blockchain, AR/VR, hardware and software
Tellery
⭐
350
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Elastic
⭐
242
R client for the Elasticsearch HTTP API
Pydbgen
⭐
199
Random dataframe and database table generator
Metaflow Service
⭐
166
🚀 Metadata tracking and UI service for Metaflow!
Tennis Crystal Ball
⭐
150
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Jazz
⭐
145
The Scripting Engine that Combines Speed, Safety, and Simplicity
Web Database Analytics
⭐
144
Web scrapping and related analytics using Python tools
Oxen
⭐
118
Oxen.ai's core rust library, server, and CLI
Polygondb
⭐
114
PolygonDB is an alternative to MongoDB that provides a developer-friendly experience and less resources hungry.
Teach Data Science Ucla Master Appl Stats
⭐
108
Materials for STATS 418 - Tools in Data Science course taught in the Master of Applied Statistics at UCLA
Books
⭐
106
Books related to AI/ML/DL/GENAI
Cloud Clubs Learner Library
⭐
104
A library for learners! Whether or not you're a part of AWS Cloud Clubs, take a look in this library for free, open, leveled content for students 18+ worldwide
Terpene Profile Parser For Cannabis Strains
⭐
93
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Openml R
⭐
90
R package to interface with OpenML
Classix
⭐
85
Fast and explainable clustering in Python
Dataengineeringpilipinas
⭐
80
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Pyexasol
⭐
72
Exasol Python driver with low overhead, fast HTTP transport and compression
Linkedingiveaway
⭐
70
👨🏽🏫You can learn about anything over here. What Giveaways I do and why it's important in today's modern world. Are you interested in Giveaway's?🔋
Data Wrangling With Python
⭐
66
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Highlightedcs
⭐
60
Popular computer science books (PDF) with highlighting :) add yours now
Docker Kafka Alpine
⭐
59
Alpine Linux based Kafka Docker Image
Estudados
⭐
57
Banco de Dados para Estudo
Python
⭐
56
this resporatory have ml,ai,nlp,data science etc.python language related material from many websites eg. datacamp,geeksforgeeks,linkedin,youtube,udemy etc. also it include programming challange/competion solutions
Recommender System Datasets
⭐
50
A list of compatible datasets, noting other major repositories containing popular real-world datasets, along with sample code for a range of recommendation tasks.
Wallet Tracker
⭐
42
Detect real scammers with Wallet-Tracker CLI from anywhere.
Rdataretriever
⭐
40
R interface to the Data Retriever
Pygm
⭐
39
🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage
Malicious Urlv5
⭐
38
A multi-layered and multi-tiered Machine Learning security solution, it supports always on detection system, Django REST framework used, equipped with a web-browser extension that uses a REST API call.
Datasets
⭐
36
A bunch of some 200 datasets. You can call it mini-kaggle :)
Pandas Sqlalchemy Tutorial
⭐
29
🐼 💻 Load or insert data into a SQL database using Pandas DataFrames.
Veri_cekme
⭐
24
Beautifulsoup and Selenium
Frames Beam
⭐
24
Accessing Postgres in a data frame in Haskell
Events
⭐
23
Materials related to events I might attend, and to talks I am giving
Easy Data Processing Library
⭐
23
An extreme easy python local data base
Heavyai.jl
⭐
22
Julia client for OmniSci GPU-accelerated SQL engine and analytics platform
Core
⭐
21
ITF Market Analysis and Signal Services
Python Twitch Chatbot
⭐
21
A custom, 100% Python Twitch Chatbot that stores chat/viewership data in a PostgreSQL database.
Postgis
⭐
20
Spatial Data Management with PostgreSQL and PostGIS https://gishub.org/sdm
Machine_learning_in_python
⭐
19
Demo of basic machine learning models in python with Jupter Notebook
Awesome Prestosql
⭐
19
A list of Presto/Trino resources
Datatonic
⭐
19
🌟DataTonic : A Data-Capable AGI-style Agent Builder of Agents , that creates swarms , runs commands and securely processes and creates datasets, databases, visualisations, and analyses.
Sqlserver
⭐
19
Aprender scripts de consulta e manipulação de dados no SQL Server
Road To Data Science In 50 Days
⭐
18
Rivery_cli
⭐
17
Rivery CLI
Ethereumdb
⭐
17
Dbsim
⭐
16
The codebase for DBSim
Lazlodb
⭐
16
Lazlo DB
Machine Learning And Data Processing
⭐
15
A collection of resources on machine learning, data processing and related areas
Computing With Data
⭐
15
Code samples for my book "Computing with Data: An Introduction to the Data Industry"
Py4 Ds
⭐
15
🐍 Data Science Boot-Camp : UC San DiegoX
Mindbase
⭐
15
A database for convergent intersubjectivity
Datasciencecurriculum
⭐
14
A curated list of free courses from reputable universities that meet the requirements of a Data Science undergraduate curriculum, minus general education. With projects, support materials in an organized structure.
Azure Sql
⭐
14
Aprender scripts de consulta e manipulação de dados no Azure SQL
Odsc Sql For Data Science
⭐
13
SQL for Data Science Workshop at ODSC
Monggregate
⭐
12
Library to make MongoDB aggregation framework and pipelines easy to use in python.
Tableio.jl
⭐
12
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
Data Pipeline With Dbt Using Airflow On Gcp
⭐
10
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to Bigquery. There are different tools that have been used in this project such as Astro, DBT, GCP, Airflow, Metabase.
Recommender System In Postgresql Using Python
⭐
10
Using python to build a movie recommendation engine inside postgresql database
Various Data Science Scripts
⭐
10
A collection of coding scripts, notes, and mini-projects with reference to a series of Data Science, Web Development, programming concepts and foundations, and miscellaneous tech topics.
Open Russian Data
⭐
10
Подборка ресурсов открытых данных, ориентированная на использование в странах СНГ, или если вы делаете продукт и исследование про страны СНГ. Данные разбиты по темам и подходят как для дата-журналистики, так и для Data Science.
Hust Projects
⭐
9
My labs in college of CS and some interesting projects at HUST.
Generalledger.jl
⭐
9
ML for GL - A complete Julialang ERP Data Science framework for Finance, Supply chain accounting
Chapter 3
⭐
9
Code examples for Chapter 3 of Data Wrangling with JavaScript
Magicbox Maps
⭐
9
Map mobility data in a NodeJS + React front-end application with data served by magicbox-open-api
Sql_resources
⭐
8
A summary of SQL exercises
Linwin Db Server
⭐
8
在广袤无垠的现代大数据海洋之中,计算机深度的和信息以及数据绑定,承载这亿万数据的就是数据库软件。 Linwin Data Server,基于Java开发的国产高性能数据库软件。支持国产和Linux操作系统,支持多用户操作。 用户数据的增删改查全部在内存内操作,与硬盘的交互写入读取交由专门的线程管理,无不妨碍.
Hfcommunity
⭐
8
HFCommunity offers an offline up-to-date relational database built from the data available at the Hugging Face Hub, providing queriable data about the repositories hosted in the Hub
Data Science Jumpstart With 10 Projects Course
⭐
7
Data Science Jumpstart with 10 Projects Course
Octopus
⭐
7
R Package for Interacting with Databases
Data Science With Python
⭐
7
😎 Data Science with Python from Scratch
Common_datasets
⭐
7
Common-datasets is a GitHub repository dedicated to providing a wide collection of common datasets for practicing and learning data science and machine learning.
Textdirectory
⭐
7
TextDirectory allows you to filter, transform, and combine multiple text files into one aggregated file.
Artificial Intelligence Research And Development Projects
⭐
7
The field of Artificial Intelligence (AI) is a frontier of computer science that focuses on creating systems capable of performing tasks that would typically require human intelligence. This encompasses a wide range of capabilities such as visual perception, speech recognition, decision-making, and language translation.
Full Stack Data Science
⭐
7
Full stack Data science : How to become a data scientist
Data Science With R
⭐
6
Data Science | Machine Learning | Data Analysis
Magist Algorithm
⭐
6
Multi-Agent Generally Intelligent Simultaneous Training Algorithm for Project Zeta
Chromosomedna
⭐
6
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
Triadb
⭐
6
Self-Service Data Management & Interactive Visual Analytics Development Framework
Stapy
⭐
6
An easy to use SensorThings API Client written in Python
Sql Examples
⭐
5
sql examples (sqlite)
Ragger
⭐
5
Identifies highly upvoted removed comments and posts on reddit by aggregating historical data. Results are displayed on Reveddit's subreddit top pages.
Related Searches
Command Line Database (33,932)
Javascript Database (9,210)
Python Database (7,277)
Python Data Science (6,993)
Database Mysql (6,245)
Php Database (5,990)
Java Database (5,934)
Database Sql (5,557)
Machine Learning Data Science (5,390)
Database Postgresql (5,359)
1-95 of 95 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2025 Awesome Open Source. All rights reserved.