Awesome Open Source
Search results for sql data science
132 search results found
Modin
⭐
9,275
Modin: Scale your Pandas workflows by changing a single line of code
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Mage Ai
⭐
6,324
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Sql Translator
⭐
3,842
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
Evidence
⭐
2,776
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown.
Data Science Best Resources
⭐
2,718
Carefully curated resource links for data science in one place
Data Diff
⭐
2,707
Compare tables within or across databases
Quadratic
⭐
2,485
Quadratic | Data Science Spreadsheet with Python & SQL
Data Science Roadmap
⭐
2,445
Data Science Roadmap from A to Z
Data Science Question Answer
⭐
2,389
A repo for data science related questions and answers
Chdb
⭐
2,237
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Awesome Business Intelligence
⭐
1,862
Actively curated list of awesome BI tools. PRs welcome!
Bayeslite
⭐
828
BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
Curriculum
⭐
762
👩🏫 👨🏫 The open-source curriculum of Enki!
Ipython Dashboard
⭐
635
A standalone, lightweight web server for building and sharing graphs created in IPython. Built for data scientists and data analysts. It aims to provide interactive visualization, collaborative dashboards, and real-time streaming graphs.
Preql
⭐
612
An interpreted relational query language that compiles to SQL.
Krangl
⭐
559
krangl is a {K}otlin DSL for data w{rangl}ing
Data_sci_guide
⭐
519
A community-sourced data science repo.
Machine_learning_and_deep_learning
⭐
492
Versatile Data Kit
⭐
389
One framework to develop, deploy and operate data workflows with Python and SQL.
Dataframe Js
⭐
383
A JavaScript library providing a new data structure for data scientists and developers
Mais
⭐
375
⚙️ Maintenance code for the data lake (metadata and access packages) | 📖 Docs: https://basedosdados.github.io/mais/
Tellery
⭐
350
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Astro Sdk
⭐
303
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Best Data Science Resources
⭐
282
This repository contains the best Data Science free hand-picked resources to equip you with all the industry-driven skills and interview preparation kit.
My Data Competition Experience
⭐
271
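A summary of my experience from multiple Top 5 finishes in machine learning and big data competitions. Packed with practical material; help yourself.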
Data Science
⭐
269
Projects and awesome list for all Data Science fields
Jupysql
⭐
261
Better SQL in Jupyter. 📊
Rasgoql
⭐
258
Write python locally, execute SQL in your data warehouse
Snowpark Python
⭐
215
Snowflake Snowpark Python API
Data Analyst Roadmap
⭐
170
My journey into data analytics, shared as part of Ken Jee's #66daysofdata challenge
Web Database Analytics
⭐
144
Web scraping and related analytics using Python tools
Data Science Learning Material
⭐
139
These are GitHub repositories for all the data science material I consider important. I plan to update the list daily.
Data_analysis_portfolio
⭐
120
This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.
Sayn
⭐
117
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Data Science Case Studies
⭐
115
Data Science Case Studies for computer science students.
Books
⭐
106
Books related to AI/ML/DL/GENAI
Patterns Devkit
⭐
101
Data pipelines from re-usable components
Data Validator
⭐
90
A tool to validate data, built around Apache Spark.
Dataengineeringpilipinas
⭐
80
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Flatiron School Data Science Curriculum Resources
⭐
75
Lesson material on data science and machine learning topics/concepts
Ds Student Resources
⭐
74
Data Science Student Companion Notebooks and Data Lake
Portfolio Guide
⭐
72
A guide and summary to my projects and case studies.
Semester Biology
⭐
70
Forkable teaching materials for course on working with data in R
Ineuron Full Stack Data Science Assignments
⭐
68
This Repository consists of Assignments and projects of the iNeuron Full Stack Data Science Course
Beneath
⭐
64
Beneath is a serverless real-time data platform ⚡️
Awesome Data Science Resources
⭐
51
Resources about data science, machine learning, deep learning, data engineering, and SQL.
Mit 15 003 Data Science Tools
⭐
50
Study guides for MIT's 15.003 Data Science Tools
Dqlab
⭐
46
This is a repository for storing and sharing data resulting from working on projects and materials in DQLab
Coursera_ibm Data Analyst Professional Certificate_op
⭐
44
Quizzes & Assignment Solutions for IBM Data Analyst Professional Certificate on Coursera. Also includes a few resources on the side that I found helpful.
Work At Olist Data
⭐
41
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Analyticswithanand
⭐
40
This repository contains all the code, slides, projects, and interview questions that I have used in my live classes on YouTube, along with other relevant documents and assignments related to the course.
Ibmdataengineeringcoursera
⭐
40
IBM Data Engineering Courses from Coursera
Tablite
⭐
36
Multiprocessing-enabled, out-of-memory data analysis library for tabular data.
Cousera_google Data Analytics Professional Certificate
⭐
34
Quizzes & Assignment Solutions for Google Data Analytics Professional Certificate on Coursera. Also includes a few resources on the side that I found helpful.
Data Analytics Services
⭐
33
This repo collects the open-source work of the Analytics Service within NHS Digital Data Services
Ides
⭐
32
Intelligent Data Exploration Service (IDES), a one-stop Data + AI data solution!
Statistics For Data Science Using Python
⭐
32
Sharing the solved Exercises & Project of Statistics for Data Science using Python course on Coursera by Ankit Gupta
Blast
⭐
31
Blast is a data orchestration tool that can run SQL and Python against Google BigQuery and Snowflake. It supports templating with Jinja, data quality tests, query validation, environment management and more.
Teaching Notes
⭐
29
Notes from courses and workshops I've taught or assisted with at UC Davis and UC Berkeley.
Pandas Sqlalchemy Tutorial
⭐
29
🐼 💻 Load or insert data into a SQL database using Pandas DataFrames.
Hdk
⭐
28
A low-level execution library for analytic data processing.
Skillshare Data Science
⭐
27
Skillshare Data Science and Business Analytics in Python
Data_science
⭐
27
8 Week Sql Challenge
⭐
27
#8WeekSQLChallenge by Danny Ma.
Data Science For Architecture
⭐
24
Repository for data science study for architecture, engineering and construction (AEC)
Cl Duckdb
⭐
23
Common Lisp CFFI wrapper around the DuckDB C API
Workshop_intro_to_sql
⭐
23
Reader for the Intro to SQL workshop series.
Springboard Data Science Immersive
⭐
23
Heavyai.jl
⭐
22
Julia client for OmniSci GPU-accelerated SQL engine and analytics platform
Csvz
⭐
21
The hot new standard in open databases
Awesome Prestosql
⭐
19
A list of Presto/Trino resources
Sqlserver
⭐
19
Learn data query and manipulation scripts in SQL Server
Data Science Ebooks
⭐
19
Data Science E-books, Interview Resources and Cheat-sheets
Datacamp Courses Megacollection
⭐
18
70+ DataCamp course notes, projects, code, and exercises on Python, R, and SQL, with full DS & ML certification.
Road To Data Science In 50 Days
⭐
18
Datademo
⭐
17
Provides datasets and shared examples.
Chdb Server
⭐
17
API Server for chDB, an in-process SQL OLAP Engine powered by ClickHouse
Karpov_courses
⭐
16
🐳 Project work. Lectures, practical assignments, and projects from karpov_courses are stored here. Link: https://karpov.courses/
Computing With Data
⭐
15
Code samples for my book "Computing with Data: An Introduction to the Data Industry"
Awesome Data Science
⭐
15
Data science and programming resources for daily work
Data Scientist In Python
⭐
15
This repository contains notes and projects of Data scientist track from dataquest course work.
Cheatsheets For Ai
⭐
14
Cheatsheets on numerous topics ranging from DataScience | ML | DL | AI | Big Data.
Azure Sql
⭐
14
Learn data query and manipulation scripts in Azure SQL
Udacity Programming For Data Science With Python Nanodegree
⭐
13
This repository contains all the code for the Udacity Programming for Data Science course
Odsc Sql For Data Science
⭐
13
SQL for Data Science Workshop at ODSC
Google Data Analytics Professional Certificate
⭐
13
Google Data Analytics Professional Certificate on Coursera. (Grade Achieved: 10000%)
Ride
⭐
13
A nice R development and analytics environment, for the Renjin JVM implementation of R
Labs
⭐
12
This repository contains materials for the lab sessions of the Introduction to Data Science course at the Hertie School in Berlin, fall semester 2021, taught by Lisa Oswald and Tom Arend.
Trainity_data_analytics_trainee
⭐
12
This repository has all the PDFs along with the data analysis portfolio and data analytics certificates from Trainity. Use the link below to enroll in the Data Analytics internship from Trainity.
Data Portfolio
⭐
11
📊 ⚙️ My professional data analysis portfolio. Check out my works by clicking the link.
Featuretools_sql
⭐
11
Automated creation of EntitySets from relational data stored in SQL databases
Join
⭐
11
SQL-style joins for Python iterables
Welcome_to_blazingsql_notebooks
⭐
11
RAPIDS data science. No setup required.
8 Week Sql Challenge
⭐
11
Solutions for #8WeekSQLChallenge using SQL Server.
Various Data Science Scripts
⭐
10
A collection of coding scripts, notes, and mini-projects with reference to a series of Data Science, Web Development, programming concepts and foundations, and miscellaneous tech topics.
Workshops
⭐
10
CartoCamp Workshops
Ob_pysh Db
⭐
10
pysh-db - The Data Science Toolkit (DSK)
Data Science Toolbox Bootcamp
⭐
10
A 4 week program to get started with Data Science. Useful for beginners who want to get started by themselves.
Data Pipeline With Dbt Using Airflow On Gcp
⭐
10
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data into BigQuery. The project uses tools such as Astro, dbt, GCP, Airflow, and Metabase.
Related Searches
Python Data Science (6,905)
Database Sql (5,501)
Machine Learning Data Science (5,390)
Python Sql (3,922)
Jupyter Notebook Data Science (3,734)
Mysql Sql (2,867)
Java Sql (2,781)
Javascript Sql (2,662)
C Sharp Sql (2,429)
Postgresql Sql (2,411)
1-100 of 132 search results
Copyright 2018-2025 Awesome Open Source. All rights reserved.