Awesome Open Source
Awesome Open Source
Combined Topics
dataframe
x
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 52 Dataframe Open Source Projects
Categories
>
Control Flow
>
Dataframe
Vaex
⭐
5,662
Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Modin
⭐
5,646
Modin: Speed up your Pandas workflows by changing a single line of code
Smile
⭐
5,127
Statistical Machine Intelligence & Learning Engine
Koalas
⭐
2,591
Koalas: pandas API on Apache Spark
Tablesaw
⭐
2,449
Java dataframe and visualization library
Mars
⭐
1,997
Mars is a tensor-based unified framework for large-scale data computation which scales Numpy, pandas, Scikit-learn and Python functions.
Pandasgui
⭐
1,987
A GUI for Pandas DataFrames
Ballista
⭐
1,633
Distributed compute platform implemented in Rust, using Apache Arrow memory model.
Danfojs
⭐
1,219
danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Mobius
⭐
927
C# and F# language binding and extensions to Apache Spark
Pandas Ta
⭐
771
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 120+ Indicators
Dataframe
⭐
769
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Spark Redis
⭐
756
A connector for Spark that allows reading and writing to/from Redis cluster
Polars
⭐
631
Rust DataFrame library
Pyjanitor
⭐
618
Clean APIs for data cleaning. Python implementation of R package Janitor
Datafusion
⭐
598
DataFusion has now been donated to the Apache Arrow project
Datasheets
⭐
591
Read data from, write data to, and modify the formatting of Google Sheets
Pdpipe
⭐
576
Easy pipelines for pandas DataFrames.
Spark Daria
⭐
529
Essential Spark extensions and helper methods ✨😲
Sequoia
⭐
515
A股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
Dataframe Go
⭐
427
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Pandastable
⭐
365
Table analysis in Tkinter using pandas DataFrames.
Dataframe Js
⭐
364
A javascript library providing a new data structure for datascientists and developpers
Awesome Cybersecurity Datasets
⭐
342
A curated list of amazingly awesome Cybersecurity datasets
Pandasvault
⭐
311
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Optopsy
⭐
309
A nimble options backtesting library for Python
Pystore
⭐
304
Fast data store for Pandas time-series data
Sparkflow
⭐
279
Easy to use library to bring Tensorflow on Apache Spark
Qframe
⭐
277
Immutable data frame for Go
Arquero
⭐
272
Query processing and transformation of array-backed data tables.
Rust Dataframe
⭐
264
A Rust DataFrame implementation, built on Apache Arrow
Nimdata
⭐
249
DataFrame API written in Nim, enabling fast out-of-core data processing
Styleframe
⭐
248
A library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Eland
⭐
226
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Datatable
⭐
213
A go in-memory table
Static Frame
⭐
206
The StaticFrame library defines the Series and Frame, immutable data structures for one- and two-dimensional calculations with self-aligning, labelled axes.
Morpheus Core
⭐
199
The foundational library of the Morpheus data science framework
Tech.ml.dataset
⭐
191
A Clojure high performance data processing system
Inspectdf
⭐
186
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Technical
⭐
182
Different indicators developed or collected for the Freqtrade
Peroxide
⭐
169
Rust numeric library with R, MATLAB & Python syntax
Panthera
⭐
166
Data-frames & arrays on Clojure
Ditching Excel For Python
⭐
163
Functionalities in Excel translated to Python
Spark With Python
⭐
140
Fundamentals of Spark with Python (using PySpark), code examples
Design Of Experiment Python
⭐
138
Design-of-experiment (DOE) generator for science, engineering, and statistics
Geni
⭐
136
A Clojure dataframe library that runs on Spark
Pandahouse
⭐
118
Pandas interface for Clickhouse database
Jardin
⭐
80
A pandas.DataFrame-based ORM.
Dframcy
⭐
67
Dataframe Integration with spaCy.
Net.jgp.labs.spark
⭐
55
Apache Spark examples exclusively in Java
Boltzmannclean
⭐
23
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Foxcross
⭐
18
AsyncIO serving for data science models
1-52 of 52 projects
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210