Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for dataframe
dataframe
x
1,064 search results found
Polars
⭐
24,900
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Modin
⭐
9,275
Modin: Scale your Pandas workflows by changing a single line of code
Pygwalker
⭐
8,698
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Vaex
⭐
8,161
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Cudf
⭐
6,936
cuDF - GPU DataFrame Library
Smile
⭐
5,833
Statistical Machine Intelligence & Learning Engine
Datasciencepython
⭐
4,776
common data analysis and machine learning tasks using python
Arrow Datafusion
⭐
4,514
Apache Arrow DataFusion SQL Query Engine
Danfojs
⭐
4,416
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Pandas Ta
⭐
4,337
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Mimesis
⭐
4,298
Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
Tablesaw
⭐
3,328
Java dataframe and visualization library
Koalas
⭐
3,291
Koalas: pandas API on Apache Spark
Pandasgui
⭐
3,130
A GUI for Pandas DataFrames
Pandera
⭐
2,807
A light-weight, flexible, and expressive statistical data testing library
Pandas Datareader
⭐
2,733
Extract data from a wide range of Internet sources into a pandas DataFrame.
Sklearn Pandas
⭐
2,724
Pandas integration with sklearn
Sweetviz
⭐
2,687
Visualize and compare datasets, target values and associations, with one line of code.
Mars
⭐
2,664
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Swifter
⭐
2,407
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Ballista
⭐
2,244
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Dataframe
⭐
2,129
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage
Sketch
⭐
2,106
AI code-writing assistant that understands data content
Vincent
⭐
2,054
A Python to Vega translator
Tabula Py
⭐
1,986
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
100 Pandas Puzzles
⭐
1,977
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Tv
⭐
1,956
📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.
Pandas Videos
⭐
1,808
Jupyter notebook and datasets from the pandas Q&A video series
Dask Tutorial
⭐
1,802
Dask tutorial
Tiledb
⭐
1,700
The Universal Storage Engine
Marketstore
⭐
1,687
DataFrame Server for Financial Timeseries Data
Finta
⭐
1,677
Common financial technical indicators implemented in Pandas.
Connector X
⭐
1,668
Fastest library to load data from DB to DataFrames in Rust and Python
Dataframes.jl
⭐
1,663
In-memory tabular data in Julia
Autoviz
⭐
1,550
Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
D3py
⭐
1,349
a plottling library for python, based on D3
Sequoia
⭐
1,349
A股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
Quandl Python
⭐
1,309
Pyjanitor
⭐
1,282
Clean APIs for data cleaning. Python implementation of R package Janitor
Hamilton
⭐
1,272
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Statistical Analysis Python Tutorial
⭐
1,233
Statistical Data Analysis in Python
Arquero
⭐
1,125
Query processing and transformation of array-backed data tables.
Arrow Ballista
⭐
1,111
Apache Arrow Ballista Distributed Query Engine
Arcticdb
⭐
1,071
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
Siuba
⭐
1,052
Python library for using dplyr like syntax with pandas and SQL
Kotlin Jupyter
⭐
1,018
Kotlin kernel for Jupyter/IPython
Kangas
⭐
1,015
🦘 Explore multimedia datasets at scale
Daft
⭐
1,012
Distributed DataFrame for Python designed for the cloud, powered by Rust
Graphframes
⭐
944
Mobius
⭐
937
C# and F# language binding and extensions to Apache Spark
Fast Pandas
⭐
928
Benchmark for different operations in pandas against various dataframe sizes.
Spark Redis
⭐
926
A connector for Spark that allows reading and writing to/from Redis cluster
Explorer
⭐
915
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
Ppscore
⭐
893
Predictive Power Score (PPS) in Python
Hvplot
⭐
883
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
Hamilton
⭐
877
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Optopsy
⭐
832
A nimble options backtesting library for Python
Jqdatasdk
⭐
811
简单易用的量化金融数据包(easy utility for getting financial market data of China)
Pandashells
⭐
786
🐼 Bringing the python data stack to the shell prompt
Pointblank
⭐
785
Data quality assessment and metadata reporting for data frames and database tables
Pyspark Examples
⭐
778
Pyspark RDD, DataFrame and Dataset Examples in Python language
Awesome Cybersecurity Datasets
⭐
765
A curated list of amazingly awesome Cybersecurity datasets
Spark Daria
⭐
738
Essential Spark extensions and helper methods ✨😲
Dfply
⭐
734
dplyr-style piping operations for pandas dataframes
Django Pandas
⭐
724
Tools for working with pandas in your Django projects
Pdpipe
⭐
710
Easy pipelines for pandas DataFrames.
Joinery
⭐
676
Data frames for Java
Technical
⭐
663
Various indicators developed or collected for the Freqtrade
Dataframe
⭐
642
Structured data processing in Kotlin
Dataframe Go
⭐
642
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Datafusion
⭐
626
DataFusion has now been donated to the Apache Arrow project
Tech.ml.dataset
⭐
616
A Clojure high performance data processing system
Datasheets
⭐
613
Read data from, write data to, and modify the formatting of Google Sheets
Fst
⭐
599
Lightning Fast Serialization of Data Frames for R
Pandastable
⭐
592
Table analysis in Tkinter using pandas DataFrames.
Eland
⭐
588
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Lumibot
⭐
544
Backtesting and Trading Bots Made Easy for Crypto, Stocks, Options, Futures, FOREX and more
Machine Learning
⭐
537
Practical Full-Stack Machine Learning
Metorikku
⭐
536
A simplified, lightweight ETL Framework based on Apache Spark
Spark Avro
⭐
535
Avro Data Source for Apache Spark
Data Science Your Way
⭐
532
Ways of doing Data Science Engineering and Machine Learning in R and Python
Pykrx
⭐
531
KRX 주식 정보 스크래핑
Flatten
⭐
508
Flatten JSON in Python
Traceml
⭐
490
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Pandas.net
⭐
487
Pandas port for C# and F#, data analysis tool, process multi-dim array in DataFrame.
Shc
⭐
484
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
Julia Dataframes Tutorial
⭐
474
A tutorial on Julia DataFrames package
Dataframesmeta.jl
⭐
457
Metaprogramming tools for DataFrames
Assertr
⭐
457
Assertive programming for R analysis pipelines
Chispa
⭐
443
PySpark test helper methods with beautiful error messages
Spark Scala Examples
⭐
443
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
Spark Solr
⭐
440
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Pyaf
⭐
438
PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.
Datawig
⭐
434
Imputation of missing values in tables.
Publicdatareader
⭐
430
공공 데이터 조회를 위한 오픈소스 파이썬 라이브러리
Spark Excel
⭐
421
A Spark plugin for reading and writing Excel files
Peroxide
⭐
418
Rust numeric library with R, MATLAB & Python syntax
Cdqa
⭐
418
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Ballista
⭐
411
Experimental Distributed Compute Platform based on Kubnernetes and Apache Arrow
Learningspark
⭐
406
Scala examples for learning to use Spark
Related Searches
Python Dataframe (1,170)
Pandas Dataframe (737)
R Dataframe (581)
Jupyter Notebook Dataframe (552)
1-100 of 1,064 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.