Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python dataframe
dataframe
x
python
x
673 search results found
Polars
āĀ
20,625
Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends
Modin
āĀ
8,990
Modin: Scale your Pandas workflows by changing a single line of code
Vaex
āĀ
7,985
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second š
Pygwalker
āĀ
7,423
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Cudf
āĀ
5,974
cuDF - GPU DataFrame Library
Datasciencepython
āĀ
4,776
common data analysis and machine learning tasks using python
Arrow Datafusion
āĀ
4,069
Apache Arrow DataFusion SQL Query Engine
Mimesis
āĀ
4,067
Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
Pandas Ta
āĀ
3,993
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Koalas
āĀ
3,291
Koalas: pandas API on Apache Spark
Sklearn Pandas
āĀ
2,724
Pandas integration with sklearn
Pandas Datareader
āĀ
2,668
Extract data from a wide range of Internet sources into a pandas DataFrame.
Mars
āĀ
2,643
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Sweetviz
āĀ
2,604
Visualize and compare datasets, target values and associations, with one line of code.
Pandera
āĀ
2,602
A light-weight, flexible, and expressive statistical data testing library
Swifter
āĀ
2,311
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Vincent
āĀ
2,054
A Python to Vega translator
100 Pandas Puzzles
āĀ
1,977
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Tabula Py
āĀ
1,918
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Pandas Videos
āĀ
1,808
Jupyter notebook and datasets from the pandas Q&A video series
Sketch
āĀ
1,759
AI code-writing assistant that understands data content
Finta
āĀ
1,677
Common financial technical indicators implemented in Pandas.
Connector X
āĀ
1,527
Fastest library to load data from DB to DataFrames in Rust and Python
Autoviz
āĀ
1,350
Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
D3py
āĀ
1,349
a plottling library for python, based on D3
Sequoia
āĀ
1,349
Ač”čŖåØéč”ēØåŗļ¼å®ē°äŗęµ·é¾äŗ¤ęę³åćē¼ äøčÆ“ē¦ ēåøä¹°ē¹ļ¼ä»„åå ¶ä»č„å¹²ē§ęęÆå½¢ę
Quandl Python
āĀ
1,309
Statistical Analysis Python Tutorial
āĀ
1,233
Statistical Data Analysis in Python
Pyjanitor
āĀ
1,197
Clean APIs for data cleaning. Python implementation of R package Janitor
Siuba
āĀ
1,052
Python library for using dplyr like syntax with pandas and SQL
Hamilton
āĀ
963
Your single tool to express data, ML, and LLM pipelines with simple python functions. Runs anywhere that python runs, E.G. spark, airflow, jupyter, fastapi, etc. Incrementally adoptable. Use Hamilton to build testable, reusable, and self-documenting dataflows with lineage and metadata out of the box.
Arrow Ballista
āĀ
930
Apache Arrow Ballista Distributed Query Engine
Fast Pandas
āĀ
928
Benchmark for different operations in pandas against various dataframe sizes.
Daft
āĀ
900
The Python DataFrame for Complex Data
Ppscore
āĀ
893
Predictive Power Score (PPS) in Python
Hamilton
āĀ
877
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Hvplot
āĀ
806
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
Pandashells
āĀ
786
š¼ Bringing the python data stack to the shell prompt
Pyspark Examples
āĀ
778
Pyspark RDD, DataFrame and Dataset Examples in Python language
Jqdatasdk
āĀ
770
ē®åęēØēéåéčę°ę®å (easy utility for getting financial market data of China)
Dfply
āĀ
734
dplyr-style piping operations for pandas dataframes
Django Pandas
āĀ
724
Tools for working with pandas in your Django projects
Dataframe Go
āĀ
642
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Technical
āĀ
632
Various indicators developed or collected for the Freqtrade
Datasheets
āĀ
580
Read data from, write data to, and modify the formatting of Google Sheets
Pandastable
āĀ
578
Table analysis in Tkinter using pandas DataFrames.
Eland
āĀ
557
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Data Science Your Way
āĀ
532
Ways of doing Data Science Engineering and Machine Learning in R and Python
Pykrx
āĀ
531
KRX 주ģ ģ 볓 ģ¤ķ¬ėķ
Quinn
āĀ
518
pyspark methods to enhance developer productivity š£ šÆ š
Traceml
āĀ
486
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Lumibot
āĀ
448
Backtesting and Trading Bots Made Easy for Crypto, Stocks, Options, Futures, FOREX and more
Pyaf
āĀ
438
PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.
Chispa
āĀ
435
PySpark test helper methods with beautiful error messages
Cdqa
āĀ
418
ā [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Optopsy
āĀ
417
A nimble options backtesting library for Python
Publicdatareader
āĀ
407
공공 ė°ģ“ķ° ģ”°ķ넼 ģķ ģ¤ķģģ¤ ķģ“ģ¬ ė¼ģ“ėøė¬ė¦¬
Pystore
āĀ
404
Fast data store for Pandas time-series data
Pyupbit
āĀ
373
python wrapper for upbit API
Gspread Pandas
āĀ
371
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Flatten
āĀ
366
Flatten JSON in Python
Pandasvault
āĀ
353
Advanced Pandas Vault ā Utilities, Functions and Snippets (by @firmai).
Static Frame
āĀ
351
Immutable and grow-only Pandas-like DataFrames with a more explicit and consistent interface.
Pyterrier
āĀ
336
A Python framework for performing information retrieval experiments, building on http://terrier.org/
Riptable
āĀ
333
64bit multithreaded python data analytics tools for numpy arrays and datasets
Datacompy
āĀ
316
Pandas and Spark DataFrame comparison for humans and more!
Sidetable
āĀ
310
sidetable builds simple but useful summary tables of your data
Styleframe
āĀ
308
A library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Data Science Hacks
āĀ
300
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Sparkflow
āĀ
290
Easy to use library to bring Tensorflow on Apache Spark
Vnquant
āĀ
283
VietNam Data Stock Market Price
Opendartreader
āĀ
267
Open DART Reader
Pandasticsearch
āĀ
265
An Elasticsearch client exposing DataFrame API
Pyspark Style Guide
āĀ
264
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Pandas Workshop
āĀ
253
An introductory workshop on pandas with notebooks and exercises for following along.
Geostatspy
āĀ
252
GeostatsPy Python package for spatial data analytics and geostatistics. Mostly a reimplementation of GSLIB, Geostatistical Library (Deutsch and Journel, 1992) in Python. Geostatistics in a Python package. I hope this resources is helpful, Prof. Michael Pyrcz
Plydata
āĀ
232
A grammar for data manipulation in Python
Fast Trade
āĀ
232
low code backtesting library utilizing pandas and technical analysis indicators
Nlp_profiler
āĀ
227
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Pydbgen
āĀ
199
Random dataframe and database table generator
Snowpark Python
āĀ
197
Snowflake Snowpark Python API
Gspread Dataframe
āĀ
194
Read/write Google spreadsheets using pandas DataFrames
Xgbmagic
āĀ
192
Xfeat
āĀ
183
Flexible Feature Engineering & Exploration Library using GPUs and Optuna.
Git Pandas
āĀ
177
A wrapper around gitpython to produce pandas dataframes for analysis
Ditching Excel For Python
āĀ
175
Functionalities in Excel translated to Python
Pandas.jl
āĀ
172
A Julia front-end to Python's Pandas package.
Autosklearn Zeroconf
āĀ
163
autosklearn-zeroconf is a fully automated binary classifier. It is based on the AutoML challenge winner auto-sklearn. Give it a dataset with known outcomes (labels) and it returns a list of predicted outcomes for your new data. It even estimates the precision for you! The engine is tuning massively parallel ensemble of machine learning pipelines for best precision/recall.
Rightmove_webscraper.py
āĀ
161
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Visualize_ml
āĀ
160
Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.
Argopy
āĀ
159
A python library for Argo data beginners and experts
Tensorflow Recorder
āĀ
158
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
Castra
āĀ
153
Partitioned storage system based on blosc. **No longer actively maintained.**
Data Algorithms With Spark
āĀ
151
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Wbdata
āĀ
150
A python library for accessing world bank data
Pyspark Cheatsheet
āĀ
140
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Panthera
āĀ
140
Data-frames & arrays on Clojure
D3graph
āĀ
135
Creation of interactive networks using d3 Javascript
Py Market Profile
āĀ
134
A library to calculate Market Profile (aka Volume Profile) for financial data from a Pandas DataFrame.
Tableone
āĀ
134
Create "Table 1" for research papers in Python
Related Searches
Python Python3 (857,414)
Python Pytorch (17,410)
Python Dataset (14,792)
Python Tensorflow (14,628)
Python Machine Learning (14,099)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Html (9,891)
Python Testing (9,432)
Python Natural Language Processing (8,742)
1-100 of 673 search results
Next >
Privacy
Ā |Ā
About
Ā |Ā
Terms
Ā |Ā
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source.Ā All rights reserved.