Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for dataframe
dataframe
x
1,064 search results found
Pystore
⭐
404
Fast data store for Pandas time-series data
Static Frame
⭐
388
Immutable and statically-typeable DataFrames with runtime type and data validation
Spark Fast Tests
⭐
385
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Dataframe Js
⭐
383
A javascript library providing a new data structure for datascientists and developpers
Pytorch Frame
⭐
377
Tabular Deep Learning Library for PyTorch
Pyupbit
⭐
373
python wrapper for upbit API
Qframe
⭐
372
Immutable data frame for Go
Gspread Pandas
⭐
371
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Pyterrier
⭐
359
A Python framework for performing information retrieval experiments, building on http://terrier.org/
Pandasvault
⭐
353
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Googlesheets4
⭐
345
Google Spreadsheets R API (reboot of the googlesheets package)
Pyculiarity
⭐
341
A Python port of Twitter's AnomalyDetection R Package
Styleframe
⭐
341
A library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Datacompy
⭐
339
Pandas and Spark DataFrame comparison for humans and more!
Riptable
⭐
339
64bit multithreaded python data analytics tools for numpy arrays and datasets
Row Oriented Workflows
⭐
331
Row-oriented workflows in R with the tidyverse
Rsruby
⭐
327
Ruby - R bridge.
Vnquant
⭐
325
VietNam Data Stock Market Price
Fast Trade
⭐
311
low code backtesting library utilizing pandas and technical analysis indicators
Sidetable
⭐
310
sidetable builds simple but useful summary tables of your data
Py Market Profile
⭐
303
A library to calculate Market Profile (aka Volume Profile) for financial data from a Pandas DataFrame.
Sparkflow
⭐
301
Easy to use library to bring Tensorflow on Apache Spark
Data Science Hacks
⭐
300
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Neo4j Spark Connector
⭐
300
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Spark Hbase Connector
⭐
287
Connect Spark to HBase for reading and writing data with ease
Cylon
⭐
286
Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.
Pandas Workshop
⭐
285
An introductory workshop on pandas with notebooks and exercises for following along.
Nimdata
⭐
276
DataFrame API written in Nim, enabling fast out-of-core data processing
Geni
⭐
268
A Clojure dataframe library that runs on Spark
Opendartreader
⭐
267
Open DART Reader
Pandasticsearch
⭐
265
An Elasticsearch client exposing DataFrame API
Pyspark Style Guide
⭐
264
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Geostatspy
⭐
252
GeostatsPy Python package for spatial data analytics and geostatistics. Mostly a reimplementation of GSLIB, Geostatistical Library (Deutsch and Journel, 1992) in Python. Geostatistics in a Python package. I hope this resources is helpful, Prof. Michael Pyrcz
Tablecloth
⭐
250
Dataset manipulation library built on the top of tech.ml.dataset
Rust Dataframe
⭐
250
A Rust DataFrame implementation, built on Apache Arrow
Dataframe_image
⭐
244
A python package for embedding pandas DataFrames as images into pdf and markdown documents
Sql Spark Connector
⭐
242
Apache Spark Connector for SQL Server and Azure SQL
Morpheus Core
⭐
239
The foundational library of the Morpheus data science framework
Inspectdf
⭐
236
🛠️ 📊 Tools for Exploring and Comparing Data Frames
Ggplot2 Tutorial
⭐
233
Quick introduction to ggplot2 (no knowledge of R assumed)
Plydata
⭐
232
A grammar for data manipulation in Python
Nlp_profiler
⭐
227
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Rsocrata
⭐
227
Provides easier interaction with Socrata open data portals http://dev.socrata.com. Users can provide a 'Socrata' data set resource URL, or a 'Socrata' Open Data API (SoDA) web query, or a 'Socrata' "human-friendly" URL, returns an R data frame. Converts dates to 'POSIX' format. Manages throttling by 'Socrata'.
Rasterframes
⭐
226
Geospatial Raster support for Spark DataFrames
Rightmove_webscraper.py
⭐
219
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Datatable
⭐
218
A go in-memory table
Ethnicolr
⭐
218
Predict Race and Ethnicity Based on the Sequence of Characters in a Name
Abris
⭐
215
Avro SerDe for Apache Spark structured APIs.
Snowpark Python
⭐
215
Snowflake Snowpark Python API
Isolation Forest
⭐
211
A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Writexl
⭐
206
Portable, light-weight data frame to xlsx exporter for R
Pydbgen
⭐
199
Random dataframe and database table generator
Edibble
⭐
199
An R-package that encapsulate elements of experimental design for better planning, management, and workflow
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Gspread Dataframe
⭐
194
Read/write Google spreadsheets using pandas DataFrames
Xgbmagic
⭐
192
Send Email With R
⭐
189
How to send a bunch of email from R
Dlookr
⭐
186
Tools for Data Diagnosis, Exploration, Transformation
Xfeat
⭐
183
Flexible Feature Engineering & Exploration Library using GPUs and Optuna.
Rlist
⭐
182
A Toolbox for Non-Tabular Data Manipulation
Git Pandas
⭐
177
A wrapper around gitpython to produce pandas dataframes for analysis
Ditching Excel For Python
⭐
175
Functionalities in Excel translated to Python
Pandas.jl
⭐
172
A Julia front-end to Python's Pandas package.
Gdeltpyr
⭐
170
Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.
Allaboutscala
⭐
168
Source code for www.allaboutscala.com tutorials
Tidyquery
⭐
164
Query R data frames with SQL
Autosklearn Zeroconf
⭐
163
autosklearn-zeroconf is a fully automated binary classifier. It is based on the AutoML challenge winner auto-sklearn. Give it a dataset with known outcomes (labels) and it returns a list of predicted outcomes for your new data. It even estimates the precision for you! The engine is tuning massively parallel ensemble of machine learning pipelines for best precision/recall.
Argopy
⭐
162
A python library for Argo data beginners and experts
Visualize_ml
⭐
160
Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.
Ddf
⭐
160
Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine
Tensorflow Recorder
⭐
158
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
Collapsibletree
⭐
154
Create Interactive Collapsible Tree Diagrams in R using D3.js
Spark Binlog
⭐
153
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Candlestick Patterns
⭐
153
Candlestick patterns detector
Castra
⭐
153
Partitioned storage system based on blosc. **No longer actively maintained.**
Data Algorithms With Spark
⭐
151
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Wbdata
⭐
150
A python library for accessing world bank data
Tableone
⭐
144
Create "Table 1" for research papers in Python
D3graph
⭐
143
Creation of interactive networks using d3 Javascript
Panthera
⭐
140
Data-frames & arrays on Clojure
Pyspark Cheatsheet
⭐
140
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Osd Bike Routes
⭐
139
Open source release of bike routes in Chicago.
Apache Spark Node
⭐
134
Node.js bindings for Apache Spark DataFrame APIs
Repurrrsive
⭐
133
Recursive lists to use in teaching and examples, because there is no mtcars for lists.
Woodwork
⭐
133
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
Bioframe
⭐
129
Pandas utilities for tab-delimited and other genomic data files
Handyspark
⭐
129
HandySpark - bringing pandas-like capabilities to Spark dataframes
Aut
⭐
128
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Datamaid
⭐
128
An R package for data screening
Df2gspread
⭐
127
Manage Google Spreadsheets in Pandas DataFrame with Python
Pandavro
⭐
127
Apache Avro <-> pandas DataFrame
Mikeio
⭐
124
Read, write and manipulate dfs0, dfs1, dfs2, dfs3, dfsu and mesh files.
Utah
⭐
123
Dataframe structure and operations in Rust
Pandas Dedupe
⭐
123
Simplifies use of the Dedupe library via Pandas
Inmemorydatasets.jl
⭐
122
Multithreaded package for working with tabular data in Julia
Useless_r_functions
⭐
121
Useless R Functions. That's it
Dh Core
⭐
118
Functional data science
Spotify Tensorflow
⭐
117
Provides Spotify-specific TensorFlow helpers
Pandas_redshift
⭐
117
Load data from redshift into a pandas DataFrame and vice versa.
Datamancer
⭐
117
A dataframe library with a dplyr like API
Related Searches
Python Dataframe (1,170)
Pandas Dataframe (737)
R Dataframe (581)
Jupyter Notebook Dataframe (552)
101-200 of 1,064 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.