Awesome Open Source
Search results for python data processing
139 search results found
Dali
⭐
4,770
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Pandera
⭐
2,807
A light-weight, flexible, and expressive statistical data testing library
Texar
⭐
2,008
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow
Bonobo
⭐
1,548
Extract Transform Load for Python 3.5+
Satpy
⭐
980
Python package for earth-observing satellite data processing
Bytewax
⭐
957
Python Stream Processing
Texar Pytorch
⭐
711
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Haupt
⭐
451
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Nonechucks
⭐
315
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Lithops
⭐
305
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Dolma
⭐
302
Data and tools for generating and inspecting OLMo pre-training data.
Fondant
⭐
293
Production-ready data processing made easy and shareable
Rapidtables
⭐
284
Super-fast library for converting lists of dicts to pre-formatted tables, for Python 2/3
Pysparkling
⭐
253
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Scramjet
⭐
243
Public tracker for Scramjet Cloud Platform, a platform that brings data from many environments together.
Machine Learning Notebooks
⭐
241
Machine Learning notebooks for refreshing concepts.
50 Days Of Ml
⭐
237
A day-to-day plan for this challenge (50 Days of Machine Learning). Covers both theoretical and practical aspects.
Padasip
⭐
236
Python Adaptive Signal Processing
Vaspy
⭐
220
Manipulating VASP files with Python.
Forte
⭐
215
Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
Batchflow
⭐
195
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Dataflows
⭐
182
DataFlows is a simple, intuitive, lightweight framework for building data processing flows in Python.
Convtools Ita
⭐
176
convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.
Salem
⭐
161
Add geolocalised subsetting, masking, and plotting operations to xarray
Brutalityextractor
⭐
160
Multiprocess decompression software for high-performance systems
Rsgislib
⭐
130
Remote Sensing and GIS Software Library; python module tools for processing spatial data.
Machine_learning_a Z
⭐
130
Learning to create Machine Learning Algorithms
Sayn
⭐
117
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Libertem
⭐
104
Open pixelated STEM framework
Dampr
⭐
101
Python Data Processing library
Cotk
⭐
93
Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation
Breast Cancer Risk Prediction
⭐
83
Classification of Breast Cancer diagnosis Using Support Vector Machines
Financial Statement Pdf Extractor
⭐
70
Python script to extract as much structured information as possible from annual/quarterly reports.
Vip
⭐
68
VIP is a python package/library for angular, reference star and spectral differential imaging for exoplanet/disk detection through high-contrast imaging.
Deep Learn Oil
⭐
68
Deep learning tools for predicting oil well data
Perke
⭐
67
A keyphrase extractor for Persian
Machine Learning For Solar Energy Prediction
⭐
57
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Tubes
⭐
55
A series of tubes.
Pyseis
⭐
54
Pure python seismic data processing
Data_processing_course
⭐
53
Some class materials for a data processing course using PySpark
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Pyint
⭐
52
Python- and GAMMA-based interferometry toolbox for processing single or time-series InSAR data.
Tqdj
⭐
51
A progress bar that plays lofi music
Itertable
⭐
49
⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON.
Brepnet
⭐
46
BRepNet: A topological message passing system for solid models
Pygaps
⭐
42
A framework for processing adsorption data and isotherm fitting
Data Science Using Python University Course Module
⭐
40
“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.
Kafka_stock
⭐
34
A financial data processing and visualization platform using Apache Kafka, Apache Cassandra, and Bokeh.
Data Visualization With Python
⭐
30
Data Visualization Tutorial | Matplotlib | Seaborn | Pandas
Handsondeeplearningwithpytorch
⭐
30
Code snippets and applications explained in the book - HandsOnDeepLearningWithPytorch
Developer Guide Hands On App
⭐
26
Hands-on application for the Industrial Edge Developer Guide
Python36
⭐
25
Sample projects written in Python
Zenaton Python
⭐
24
🐍 Python library to run and orchestrate background jobs with Zenaton Workflow Engine
Gpuparallel
⭐
21
Joblib-like interface for parallel GPU computations (e.g. data preprocessing)
Cdp Backend
⭐
20
Data storage utilities and processing pipelines used by CDP instances.
Prairie
⭐
20
A visual programming environment for Python
Glide
⭐
19
Easy ETL
Data Processing And Visualization
⭐
19
This document forms the basis of several workshops/talks that get into everyday programming with R, but also includes mirrored code in Python as Jupyter notebooks.
Restaurant Finder Featurereviews
⭐
19
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
An Overview Of Python Datatable Package
⭐
18
Python library for efficient multi-threaded data processing, with the support for out-of-memory datasets.
Bonobo Sqlalchemy
⭐
18
PREVIEW - SQL databases in Bonobo, using sqlalchemy
Speech Recognition
⭐
18
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Goofi Pipe
⭐
18
real-time neuro-/biosignal processing and streaming pipeline
Pax
⭐
16
The XENON1T raw data processor [deprecated]
Smartpipeline
⭐
16
A framework for rapid development of robust data pipelines following a simple design pattern
Pyvaspflow
⭐
16
vasp calculation flow
Stepist
⭐
16
Framework for data processing
Accelerator Project_skeleton
⭐
16
Python
⭐
16
python code for data processing
Reki
⭐
16
A data preparation tool in CEMC/CMA.
Sparklanes
⭐
16
A lightweight data processing framework for Apache Spark
Computing With Data
⭐
15
Code samples for my book "Computing with Data: An Introduction to the Data Industry"
Machine Learning And Data Processing
⭐
15
A collection of resources on machine learning, data processing and related areas
Neutompy Toolbox
⭐
15
Python package for tomographic data processing and reconstruction
Online_store
⭐
15
End to end data engineering project
Querido Diario Data Processing
⭐
15
Text processing repository to free Brazilian municipal gazettes from closed file formats for the Querido Diário project.
Easyml
⭐
14
A Python Package for data processing and building ML models, primarily based on pandas and sklearn libraries.
Tasrif
⭐
14
Tasrif is a python library for processing of wearable data from fitness trackers and wearable health devices
Automated Data Preprocessing
⭐
14
A command-line utility program for automating the trivial, frequently occurring data preparation tasks: missing value interpolation, outlier removal, and encoding categorical variables.
Codraft
⭐
13
The Codra Filtering Tool, an open-source Signal and Image Processing Software
Msdlib
⭐
13
This is a custom library for data processing, visualization and machine learning tools.
High Performance Data Processing In Python
⭐
12
Talk demonstrating how to massively optimise data processing and numerical computation in Python
Qmm
⭐
12
Python Quadratic Majorization-Minimization (MM) optimization algorithms of half-quadratic criteria. Inverse problems, image restoration, denoising, ...
Data Paths
⭐
11
Awareness
⭐
11
The new architecture of co-computation for data processing and machine learning.
Mercury Dataschema
⭐
11
Utility package that, given a Pandas DataFrame, uses the DataSchema class to auto-infer feature types and automatically calculate different statistics depending on the types.
Thepipe
⭐
11
A simplistic, general purpose pipeline framework.
Connectome
⭐
10
A library for datasets containing heterogeneous data
Tumor Cell Segmentation
⭐
10
Tumor cell segmentation using Inception-v3 and FCN models
Pipe21
⭐
10
Simple functional pipes
Datasetops
⭐
10
Fluent dataset operations, compatible with your favorite libraries
Rpi
⭐
10
RPJiOS: RPJ's RPi OS, a sensor data platform for the Raspberry Pi built with Python 2.7 and Redis.
Problematic
⭐
10
Python library for processing serial electron diffraction data
Cs229 Machine Learning Solar Energy Predictions
⭐
10
Predicting solar energy using machine learning (LSTM, PCA, boosting). This is our CS 229 project from autumn 2017. Report and poster are included.
Panoptes Pipeline
⭐
10
PANOPTES Data Processing Pipeline
Chipexo
⭐
10
Instagrampredictor
⭐
9
Machine Learning project to predict popularity of Instagram posts
Fortehealth
⭐
9
The project is in the incubation stage and still under development. ForteHealth is a flexible and powerful ML workflow builder for biomedical and clinical scenarios. This is part of the CASL project: http://casl-project.ai/
Amical
⭐
9
Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of the last generation of instruments (ground-based and space).
Picoss
⭐
9
Python Interface for the Classification of Seismic Signals
1-100 of 139 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.