Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data transformation
data-transformation
x
93 search results found
Glom
⭐
1,783
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
Optimus
⭐
1,447
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Pglogical
⭐
839
Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Transformerkit
⭐
831
A block-based API for NSValueTransformer, with a growing collection of useful examples.
Zingg
⭐
828
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Optimus
⭐
707
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Prose
⭐
606
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Porter
⭐
601
💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.
Collapse
⭐
556
Advanced and Fast Data Transformation in R
Graylog Docker
⭐
323
Official Graylog Docker image
Sqawk
⭐
272
Like Awk but with SQL and table joins
Naas
⭐
266
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
Temme
⭐
258
📄 Concise selector to extract JSON from HTML.
Fastverse
⭐
207
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
Setl
⭐
173
A simple Spark-powered ETL framework that just works 🍺
Datagene
⭐
170
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
Sjmisc
⭐
154
Data transformation and utility functions for R
Data Algorithms With Spark
⭐
151
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Scalaml
⭐
144
Project, source code and data files for 1st edition "Scala for Machine Learning"
Cq
⭐
139
Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Big Data Mapreduce Course
⭐
135
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Allie
⭐
126
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
Clojure Dsl Resources
⭐
120
A curated list of Clojure resources for dealing with domain-specific languages.
Dataanim
⭐
104
R package for visualising data transformation using animations.
Plyranges
⭐
103
A grammar of genomic data transformation
Weaverbird
⭐
91
A visual data pipeline builder with various backends
Wrangler
⭐
83
Wrangler Transform: A DMD system for transforming Big Data
Gallia Core
⭐
79
A schema-aware Scala library for data transformation
Maximum Plaid
⭐
78
Template driven data visualisation for Ember
Java8
⭐
74
Java 8 features
Dry Transformer
⭐
72
Data transformation toolkit
Pycontw2017
⭐
59
Files for PyCon TW 2017
Transform
⭐
57
Using io.Reader for data transformation in Go
Daany
⭐
54
Daany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.
Semantic Bus
⭐
51
object flow treatment, data transformation
Dataweaveinapex
⭐
49
Examples for working with DataWeave scripts from Apex.
Aws Dbs Refarch Datalake
⭐
47
Reference Architectures for Datalakes on AWS
Php Serializer
⭐
44
Serialize PHP variables, including objects, in any format. Support to unserialize it too.
Foundationsinr
⭐
44
This repository will contain getting started material with R using the StatsBomb dataset.
Pipe_envy
⭐
44
Elixir style pipe operator for Ruby
Typestream
⭐
39
⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first
Data Lens
⭐
29
Functional utilities for Common Lisp
Real Time 3d Pose Estimation With Unity3d Public
⭐
28
Autonomio
⭐
26
Core functionality for the Autonomio augmented intelligence workbench.
Basic Rotating Machine Vibration Analysis
⭐
25
These codes realize data transformation and simple data processing for fault diagnosis.
Datapackage M
⭐
24
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
Py Responsys
⭐
23
Foofah
⭐
21
Foofah: programming-by-example data transformation program synthesizer
Serializer
⭐
21
A PHP serialization component focused on performance
Photon
⭐
21
Photon is an event store with cold+hot event streaming
Pycsvw
⭐
20
A tool to read CSV files with CSVW metadata and transform them into other formats.
Data Refinery
⭐
19
Data transformation
Glide
⭐
19
Easy ETL
Bamboolib_binder_template
⭐
19
bamboolib - template for creating your own binder notebook
Peer Blender
⭐
18
Peer assessment-based homework tool
Field_mapper
⭐
17
Data mapping & transformation
Ldwizard Old
⭐
17
A generic framework for simplifying the creation of linked data.
Object Mapper
⭐
16
Maps generically data from source to target object via extensible strategies and controls
Svizzle
⭐
16
Svelte components for data visualisation and utilities for data transformation.
Unquery
⭐
15
Command line query tool for JSON files
Fragments
⭐
15
Transform and compose data for HTTP transactions.
Tutorials
⭐
13
Short programming tutorials pertaining to data analysis.
Serializer Benchmark
⭐
13
A PHP benchmark application to compare PHP serializer libraries
Richflow
⭐
13
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Python Pipefitter
⭐
12
The SAS pipefitter package provides a Python API for developing pipelines for data transformation and model fitting as stages of a repeatable machine learning workflow in either SAS v9 or SAS Viya.
Datapipe
⭐
12
dataPipe is a data processing and data analytics library for JavaScript. Inspired by LINQ (C#) and Pandas (Python)
Houston
⭐
12
Houston is a data transformation and synchronization framework that has support baked in for pulling and pushing data to and from other services, a sophisticated job queue for fault tolerance, and a parent child model that make complex data structures easier to work with across systems
Dynamic.yaml
⭐
12
DEPRECATED: YAML-based data transformations
Vue Models
⭐
11
Backbone inspired plugin for handling models in Vue.js with built-in serialization
Pipeline
⭐
11
support for asynchronous networking and data transformation
I2o Transform
⭐
11
PCORnet Ontology to OMOP - beta!
Vscode Atlasmap
⭐
10
VS Code plugins providing Atlasmap tooling
Purescript Morello
⭐
10
Cherry-picking 🍒 for your data
Hk Atm Locator
⭐
10
🏧 香港自動櫃員機定位器 🏧 Centralising Automated Teller Machine (ATM) Data in Hong Kong in a well-defined yet standardised format and display in a web portal for public use
Ldwizard
⭐
10
🧙 LDWizard
Datacooker Etl
⭐
10
Data transformation framework for ETL processing with SQL-like syntax and GIS extensions, based on Apache Spark
Swallow
⭐
9
Python framework for data transformation
Scrape
⭐
9
When you need those jobs hypersonic 🚀 scrape 🔪
Sparqlist
⭐
9
SPARQList: Repository server for working SPARQL snippets
Transformer
⭐
8
A few JavaScript one-liners for rapid data transformation.
Robologs Ros Actions
⭐
8
A collection of actions for working with ROS data
Apipage
⭐
7
API Frontend Page
Sqltask
⭐
7
ETL tool for performing mostly SQL-based data transformation
Index Array By
⭐
7
A utility function to index arrays by any criteria
R_programming_for_research
⭐
6
R Programming for Research workshop outline and materials
Dbingestor
⭐
6
This library provides wrappers to data ingestion into various DB systems, such as MySQL, SQLite3, and ODBC compatible systems. You can use this C++ library to ingest data in any format (you need to implement the reading routines though). The library provides an ingestion buffer that allows for multiple rows to be ingested at one time. This greatly improves ingestion performance. Further assertion and conversion functions are provided for elementary data transformation and processing. These can e
Geolatte Common
⭐
6
Basic components for spatial data handling and processing in Java
Object Model Transform
⭐
6
Transforms objects and its object property into model.
Kiba Extend
⭐
6
Extensions to Kiba ETL
Lib Rest Client Common
⭐
5
Generic library for RESTful API clients
Dlpipelines.jl
⭐
5
Interfaces for deep learning data pipelines.
Kgfarm
⭐
5
A Holistic Platform for Automating Data Preparation
Select Prism
⭐
5
Use a Monocle Prism to handle <select> conflict between ADTs and Strings
1-93 of 93 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.