Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Dataprofiler | 1,310 | 3 | | | 3 months ago | 53 | November 14, 2023 | 56 | apache-2.0 | Python |
What's in your data? Extract schema, statistics and entities from datasets | ||||||||||
Choetl | 693 | 1 | 9 | | 7 months ago | 177 | September 21, 2023 | 62 | mit | C# |
ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files) | ||||||||||
Vscode Data Preview | 447 | | | | a year ago | | | 54 | apache-2.0 | TypeScript |
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files | ||||||||||
Kafka Connect File Pulse | 289 | 5 | | | 5 months ago | 5 | July 05, 2023 | 30 | apache-2.0 | Java |
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka | ||||||||||
Rumble | 194 | | | | a year ago | 4 | December 03, 2019 | 134 | other | Java |
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark \| Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) \| No install required (just a jar to download) \| Declarative Machine Learning and more | ||||||||||
Bdt | 125 | | | | 5 months ago | 21 | November 22, 2023 | 6 | apache-2.0 | Rust |
Boring Data Tool | ||||||||||
Schemer | 89 | | | | 4 years ago | 15 | March 02, 2018 | | apache-2.0 | Scala |
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. | ||||||||||
Daflow | 24 | | | | 4 years ago | | | 8 | other | Scala |
Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules. | ||||||||||
Serialization Benchmark | 10 | | | | 6 years ago | | | | mit | Scala |
benchmark for modern serialization systems: Apache Avro, Protocol Buffers, Apache Thrift and MessagePack written in Scala | ||||||||||
Flinkparquet | 10 | | | | 9 years ago | | | 1 | | Java |
Using the Parquet file format (with Avro) to process data with Apache Flink |