Dendrite

Dendrite is a library for querying large datasets on a single host at near-interactive speeds.
Alternatives To Dendrite
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Roapi2,969
4 months ago17March 20, 202237apache-2.0Rust
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Petastorm1,69385 months ago86February 03, 2023174apache-2.0Python
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Tech.ml.dataset616
3 months ago251January 05, 202110epl-1.0Clojure
A Clojure high performance data processing system
Kartothek163
a year ago38December 10, 202177mitPython
A consistent table management library in python
Datasets.jl104
5 months ago22mitJulia
Scientificsummarizationdatasets88
5 years ago2Jupyter Notebook
Datasets I have created for scientific summarization, and a trained BertSum model
Dendrite67
3 years ago27February 09, 20203otherJava
Dendrite is a library for querying large datasets on a single host at near-interactive speeds.
Spark Mail45
5 years ago3otherHTML
Tutorial on parsing Enron email to Avro and then explore the email set using Spark.
Snowset41
3 years ago1Jupyter Notebook
Snowflake dataset containing statistics for 70 million queries over 14 day period
Rasterly38
4 years ago2June 08, 2020otherR
Rapidly generate raster images from large datasets in R with Plotly.js
Alternatives To Dendrite
Select To Compare


Alternative Project Comparisons
Popular Dataset Projects
Popular Parquet Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Java
Dataset
Parquet