Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science Ipython Notebooks | 25,242 | | | | 3 months ago | | | 34 | other | Python |
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
Deeplearning4j | 13,157 | 175 | 110 | | 19 hours ago | 53 | August 10, 2022 | 616 | apache-2.0 | Java |
Suite of tools for deploying and training deep learning models on the JVM. Highlights include model import for Keras, TensorFlow, and ONNX/PyTorch; a modular and tiny C++ library for running math code; and a Java-based math library on top of the core C++ library. Also includes SameDiff, a PyTorch/TensorFlow-like library for running deep learning with automatic differentiation. ||||||||||
H2O-3 | 6,489 | 18 | 32 | | 9 hours ago | 241 | July 25, 2023 | 2,716 | apache-2.0 | Jupyter Notebook |
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc. | ||||||||||
BigDL | 4,396 | 10 | | | 10 hours ago | 16 | April 19, 2021 | 836 | apache-2.0 | Jupyter Notebook |
Accelerating LLMs with low-bit (INT3 / INT4 / NF4 / INT5 / INT8) optimizations using bigdl-llm. ||||||||||
TensorFlowOnSpark | 3,851 | 5 | | | 3 months ago | 32 | April 21, 2022 | 13 | apache-2.0 | Python |
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. | ||||||||||
XLearning | 1,729 | | | | 5 months ago | | | 44 | apache-2.0 | Java |
AI on Hadoop | ||||||||||
CaffeOnSpark | 1,272 | | | | 4 years ago | | | 78 | apache-2.0 | Jupyter Notebook |
Distributed deep learning on Hadoop and Spark clusters. | ||||||||||
TonY | 694 | 2 | | | 23 days ago | 52 | May 26, 2022 | 26 | other | Java |
TonY is a framework to natively run deep learning frameworks on Apache Hadoop. | ||||||||||
Dist-Keras | 611 | | | | 5 years ago | 2 | October 26, 2017 | 35 | gpl-3.0 | Python |
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark. | ||||||||||
Metronome | 103 | | | | 9 years ago | | | 3 | apache-2.0 | Java |
Suite of parallel iterative algorithms built on top of Iterative Reduce. ||||||||||
This is a Hadoop InputFormat that can be used to load Druid data from deep storage.
To install this library, run `mvn install`. You can then include it in projects with Maven by using the dependency:
```xml
<dependency>
  <groupId>io.imply</groupId>
  <artifactId>druid-hadoop-inputformat</artifactId>
  <version>0.1-SNAPSHOT</version>
</dependency>
```
Here's an example of creating an RDD in Spark:
```java
import java.util.List;

import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapred.JobConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.joda.time.Interval;

// The Druid-specific types (DruidInputFormat, DimFilter, InputRow) come from
// this library and the Druid dependencies it pulls in. `jsc` is an existing
// JavaSparkContext.
final JobConf jobConf = new JobConf();
final String coordinatorHost = "localhost:8081";
final String dataSource = "wikiticker";
final List<Interval> intervals = null; // null to include all time
final DimFilter filter = null;         // null to include all rows
final List<String> columns = null;     // null to include all columns

// Point the job at the Druid coordinator and datasource, with optional
// time, row, and column pruning.
DruidInputFormat.setInputs(
    jobConf,
    coordinatorHost,
    dataSource,
    intervals,
    filter,
    columns
);

// Each record is a Druid InputRow keyed by NullWritable.
final JavaPairRDD<NullWritable, InputRow> rdd = jsc.newAPIHadoopRDD(
    jobConf,
    DruidInputFormat.class,
    NullWritable.class,
    InputRow.class
);
```
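Once the RDD is created, the rows can be processed with ordinary Spark transformations. As a rough sketch, assuming Druid's `InputRow#getDimension(String)` accessor and a dimension named `page` (the dimension name is illustrative only, not something this README defines):

```java
// Hypothetical follow-up: count total rows and collect one dimension's values.
// InputRow#getDimension(String) returns the List<String> of values for that
// dimension in a row; "page" is a placeholder dimension name.
final long totalRows = rdd.count();

final JavaRDD<String> pages = rdd.values()
    .flatMap(row -> row.getDimension("page").iterator());
```

Since the key is always `NullWritable`, `rdd.values()` is usually the convenient starting point for further processing.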