Optimus Alternatives

Name: hi-primus/optimus
Brand: hi-primus/optimus
SKU: project/hi-primus/optimus
Rating: 4.94 (1540 reviews)

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Categories > Data Processing > Machine Learning

Suggest Alternative

Stars

1,540

Alternatives

License

apache-2.0

Open Issues

Most Recent Commit

over 1 year ago

Programming Language

Python

Monthly Downloads

Dependent Repos

Dependent Packages

Total Releases

Latest Release

June 19, 2022

Categories

Machine Learning > Machine Learning

Data Processing > Data Science

Data Processing > Spark

Data Processing > Data Analysis

Data Processing > Big Data

Data Processing > Pyspark

Data Processing > Data Cleaning

Data Processing > Data Transformation

Site

Repo

Alternatives To hi-primus/optimus

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
microsoft/SynapseML	5,228	0	6	about 1 month ago	12	November 27, 2023	335	mit	Scala
Simple and Distributed Machine Learning
ethen8181/machine-learning	2,607	0	0	over 2 years ago	0		6	mit	HTML
:earth_americas: machine learning tutorials (mainly in Python3)
hi-primus/optimus	1,540	0	0	over 1 year ago	32	June 19, 2022	29	apache-2.0	Python
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
jadianes/spark-py-notebooks	1,515	0	0	over 3 years ago	0		9	other	Jupyter Notebook
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
logicalclocks/hopsworks	1,041	0	0	over 2 years ago	1	September 11, 2019	12	agpl-3.0	Java
Hopsworks - Data-Intensive AI platform with a Feature Store
AlexIoannides/pyspark-example-project	1,034	0	0	over 3 years ago	0		11		Python
Example project implementing best practices for PySpark ETL jobs and applications.
kuwala-io/kuwala	610	0	0	almost 4 years ago	0		22	apache-2.0	JavaScript
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
firmai/pandapy	483	0	0	over 4 years ago	22	January 25, 2020	2		Python
PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
capitalone/datacompy	339	0	10	over 2 years ago	20	November 15, 2023	16	apache-2.0	Python
Pandas and Spark DataFrame comparison for humans and more!
Ibotta/sk-dist	283	2	0	over 3 years ago	12	May 14, 2020	8	apache-2.0	Python
Distributed scikit-learn meta-estimators in PySpark

Alternatives To hi-primus/optimus

Select To Compare

microsoft/SynapseML ⭐ 5,228

Simple and Distributed Machine Learning

dependent packages 6 total releases 12 most recent commit about 1 month ago

ethen8181/machine-learning ⭐ 2,607

:earth_americas: machine learning tutorials (mainly in Python3)

dependent packages 0 total releases 0 most recent commit over 2 years ago

hi-primus/optimus ⭐ 1,540

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

dependent packages 0 total releases 32 most recent commit over 1 year ago downloads badge

jadianes/spark-py-notebooks ⭐ 1,515

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

dependent packages 0 total releases 0 most recent commit over 3 years ago

logicalclocks/hopsworks ⭐ 1,041

Hopsworks - Data-Intensive AI platform with a Feature Store

dependent packages 0 total releases 1 most recent commit over 2 years ago downloads badge

AlexIoannides/pyspark-example-project ⭐ 1,034

Example project implementing best practices for PySpark ETL jobs and applications.

dependent packages 0 total releases 0 most recent commit over 3 years ago

kuwala-io/kuwala ⭐ 610

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times

dependent packages 0 total releases 0 most recent commit almost 4 years ago

firmai/pandapy ⭐ 483

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

dependent packages 0 total releases 22 most recent commit over 4 years ago downloads badge

capitalone/datacompy ⭐ 339

Pandas and Spark DataFrame comparison for humans and more!

dependent packages 10 total releases 20 most recent commit over 2 years ago downloads badge

Ibotta/sk-dist ⭐ 283

Distributed scikit-learn meta-estimators in PySpark

dependent packages 0 total releases 12 most recent commit over 3 years ago downloads badge

Suggest An Alternative To optimus

Alternative Project Comparisons

hi-primus/optimus vs Synapseml

hi-primus/optimus vs Machine Learning

hi-primus/optimus vs Optimus

hi-primus/optimus vs Spark Py Notebooks

hi-primus/optimus vs Hopsworks

hi-primus/optimus vs Pyspark Example Project

hi-primus/optimus vs Kuwala

hi-primus/optimus vs Pandapy

hi-primus/optimus vs Datacompy

hi-primus/optimus vs Sk Dist

Popular Pyspark Projects

kailashahirwar/cheatsheets-ai⭐ 13,281

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

JohnSnowLabs/spark-nlp⭐ 3,578

State of the Art Natural Language Processing

apache/linkis⭐ 3,407

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

ibis-project/ibis⭐ 3,404

The flexibility of Python with the scale and performance of modern SQL.

uber/petastorm⭐ 1,693

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

Popular Data Science Projects

microsoft/ML-For-Beginners⭐ 63,698

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

keras-team/keras⭐ 60,198

Deep Learning for humans

scikit-learn/scikit-learn⭐ 57,160

scikit-learn: machine learning in Python

apache/superset⭐ 56,358

Apache Superset is a Data Visualization and Data Exploration Platform

pandas-dev/pandas⭐ 41,008

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper