Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java data science
data-science
x
java
x
88 search results found
Ray
⭐
29,596
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Openrefine
⭐
10,106
OpenRefine is a free, open source power tool for working with messy data and improving it
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
H2o 3
⭐
6,618
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Smile
⭐
5,833
Statistical Machine Intelligence & Learning Engine
Tablesaw
⭐
3,328
Java dataframe and visualization library
Dex
⭐
1,193
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Sklearn Porter
⭐
1,150
Transpile trained scikit-learn estimators to C, Java, JavaScript and others.
Datumbox Framework
⭐
1,089
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Odd Platform
⭐
1,047
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Hopsworks
⭐
1,041
Hopsworks - Data-Intensive AI platform with a Feature Store
Dataflowjavasdk
⭐
853
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Zingg
⭐
828
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Curriculum
⭐
762
👩🏫 👨🏫 The open-source curriculum of Enki!
Elki
⭐
746
ELKI Data Mining Toolkit
Tech.ml.dataset
⭐
616
A Clojure high performance data processing system
Courses
⭐
590
Answers for Quizzes & Assignments that I have taken
Krangl
⭐
559
krangl is a {K}otlin DSL for data w{rangl}ing
Datacleaner
⭐
557
The premier open source Data Quality solution
Project Guidance
⭐
367
:octocat:🌟 The Ultimate resources for beginner to advance level projects all at one place 💻 🎯🚀
Datavines
⭐
275
Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
Morpheus Core
⭐
239
The foundational library of the Morpheus data science framework
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
D2l Java
⭐
153
The Java implementation of Dive into Deep Learning (D2L.ai)
Tennis Crystal Ball
⭐
150
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Blockchain2graph
⭐
135
Blockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Artificial Intelligence And Machine Learning
⭐
135
A repository for implementation of artificial intelligence algorithm which includes machine learning and deep learning algorithm as well as classical AI search algorithm
Algocode
⭐
115
Welcome everyone!🌟 Here you can solve problems, build scrappers and much more💻
Toolbox
⭐
104
A Java Toolbox for Scalable Probabilistic Machine Learning
Classifai
⭐
96
🔥 One of the most comprehensive open-source data annotation platform.
Topic Modeling Tool
⭐
84
A point-and-click tool for creating and analyzing topic models produced by MALLET.
Wrangler
⭐
83
Wrangler Transform: A DMD system for transforming Big Data
Anomalydetecttool
⭐
83
A tool of detecting anomaly points from data
R Course
⭐
59
Una introduccion al analisis de datos con R y R Studio
Tombolodigitalconnector
⭐
58
The Tombolo Digital Connector enables users to combine different sources of data in a transparent and reproducible way.
Platzi Courses
⭐
57
All the courses which appear here is what I have taken in platzi.com and I own all the files except the files inside folders named shared, with much pleasure Fork and download all my sources if you like it and if you want to improve this repo just make a Pull request 😀.
Ml Models
⭐
54
Machine Learning Procedures and Functions for Neo4j
Books
⭐
53
A collection of online books for data science, computer science and coding!
Ai Platform
⭐
51
An open-source platform for automating tasks using machine learning models
Dataframe
⭐
51
DataFrame Library for Java
Data Structures
⭐
49
Computer science data structures and algorithms implementation from scratch
100 Days Of Code
⭐
43
100 Days of Code Learning program to keep a habit of coding daily and learn things at your own pace with help from our remote community.
Liblevenshtein Java
⭐
40
Various utilities regarding Levenshtein transducers. (Java)
Data Polygamy
⭐
38
Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Dtcleaner
⭐
37
DTCleaner: data cleaning using multi-target decision trees.
Nocodefunctions Web App
⭐
30
The code base of the front-end of nocodefunctions.com
Learning Spark
⭐
29
Tidy up Spark and Hadoop tutorials.
Java For Data Science
⭐
28
Code reposiory for Java for Data Science, published by Packt
Mastering Java For Data Science
⭐
27
Mastering-Java-for-Data-Science, published by Packt
Geneticalgorithm
⭐
24
Refactoring and improving a gist from Vijini Mallawaarachchi
Metacat
⭐
23
Data repository software that helps researchers preserve, share, and discover data
180protocol
⭐
22
Confidential compute for sensitive data sharing and commercial collaboration
Pyrefine
⭐
20
Execute OpenRefine JSON scripts without OpenRefine (or Java)
Datapackage Java
⭐
19
A Java library for working with Frictionless Data Data Packages.
Ylib
⭐
19
📖 Kişisel Kütüphanem
Powerlaws
⭐
17
Java library for the analysis of power law distributed data. Scroll down for README.
Vita
⭐
17
A Versatile Toolkit for Generating Indoor Mobility Data for Real-World Buildings
Computing With Data
⭐
15
Code samples for my book "Computing with Data: An Introduction to the Data Industry"
Code Backup
⭐
14
All naive programs done by me till now!
Feedzai Openml
⭐
14
API for Feedzai's Open Machine Learning that allows to integrate ML algorithms in Feedzai's platform.
Genstar
⭐
14
Generation of Synthetic Populations Library
Vaults
⭐
14
Notes of CS Concept & Code Snippet
Ride
⭐
13
A nice R development and analytics environment, for the Renjin JVM implementation of R
Agepredictor
⭐
13
Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum
Nyc_taxi_trip_duration
⭐
13
Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS
Datascience
⭐
13
Introduction to Data Science - Bill Howe -- Spring 2013/4
Fione
⭐
12
Fione is Enterprise AI Platform
Uminho
⭐
12
📚 University projects, exercises & notes
Mastering Java Data Science
⭐
11
The code for the book "Mastering Java for Data Science"
Pstl
⭐
10
Parallel Streaming Transformation Loader
Jnotebook
⭐
10
Notebook for Java.
Hust Projects
⭐
9
My labs in college of CS and some interesting projects at HUST.
Klab
⭐
9
Rcoboldi
⭐
9
R COBOL DI (Data Integration) Package : Import COBOL CopyBook data files directly into R as properly structured data frames.
Hackerranksolutions
⭐
9
My HackerRank Solutions for Python, Java, C, C++, Shell, SQL, JavaScript and Interview Preparation Kit
Reposchina
⭐
8
Mirrors and registries in Mainland China to speedup your package installation for data science
Certificates
⭐
8
All Certificate Achieved Till 2023
Linwin Db Server
⭐
8
在广袤无垠的现代大数据海洋之中,计算机深度的和信息以及数据绑定,承载这亿万数据的就是数据库软件。 Linwin Data Server,基于Java开发的国产高性能数据库软件。支持国产和Linux操作系统,支持多用户操作。 用户数据的增删改查全部在内存内操作,与硬盘的交互写入读取交由专门的线程管理,无不妨碍.
Drill Logfile Plugin
⭐
8
Generic log parser for Apache Drill
Clusterless
⭐
8
Clusterless is a tool for scheduling decentralized, scalable, and secure data pipelines for continuously arriving data, across clouds.
Project_based_learning
⭐
7
This repository includes links to forked repositories which hold a list of projects that could be helpful for project-based learning.
Fepipeline
⭐
7
A easy to get start, scalable, distributed feature engineering framework based on Spark.
Wudac
⭐
7
Welcome to Wharton Undergraduate Data Analytics Club. In this repository we host and compile resources for students in the hopes that this will aid in their learning process.
Computer Science B.sc Materials
⭐
7
Computer Science B.Sc coureses materials
Drill Network Functions
⭐
6
Networking functions for Apache Drill
Metadig Engine
⭐
6
MetaDig Engine: multi-dialect metadata assessment engine
Chromosomedna
⭐
6
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
Building Neural Networks From Scratch
⭐
6
Building Neural Networks from Scratch book repository.
Hands On Data Science With Java
⭐
6
Code Repository for Hands-on Data Science with Java, published by Packt
Tessellate
⭐
5
A data engineering cli for reading and writing data to/from multiple locations across multiple formats.
Quantumics Opensource
⭐
5
This is Quantumics.AI's public repository, inviting people from arround the world to contrubute and take advantage of free No code DataOps platform
Feedzai Openml R
⭐
5
Implementations for Feedzai's OpenML APIs to allow for usage of machine learning models in the R programming language.
Drill Useragent Function
⭐
5
Drill UDF for parsing User Agent Strings.
Jobs
⭐
5
Job openings at Quod AI
Hacktoberfest2019
⭐
5
Disease Pattern Miner
⭐
5
Disease Pattern Miner is a free, open-source mining framework for interactively discovering sequential disease patterns in medical health record datasets.
Everanalyzer
⭐
5
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Sonarissuescoring
⭐
5
Where do we refactor next? A predictive maintenance approach to java code smells.
Practical_bscit_mscit_ninad
⭐
5
Practical of B.Sc. IT and M.Sc. IT
Sourceafis Visualization Java
⭐
5
Visualizations of biometric features in fingerprint templates produced by SourceAFIS and in algorithm transparency data captured during feature extraction and matching in SourceAFIS.
Related Searches
Java Spring (21,350)
Java Spring Boot (11,982)
Java Video Game (8,093)
Java Gradle (8,072)
Python Data Science (6,905)
Java Docker (6,180)
Java Database (6,015)
Java Mysql (5,954)
Java Sdk (5,864)
Javascript Java (5,468)
1-88 of 88 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.