Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning dataset
dataset
x
machine-learning
x
818 search results found
Tensorflow Examples
⭐
43,109
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
Datasets
⭐
18,390
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Vision
⭐
15,059
Datasets, Transforms and Models specific to Computer Vision
Tensor2tensor
⭐
13,701
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Fashion Mnist
⭐
9,856
A MNIST-like fashion product database. Benchmark 👇
Doccano
⭐
8,980
Open source annotation tool for machine learning practitioners.
Latex Ocr
⭐
8,088
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Deeplake
⭐
7,689
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Techniques
⭐
7,678
Techniques for deep learning with satellite & aerial imagery
Awesome Project Ideas
⭐
6,856
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Fiftyone
⭐
6,327
The open-source tool for building high-quality datasets and computer vision models
Seq2seq Couplet
⭐
5,447
Play couplet with seq2seq model. 用深度学习对对联。
Datasets
⭐
4,094
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Deep Person Reid
⭐
4,029
Torchreid: Deep learning person re-identification in PyTorch.
Artline
⭐
3,349
A Deep Learning based project for creating line art portraits.
Lstm Human Activity Recognition
⭐
3,074
Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier
Igel
⭐
3,037
a delightful machine learning tool that allows you to train, test, and use models without writing code
Alae
⭐
2,850
[CVPR2020] Adversarial Latent Autoencoders
Textattack
⭐
2,597
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Whylogs
⭐
2,533
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Deepdanbooru
⭐
2,280
AI based multi-label girl image classification system, implemented by using TensorFlow.
Datascience Pizza
⭐
2,199
🍕 Repositório para juntar informações sobre materiais de estudo em análise de dados e áreas afins, empresas que trabalham com dados e dicionário de conceitos
Pytorch Nlp
⭐
2,180
Basic Utilities for PyTorch Natural Language Processing (NLP)
Datasets
⭐
2,063
🎁 4,800,000+ Unsplash images made available for research and machine learning
Codesearchnet
⭐
2,054
Datasets, tools, and benchmarks for representation learning of code.
Objectron
⭐
1,958
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following cat
Deepvoice3_pytorch
⭐
1,906
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Sdv
⭐
1,787
Synthetic data generation for tabular data
Diffgram
⭐
1,772
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Coco Annotator
⭐
1,743
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Simplehtr
⭐
1,719
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
Pytorch Cpp
⭐
1,710
C++ Implementation of PyTorch Tutorials for Everyone
Petastorm
⭐
1,693
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Universal Data Tool
⭐
1,612
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
Autoviz
⭐
1,550
Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Deepmoji
⭐
1,462
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Wikisql
⭐
1,370
A large annotated semantic parsing corpus for developing natural language interfaces.
Face Mask Detection
⭐
1,355
Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
Uncertainty Baselines
⭐
1,324
High-quality implementations of standard and SOTA methods on a variety of tasks.
Twitter Sentiment Analysis
⭐
1,322
Sentiment analysis on tweets using Naive Bayes, SVM, CNN, LSTM, etc.
Fastdup
⭐
1,313
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
Dataprofiler
⭐
1,310
What's in your data? Extract schema, statistics and entities from datasets
Machine Learning With Python
⭐
1,155
Small scale machine learning projects to understand the core concepts . Give a Star 🌟If it helps you. BONUS: Interview Bank coming up..!
Graph Fraud Detection Papers
⭐
1,148
A curated list of graph-based fraud, anomaly, and outlier detection papers & resources
Scikit Lego
⭐
1,099
Extra blocks for scikit-learn pipelines.
Data Juicer
⭐
994
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Insuranceqa Corpus Zh
⭐
989
🚁 保险行业语料库,聊天机器人
Audino
⭐
988
Open source audio annotation tool for humans
Datasets
⭐
957
Machine learning datasets used in tutorials on MachineLearningMastery.com
Bmw Tensorflow Training Gui
⭐
954
This repository allows you to get started with a gui based training a State-of-the-art Deep Learning model with little to no configuration needed! NoCode training with TensorFlow has never been so easy.
Chatgpt Comparison Detection
⭐
921
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Otto
⭐
913
Otto makes machine learning an intuitive, natural language experience. 🏆 Facebook AI Hackathon winner ⭐️ #1 Trending on MadeWithML.com ⭐️ #4 Trending JavaScript Project on GitHub ⭐️ #15 Trending (All Languages) on GitHub
Medmnist
⭐
903
[pip install medmnist] 18x Standardized Datasets for 2D and 3D Biomedical Image Classification
Tdc
⭐
889
Therapeutics Data Commons: Artificial Intelligence Foundation for Therapeutic Science
Torchmoji
⭐
882
😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc
Facerecognitiondotnet
⭐
866
The world's simplest facial recognition api for .NET on Windows, MacOS and Linux
What If Tool
⭐
864
Source code/webpage/demos for the What-If Tool
Awesome Twitter Data
⭐
847
A list of Twitter datasets and related resources.
Deepsvg
⭐
829
[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.
Facerank
⭐
821
FaceRank - Rank Face by CNN Model based on TensorFlow (add keras version). FaceRank-人脸打分基于 TensorFlow (新增 Keras 版本) 的 CNN 模型(QQ群:167122861)。技术支持:http://tensorflow123.com
Awesome Robotics
⭐
817
A curated list of awesome links and software libraries that are useful for robots.
Emotion Recognition Neural Networks
⭐
803
Emotion recognition using DNN with tensorflow
Graph2vec
⭐
791
A parallel implementation of "graph2vec: Learning Distributed Representations of Graphs" (MLGWorkshop 2017).
Awesome Cybersecurity Datasets
⭐
765
A curated list of amazingly awesome Cybersecurity datasets
Datastream.io
⭐
761
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Torchxrayvision
⭐
760
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
Streaming
⭐
755
A Data Streaming Library for Efficient Neural Network Training
Self Driving Car In Video Games
⭐
729
A deep neural network that learns to drive in video games
Awesome Dataset Tools
⭐
714
🔧 A curated list of awesome dataset tools
Hate Speech And Offensive Language
⭐
698
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
Machinelearning
⭐
684
Machine learning resources,including algorithm, paper, dataset, example and so on.
Thoughtsource
⭐
680
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
Meta Dataset
⭐
679
A dataset of datasets for learning to learn from few examples
Conversational Datasets
⭐
678
Large datasets for conversational AI
Semantic Kitti Api
⭐
665
SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
Curated List Of Awesome 3d Morphable Model Software And Data
⭐
664
The idea of this list is to collect shared data and algorithms around 3D Morphable Models. You are invited to contribute to this list by adding a pull request. The original list arised from the Dagstuhl seminar on 3D Morphable Models https://www.dagstuhl.de/19102 in March 2019.
Mimic3 Benchmarks
⭐
661
Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.
Openml
⭐
637
Open Machine Learning
Cam2bev
⭐
627
TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.
Proteinnet
⭐
623
Standardized data set for machine learning of protein structure
Tech.ml.dataset
⭐
616
A Clojure high performance data processing system
Sr Gnn
⭐
607
[AAAI 2019] Source code and datasets for "Session-based Recommendation with Graph Neural Networks"
Datasets Server
⭐
578
Lightweight web API for visualizing and exploring all types of datasets - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
Single Parameter Fit
⭐
548
Real numbers, data science and chaos: How to fit any dataset with a single parameter
Moabb
⭐
546
Mother of All BCI Benchmarks
Tensorflow Value Iteration Networks
⭐
544
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper
Pycococreator
⭐
532
Helper functions to create COCO datasets
Php Ml Examples
⭐
525
Examples use case of PHP-ML library.
Datasets
⭐
521
A repository of pretty cool datasets that I collected for network science and machine learning research.
Free Spoken Digit Dataset
⭐
518
A free audio dataset of spoken digits. Think MNIST for audio.
Csl
⭐
513
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集
Ml4se
⭐
511
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
Awsome Deep Learning For Video Analysis
⭐
507
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Time Series Forecasting With Python
⭐
499
A use-case focused tutorial for time series forecasting with python
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Daisyrec
⭐
496
This is the repository of our article published in RecSys 2020 "Are We Evaluating Rigorously? Benchmarking Recommendation for Reproducible Evaluation and Fair Comparison" and of several follow-up studies.
Convokit
⭐
483
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
Rnnlg
⭐
476
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Related Searches
Python Dataset (15,316)
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Jupyter Notebook Dataset (6,824)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Natural Language Processing (3,891)
Machine Learning Artificial Intelligence (3,877)
Machine Learning Data Science (3,802)
Machine Learning Pytorch (2,910)
1-100 of 818 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.