Awesome Open Source
Search results for dataset artificial intelligence
96 search results found
Deeplake
⭐
7,601
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Transformers Tutorials
⭐
6,731
This repository contains demos I made with the Transformers library by HuggingFace.
Fiftyone
⭐
6,327
The open-source tool for building high-quality datasets and computer vision models
Datasets
⭐
4,094
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Artline
⭐
3,349
A Deep Learning based project for creating line art portraits.
Igel
⭐
3,037
a delightful machine learning tool that allows you to train, test, and use models without writing code
Data Competition Topsolution
⭐
2,847
Data competition top solutions: an open-source collection of top-ranking data competition solutions
Objectron
⭐
1,958
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following cat
Pytorch Cpp
⭐
1,710
C++ Implementation of PyTorch Tutorials for Everyone
Fluid
⭐
1,488
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
Chatgpt Comparison Detection
⭐
921
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Otto
⭐
913
Otto makes machine learning an intuitive, natural language experience. 🏆 Facebook AI Hackathon winner ⭐️ #1 Trending on MadeWithML.com ⭐️ #4 Trending JavaScript Project on GitHub ⭐️ #15 Trending (All Languages) on GitHub
Tdc
⭐
889
Therapeutics Data Commons: Artificial Intelligence Foundation for Therapeutic Science
Trustworthyai
⭐
853
trustworthy AI related projects
Replica Dataset
⭐
744
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
Prompt4reasoningpapers
⭐
717
Repository for the ACL2023 paper "Reasoning with Language Model Prompting: A Survey".
Ai_challenger_2018
⭐
625
AI Challenger, a platform offering open datasets and programming competitions for artificial intelligence (AI) talents around the world. https://challenger.ai/
Game Datasets
⭐
584
🎮 A curated list of awesome game datasets and tools for artificial intelligence in games
Stardata
⭐
549
Starcraft AI Research Dataset
Deep Trading Agent
⭐
514
Deep Reinforcement Learning based Trading Agent for Bitcoin
Neuralnetwork.net
⭐
472
A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Oie Resources
⭐
435
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Cleora
⭐
434
Cleora AI is a general-purpose model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data.
Bmsg Gan
⭐
429
[MSG-GAN] Any body can GAN! Highly stable and robust architecture. Requires little to no hyperparameter tuning. Pytorch Implementation
Pytorch Cyclegan
⭐
371
A clean and readable Pytorch implementation of CycleGAN
Image Quality
⭐
312
Image quality is an open source software library for Image Quality Assessment (IQA).
Dalle Mtf
⭐
296
OpenAI's DALL-E for large scale training in mesh-tensorflow.
Datascience_course
⭐
289
Data Science course in Portuguese
Textbook_quality
⭐
274
Generate textbook-quality LLM pretraining data
Squirrel Core
⭐
271
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰
Driverlessai Recipes
⭐
222
Recipes for Driverless AI
Aidl_kb
⭐
218
A Knowledge Base for the FB Group Artificial Intelligence and Deep Learning (AIDL)
Eccv2022 Papers With Code Demo
⭐
207
A collection of the latest ECCV results, including papers, code, and demo videos. Recommendations are welcome!
Ai Audio Datasets
⭐
199
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Data Science Resources
⭐
197
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Deep Learning With Python
⭐
197
Deep learning codes and projects using Python
Bert Attributeextraction
⭐
185
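Using BERT for attribute extraction in knowledge graphs, with fine-tuning and feature extraction. Applies BERT-based fine-tuning and feature-extraction methods to extract attributes from Baidu Baike person entries for a knowledge graph.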
Fakenewscorpus
⭐
184
A dataset of millions of news articles scraped from a curated list of data sources.
Awesome Llm Eval
⭐
183
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of LLMs.
Logo_builder
⭐
182
Free AI powered logo builder
Starwhale
⭐
178
an MLOps/LLMOps platform
Hello Kaggle Guide Kor
⭐
175
A guide for people who are new to Kaggle
Pureml
⭐
174
Developer platform for production ML.
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Fiftyone Examples
⭐
169
Examples of using FiftyOne
Trustllm
⭐
164
TrustLLM: Trustworthiness in Large Language Models
Qb
⭐
160
QANTA Quiz Bowl AI
Dspp Keras
⭐
160
Protein order and disorder data for the Keras, TensorFlow, and Edward frameworks, with an automated update cycle made for continuous learning applications.
Aitlas
⭐
157
AiTLAS implements state-of-the-art AI methods for exploratory and predictive analysis of satellite images.
Csghub
⭐
157
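CSGHub is an open-source large model assets platform, like an on-premise Hugging Face, which helps to manage datasets, model files, code, and more. CSGHub is an open-source, trusted platform for managing large-model assets that helps users govern the lifecycle of LLMs and LLM applications [...] managing LLM assets in the way that Glance manages virtual machine images, Harbor manages container images, and Sonatype Nexus manages artifacts. Feedback and stars are welcome ⭐️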
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Tabformer
⭐
144
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Conversationalir
⭐
133
Overview of venues, research themes and datasets relevant for conversational search.
Learnpaddle2
⭐
130
Tutorial series for the PaddlePaddle Fluid version; CSDN blog column:
Chatgpt Retrievalqa
⭐
130
A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses, with the option of training/evaluating on real human responses.
Sscbench
⭐
127
SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving
Computer Go Dataset
⭐
126
datasets for computer go
Papers With Data
⭐
117
A curated list of papers that released datasets along with their work
Bimcv Covid 19
⭐
112
Valencia Region Image Bank (BIMCV) that combines data from the PadChest dataset with future datasets based on COVID-19 pathology to provide the open scientific community with data of clinical-scientific value that helps early detection of COVID-19
Sat2graph
⭐
112
Sat2Graph: Road Graph Extraction through Graph-Tensor Encoding
Personalized Dialog
⭐
112
Code for the paper 'Personalization in Goal-oriented Dialog' (NeurIPS 2017 Conversational AI Workshop)
Ai Tod
⭐
108
Official code for "Tiny Object Detection in Aerial Images".
Spoken_language_identification
⭐
105
Identify a spoken language using artificial intelligence (LID).
Tegridy Midi Dataset
⭐
103
Tegridy MIDI Dataset for precise and effective Music AI models creation.
Chatgirl
⭐
99
ChatGirl is an AI chatbot based on the TensorFlow Seq2Seq model. (Includes a preprocessed English Twitter dataset and code for training, running, and tooling; stars are welcome.) QQ group: 167122861
Text_predictor
⭐
99
Char-level RNN LSTM text generator📄.
Meme Generator
⭐
87
MemeGen is a web application where the user gives an image as input and our tool generates a meme at one click for the user.
Go Dataset
⭐
85
21.1 million Go games, 18k-9p
Student Teacher Anomaly Detection
⭐
83
Student–Teacher Anomaly Detection with Discriminative Latent Embeddings
3d Medical Segmentation Gan
⭐
81
3D Liver Segmentation with GAN
Practicalmachinelearning
⭐
79
A curated collection of machine learning resources, including notebooks, code, and books, all of which are either free or open-source
Hello Kaggle Guide
⭐
78
For someone who is new at Kaggle
Nn Scratch
⭐
74
Coding up a Neural Network Classifier from Scratch
Pathvqa
⭐
74
Rid Covid
⭐
71
Image-based COVID-19 diagnosis. Links to software, data, and other resources.
Sordi Ai Evaluation Gui
⭐
68
This repository allows you to evaluate a trained computer vision model and get general information and evaluation metrics with little configuration.
Face Mask Detector
⭐
67
Real-time face mask detection using deep learning with alert system 💻🔔
Brihaspati
⭐
66
Collection of various implementations and Codes in Machine Learning, Deep Learning and Computer Vision ✨💥
Danes
⭐
65
DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)
Ds With Pysimplegui
⭐
62
Data science and Machine Learning GUI programs/ desktop apps with PySimpleGUI package
Monitors4codegen
⭐
60
Code and data artifact for the NeurIPS 2023 paper "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multilspy` is an LSP client library in Python intended to be used to build applications around language servers.
Cppe Dataset
⭐
58
Code for our paper CPPE-5 (Medical Personal Protective Equipment), a new challenging object detection dataset
Xray
⭐
53
List of datasets and papers in X-ray security images (Computer vision/Machine Learning)
Awesome Healthmetrics
⭐
52
A curated list of awesome resources at the intersection of healthcare and AI
Dikedataset
⭐
51
Dataset with labeled benign and malicious files 🗃️
Recommender System Datasets
⭐
50
A list of compatible datasets, noting other major repositories containing popular real-world datasets, along with sample code for a range of recommendation tasks.
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Kogpt
⭐
48
GPT-2 pretrained on Korean datasets.
Rams
⭐
45
Official TensorFlow code for paper "Multi-Image Super Resolution of Remotely Sensed Images Using Residual Attention Deep Neural Networks".
Covid Net
⭐
43
Launched in March 2020 in response to the coronavirus disease 2019 (COVID-19) pandemic, COVID-Net is a global open source, open access initiative dedicated to accelerating advancement in machine learning to aid front-line healthcare workers and clinical institutions around the world fighting the continuing pandemic. Towards this goal, our global multi-disciplinary team of researchers, developers, and clinicians have made publicly available a suite of tailored deep neural network models for tackl
Ossdc Visionai Core
⭐
42
OSSDC Vision AI - a platform for live testing and development of computer vision and artificial intelligence algorithms targeting robotics, home automation, and autonomous vehicles
Gaia Dataset
⭐
42
GAIA, with the full name Generic AIOps Atlas, is an overall dataset for analyzing operation problems such as anomaly detection, log analysis, fault localization, etc.
Stereo_depth_estimator
⭐
40
Stereo depth estimation for self-driving cars 🚗
Crest
⭐
40
A Causal Relation Schema for Text
Physics Benchmarking Neurips2021
⭐
40
Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines", presented at NeurIPS 2021 (Datasets & Benchmarks track)
Ai Sentiment Analysis On Imdb Dataset
⭐
40
Sentiment Analysis using Stochastic Gradient Descent on 50,000 Movie Reviews Compiled from the IMDB Dataset
Deep Learning Resources
⭐
39
A place to gather deep learning resources
Vocalforge
⭐
39
Your one-stop solution for voice dataset creation
Watch_and_help
⭐
39
Code for the paper Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration
Wikiwhy
⭐
38
WikiWhy is a new benchmark for evaluating LLMs' ability to explain cause-effect relationships. It is a QA dataset containing 9000+ "why" question-answer-rationale triplets.
Related Searches
Python Dataset (14,792)
Jupyter Notebook Dataset (6,824)
Python Artificial Intelligence (6,759)
Machine Learning Artificial Intelligence (3,856)
Jupyter Notebook Artificial Intelligence (3,189)
Deep Learning Artificial Intelligence (2,507)
Deep Learning Dataset (2,364)
Machine Learning Dataset (2,279)
Dataset Pytorch (1,847)
Artificial Intelligence Neural Network (1,732)
Copyright 2018-2024 Awesome Open Source. All rights reserved.