Awesome Open Source
Awesome Open Source
Combined Topics
data-augmentation
x
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 59 Data Augmentation Open Source Projects
Categories
>
Data Processing
>
Data Augmentation
Snorkel
⭐
4,399
A system for quickly generating training data with weak supervision
Dali
⭐
3,027
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
Face.evolve.pytorch
⭐
2,217
🔥🔥High-Performance Face Recognition Library on PyTorch🔥🔥
Torchsample
⭐
1,639
High-Level Training, Data Augmentation, and Utilities for Pytorch
Textattack
⭐
1,178
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP
Nlp_xiaojiang
⭐
886
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
Eda_nlp
⭐
866
Data augmentation for NLP, presented at EMNLP 2019
Dataaugmentationforobjectdetection
⭐
787
Data Augmentation For Object Detection
Data Augmentation Review
⭐
718
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
Inltk
⭐
688
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
Yolo Tf2
⭐
675
yolo(v3/v4) implementation in keras and tensorflow 2.3
Eda_nlp_for_chinese
⭐
623
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
Torchio
⭐
569
Medical image preprocessing and augmentation toolkit for deep learning
Paddleclas
⭐
533
A treasure chest for image classification powered by PaddlePaddle
Random Erasing
⭐
492
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
Pba
⭐
456
Efficient Learning of Augmentation Policy Schedules
Deepconvsep
⭐
421
Deep Convolutional Neural Networks for Musical Source Separation
Mobilepose Pytorch
⭐
413
Light-weight Single Person Pose Estimator
Specaugment
⭐
395
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Mixup
⭐
370
Implementation of the mixup training method
Audiomentations
⭐
361
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Nlpcda
⭐
359
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
Amazon Forest Computer Vision
⭐
344
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks
Deltapy
⭐
334
DeltaPy - Tabular Data Augmentation (by @firmai)
Image_augmentor
⭐
317
Data augmentation tool for images
Caer
⭐
305
A lightweight, scalable Computer Vision library for high-performance AI research.
Dab
⭐
294
Data Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ
Drq
⭐
251
DrQ: Data regularized Q
Solt
⭐
247
Streaming over lightweight data transformations
Mixup Generator
⭐
242
An implementation of "mixup: Beyond Empirical Risk Minimization"
Zeroth
⭐
240
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Torchsat
⭐
237
🔥TorchSat 🌏 is an open-source deep learning framework for satellite imagery analysis based on PyTorch.
Nlp Data Augmentation
⭐
224
Data Augmentation for NLP. NLP数据增强
Syndata Generation
⭐
201
Code used to generate synthetic scenes and bounding box annotations for object detection. This was used to generate data used in the Cut, Paste and Learn paper
Tensorflow Mnist Cnn
⭐
180
MNIST classification using Convolutional NeuralNetwork. Various techniques such as data augmentation, dropout, batchnormalization, etc are implemented.
Scaper
⭐
177
A library for soundscape synthesis and augmentation
Muda
⭐
172
A library for augmenting annotated audio data
Tsaug
⭐
172
A Python package for time series augmentation
Stylealign
⭐
171
[ICCV 2019]Aggregation via Separation: Boosting Facial Landmark Detector with Semi-Supervised Style Transition
Torch_videovision
⭐
166
Transforms for video datasets in pytorch
Imagecorruptions
⭐
148
Python package to corrupt arbitrary images.
Semsegpipeline
⭐
122
A simpler way of reading and augmenting image segmentation data into TensorFlow
Unsupervised Data Augmentation
⭐
122
Unofficial PyTorch Implementation of Unsupervised Data Augmentation.
Evoskeleton
⭐
119
Official project website for the CVPR 2020 paper (Oral Presentation) "Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data"
Ghost Free Shadow Removal
⭐
115
[AAAI 2020] Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN
All Conv Keras
⭐
114
All Convolutional Network: (https://arxiv.org/abs/1412.6806#) implementation in Keras
What I Have Read
⭐
109
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
Fcn_train
⭐
104
The code includes all the file that you need in the training stage for FCN
Cutmix
⭐
94
a Ready-to-use PyTorch Extension of Unofficial CutMix Implementations with more improved performance.
Synthetic Occlusion
⭐
88
Synthetic Occlusion Augmentation
Pose Adv Aug
⭐
84
Code for "Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation" (CVPR 2018)
Pedestrian Synthesis Gan
⭐
72
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
Data_generator_object_detection_2d
⭐
70
A data generator for 2D object detection
Dips
⭐
58
NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
Dda
⭐
57
Differentiable Data Augmentation Library
Doccreator
⭐
51
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
Handwriting_recogition_using_adversarial_learning
⭐
50
[CVPR 2019] "Handwriting Recognition in Low-resource Scripts using Adversarial Learning ”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
Veri Artirma Data Augmentation
⭐
23
Bu repoda veri artırma (data augmentation) ile ilgili pratik uygulamalara ulaşabilirsiniz.
All Classifiers 2019
⭐
21
A collection of computer vision projects for Acute Lymphoblastic Leukemia classification/early detection.
1-59 of 59 projects
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210