Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data generation
data-generation
x
116 search results found
Grounded Segment Anything
⭐
12,291
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Generatedata
⭐
2,164
A powerful, feature-rich, random test data generator.
Sdv
⭐
1,787
Synthetic data generation for tabular data
Data Augmentation Review
⭐
1,499
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Awesome Ai Ml Dl
⭐
1,375
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Synth
⭐
1,324
The Declarative Data Generator
Ctgan
⭐
1,066
Conditional GAN for generating synthetic tabular data.
Stream_data
⭐
817
Data generation and property-based testing for Elixir. 🔮
Openmixup
⭐
538
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Mockneat
⭐
511
MockNeat - the modern faker lib.
Rebel
⭐
493
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
Regexp Examples
⭐
483
Generate strings that match a given regular expression
Copulas
⭐
454
A library to model multivariate data using copulas.
Deepconvsep
⭐
397
Deep Convolutional Neural Networks for Musical Source Separation
Ratatool
⭐
333
A tool for data sampling, data generation, and data diffing
Pytorch Vdsr
⭐
261
VDSR (CVPR2016) pytorch implementation
Genalog
⭐
243
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Wakefield
⭐
234
Generate random data sets
Dbldatagen
⭐
234
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Be_great
⭐
207
A novel approach for synthesizing tabular data using pretrained large language models
Pydbgen
⭐
199
Random dataframe and database table generator
Vgn
⭐
175
Real-time 6 DOF grasp detection in clutter.
Faker Cxx
⭐
161
C++ Faker library for generating fake (but realistic) data for testing and development.
Realtabformer
⭐
151
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Datahelix
⭐
137
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
Rapiddweller Benerator Ce
⭐
121
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Picka
⭐
108
pip install picka - Picka is a python based data generation and randomization module which aims to increase coverage by increasing the amount of tests you _dont_ have to write by hand.
Sketchcnn
⭐
79
Robust freeform surface modeling from user 2d sketches.
Imagedataaugmentor
⭐
77
Custom image data generator for TF Keras that supports the modern augmentation module albumentations
Deepecho
⭐
75
Synthetic Data Generation for mixed-type, multivariate time series.
Unrealrox
⭐
75
Gratis
⭐
74
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
Simstudy
⭐
69
simstudy: Illuminating research methods through data generation
Exquisite
⭐
69
LINQ-like match_spec generation for Elixir.
Ranger
⭐
53
Ranger is contextual data generator used to make sensible data for integration tests or to play with it in the database
Kerasgen
⭐
53
A Keras/Tensorflow compatible image data generator for TripletLoss
Neuralyzer
⭐
47
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
Fake Data Generator
⭐
46
Just a small open-source script to create fake data given a simple JSON model.
Mockingbird
⭐
46
Mockingbird is a mock streaming data generator
Jazznet
⭐
43
jazznet dataset of piano patterns for music audio machine learning research
Hypothesis Graphql
⭐
39
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend implementation.
Scalacheck Faker
⭐
39
Fake Data Generation using ScalaCheck
Touchstone
⭐
34
query-aware data generation
Private Data Generation
⭐
32
A toolbox for differentially private data generation
Tvr
⭐
29
💥 Transformation Driven Visual Reasoning - CVPR 2021
Awesome Synthetic Data
⭐
28
📖 A curated list of resources dedicated to synthetic data
Datamaker
⭐
27
Data generator command-line tool and library. Create JSON, CSV, XML data from templates.
Noisemix
⭐
27
NoiseMix - data generation for natural language
Paper Learning To Pivot
⭐
27
Repository for the paper "Learning to Pivot with Adversarial Networks"
Trainer
⭐
27
Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.
Flexkbqa
⭐
25
FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering
Table Spec
⭐
25
Specs from SQL database schema for data generation and validation
Grade Rr
⭐
24
GRADE: Generating Animated Dynamic Environments for Robotics Research
Mocki
⭐
24
Mock your APIs at scale using Mocki 🦅
Fsspec
⭐
22
FsSpec represents value constraints as data to reuse one constraint declaration for validation, data generation, error explanation, and more.
Synthia
⭐
22
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
Bodo Examples
⭐
22
Generatedtir_tracking
⭐
22
Synthetic data generation for end-to-end TIR tracking (TIP2018)
Silverstripe Seeder
⭐
21
Declarative data generation for SilverStripe
Mobile Robotics
⭐
20
Deep Learning(PoseNet) Application in SLAM
Bin2ml
⭐
18
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
Fixturereplacement
⭐
17
FixtureReplacement rails plugin
Figureqa
⭐
16
Themis
⭐
16
Repository for OMOP CDM conventions as defined by THEMIS. These can be reference lists of concepts, pieces of standardized code for data generation or quality certification, and debates.
Tdk Demo
⭐
16
This is a collection of TDK demo projects that use different databases and options
Autofillr
⭐
15
A browser extension that fills registration forms with randomly but consistently generated fake data.
Ssb Dbgen
⭐
14
Star Schema Benchmark data set generator (dbgen) - unified repository
Symgen
⭐
13
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
Superpixelgridmasks
⭐
12
SuperpixelGridMasks is an approach for sensor-based data augmentation towards image classification tasks and so on.
Faker.portable
⭐
12
C# faked data generation for testing and prototyping purpose.
Quipp Pipeline
⭐
12
Privacy preserving synthetic data generation workflows
Data Rapid
⭐
12
Realistic Data Generation tool for Big Data Appliances and AI Solutions
Traffic Sign Recognition Basd On Synthesised Training Data
⭐
12
Using synthetic data in combination with Deep Learning, to determine if a system can be made that will be able to recognise and classify correctly real traffic signs.
Milkstraw Python Client
⭐
11
Generate artificial data with AI to augment your existing datasets and improve your AI performance.
Gantransferlimiteddata
⭐
11
This is a pytorch implementation of the paper "On Leveraging Pretrained GANs for Limited-Data Generation".
Simplefixture
⭐
11
Testing fixture for .Net
Datagenerator
⭐
11
Impactgen
⭐
11
Python script and Lua extension using BeamNG.tech to generate low impact crash scenarios and ground truth data for imitation learning.
Tpcds
⭐
10
TPC-DS benchmarks including data generation with Spark and queries with Spark
Spark Tss
⭐
9
Spark Time Series Set data analysis
Mincong H.github.io
⭐
9
Mincong's Personal Blog
Kaleidoscopegenerator
⭐
9
Implementation of kaleidoscope pattern generator written in C#.
Django Data Seeder
⭐
9
A data seeder for models for Django
Rna Seq Vae
⭐
9
Synthetic gene expression data generation using Variational Auto-Encoder
Public Talks
⭐
8
My public talks, their abstracts, code snippets, and sample projects
Clsgan
⭐
8
Synthetic financial time series generation with regime clustering
Dpautogan
⭐
7
Mock Data
⭐
7
Generate realistic test data.
Nifi Datasynthesizer
⭐
7
Apache NiFi Data Synthesizer
Data Faker
⭐
7
Fake Data Generation in Scala
Grade_tools
⭐
7
GRADE evaluation and processing scripts
Cocoa
⭐
7
datasets for testing anonymization algorithms
Registry Hedgehog
⭐
7
registry utilities to work with Hedgehog generators
Ddfmm
⭐
7
Distributed Directional Fast Multipole Method
Fake Data For Learning
⭐
7
Sample interesting fake data for machine and human learning
Vesselextract
⭐
7
U-net based CNN for segmenting blood vessel and thereafter removal of vessels from fundus image
Time Series Prediction
⭐
7
This repo contain the code related to the Medium post: https://medium.com/p/168b47e54d54
Self Driving Car Simulator
⭐
7
A self driving car created using Python, Keras/Tensorflow, Udacity simulator engine.
Codemixed Text Generator
⭐
6
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Intphys
⭐
6
Data generation for the Intuitive Physics Challenge
1-100 of 116 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.