Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Transformers | 112,668 | 64 | 1,869 | 13 hours ago | 114 | July 18, 2023 | 865 | apache-2.0 | Python | |
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. | ||||||||||
Stable Diffusion Webui | 104,033 | 15 hours ago | 2 | January 17, 2022 | 1,522 | agpl-3.0 | Python | |||
Stable Diffusion web UI | ||||||||||
Pytorch | 71,230 | 3,341 | 6,728 | 13 hours ago | 37 | May 08, 2023 | 12,813 | other | Python | |
Tensors and Dynamic neural networks in Python with strong GPU acceleration | ||||||||||
Keras | 59,447 | 578 | 13 hours ago | 80 | June 27, 2023 | 100 | apache-2.0 | Python | ||
Deep Learning for humans | ||||||||||
Real Time Voice Cloning | 47,152 | 7 days ago | 168 | other | Python | |||||
Clone a voice in 5 seconds to generate arbitrary speech in real-time | ||||||||||
Yolov5 | 41,939 | 2 days ago | 8 | September 21, 2021 | 223 | agpl-3.0 | Python | |||
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite | ||||||||||
Annotated_deep_learning_paper_implementations | 36,223 | 1 | 11 days ago | 78 | September 24, 2022 | 27 | mit | Jupyter Notebook | ||
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠 | ||||||||||
Made With Ml | 34,217 | 4 days ago | 5 | May 15, 2019 | 4 | mit | Jupyter Notebook | |||
Learn how to design, develop, deploy and iterate on production-grade ML applications. | ||||||||||
Gfpgan | 32,185 | 9 | 19 days ago | 11 | September 20, 2022 | 271 | other | Python | ||
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. | ||||||||||
Mockingbird | 30,784 | a month ago | 2 | February 28, 2022 | 446 | other | Python | |||
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time |
This repo contains training code, inference code and pre-trained model for Cloths Parsing from human portrait. Here clothes are parsed into 3 category: Upper body(red), Lower body(green) and Full body(yellow)
This model works well with any background and almost all poses. For more samples visit samples.md
U2NET : This project uses an amazing U2NET as a deep learning model. Instead of having 1 channel output from u2net for typical salient object detection task it outputs 4 channels each respresting upper body cloth, lower body cloth, fully body cloth and background. Only categorical cross-entropy loss is used for a given version of the checkpoint.
Dataset : U2net is trained on 45k images iMaterialist (Fashion) 2019 at FGVC6 dataset. To reduce complexity, I have clubbed the original 42 categories from dataset labels into 3 categories (upper body, lower body and full body). All images are resized into square \_()_/
768 x 768 px for training. (This experiment was conducted with 768 px but around 384 px will work fine too if one is retraining on another dataset).
train
folder which contains training images and train.csv
which is label csv file in options/base_options.py
python setup_model_weights.py
and it will generate weights after model surgey in prev_checkpoints
folder.options/base_options.py
like checkpoint saving folder, logs folder etc.distributed = False
in options/base_options.py
, for multi gpu set it to True
.python train.py
python -m torch.distributed.launch --nnodes=1 --node_rank=0 --nproc_per_node=4 --use_env train.py
trained_checkpoint
folder.input_images
folderpython infer.py
for inference.output_images