Awesome Open Source
Search results for deep learning vision transformer
deep-learning
vision-transformer
88 search results found
Latex Ocr
⭐
8,088
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Awesome Transformer Attention
⭐
3,895
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Mmpretrain
⭐
3,172
OpenMMLab Pre-training Toolbox and Benchmark
Scenic
⭐
2,733
Scenic: A Jax Library for Computer Vision Research and Beyond
Transformer Explainability
⭐
1,596
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Vitpose
⭐
950
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation"
Voxformer
⭐
937
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
Dat
⭐
649
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Vision Centric Bev Perception
⭐
541
Vision-Centric BEV Perception: A Survey
Fastervit
⭐
539
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Openmixup
⭐
538
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Gcvit
⭐
399
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
Geoseg
⭐
382
UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also includes other vision transformers and CNNs for satellite, aerial, and UAV image segmentation.
Vitae Transformer Remote Sensing
⭐
379
A comprehensive list of our research works related to remote sensing, including papers, codes, and citations. Note: The repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining" has been moved to: https://github.com/ViTAE-Transformer/RSP
Libai
⭐
371
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Hipt
⭐
341
Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)
Gfnet
⭐
310
[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
Swin2sr
⭐
303
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop, ECCV 2022. Try it out (over 1.8M runs): https://replicate.com/mv-lab/swin2sr
Transmorph_transformer_for_medical_image_registration
⭐
302
TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)
Crossformer
⭐
302
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
Actionformer_release
⭐
285
Code release for ActionFormer (ECCV 2022)
Alphaclip
⭐
273
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Vit Explain
⭐
260
Explainability for Vision Transformers
Passl
⭐
234
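PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as fundamental vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PV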
Semantic Segmentation
⭐
228
SOTA Semantic Segmentation Models in PyTorch
Vitae Transformer Matting
⭐
223
A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, including papers, codes, datasets, demos, and citations. Note: The repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving" has been moved to: https://github.com/ViTAE-Transformer/P3M-Net
Visual_token_matching
⭐
213
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
Sam Detr
⭐
211
[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation
Vt Unet
⭐
210
[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation
V2x Vit
⭐
205
[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"
Interpretdl
⭐
203
InterpretDL: Interpretation of Deep Learning Models, a model interpretability algorithm library based on PaddlePaddle (飞桨).
Seq2seqsharp
⭐
188
Seq2SeqSharp is a fast and flexible tensor-based deep neural network framework written in .NET (C#). Its features include automatic differentiation, several network types (Transformer, LSTM, BiLSTM and so on), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), a multimodal model for text and images, and so on.
Vitae Transformer
⭐
187
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
Machinelearning Ai
⭐
184
This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).
Awesome Mim
⭐
178
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Maniqa
⭐
159
[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Visualization
⭐
142
A collection of visualization functions
Lamda Pilot
⭐
134
🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox
Maxvit
⭐
123
PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].
Swin Transformer V2
⭐
106
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].
Awesome Transformer In Medical Imaging
⭐
103
[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Tutorial
⭐
94
Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)
Combining Efficientnet And Vision Transformers For Video Deepfake Detection
⭐
81
Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.
Image Classification Pytorch
⭐
80
Learning and Building Convolutional Neural Networks using PyTorch
Qformer
⭐
73
The official repo for [Arxiv'23] "Vision Transformer with Quadrangle Attention"
Simpool
⭐
72
This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"
Resvit
⭐
70
Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis
Sota Backbones
⭐
64
A collection of SOTA Image Classification Models in PyTorch
Docentr
⭐
62
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
Vitae Transformer Scene Text Detection
⭐
51
A comprehensive list of our research works related to scene text detection and spotting, including papers, codes, and citations. Note: The official repo for [IJCV'22] "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection" has been moved to: https://github.com/ViTAE-Transformer/I3CL
Self Supervised Vit Path
⭐
50
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)
Evo Vit
⭐
46
Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
P3m Net
⭐
38
The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"
Oreilly Hands On Transformers
⭐
37
Hands on NLP and Computer Vision with Transformers
G Universal Clip
⭐
33
4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022
Vit Finetune
⭐
32
Fine-tuning Vision Transformers on various classification datasets
Soft Mixture Of Experts
⭐
30
PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)
Vision Diffmask
⭐
26
Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.
Ats
⭐
25
Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)
Polsarformer
⭐
22
This code is for the paper "Local Window Attention Transformer for Polarimetric SAR Image Classification" that is published in the IEEE Geoscience and Remote Sensing Letters journal.
Kaggle_leaf_disease_classification
⭐
22
Cassava leaf disease classification with CNNs and Transformers (top-1% Kaggle solution)
Sdvit
⭐
22
Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)
Transoar
⭐
22
A 3D medical Detection Transformer library. Papers accepted @ MIDL & MELBA.
Mdvit
⭐
22
[MICCAI 2023] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets (an official implementation)
Adversarial Automixup
⭐
21
Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)
Vitas
⭐
21
Code for ViTAS_Vision Transformer Architecture Search
Deep Hash Distillation
⭐
20
Deep Hash Distillation for Image Retrieval - ECCV 2022
Celebfaces_attributes_classification
⭐
19
This repository is related to a project for the course Introduction to Numerical Imaging (i.e., Introduction à l'Imagerie Numérique in French), given by the MVA Masters program at ENS-Paris Saclay. It was built entirely from scratch and contains PyTorch Lightning code to train and then use a neural network for image classification. We used it to create a classifier for semantic attributes of faces with the CelebA-HQ dataset.
Deepvision
⭐
18
PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), ResNetV2, EfficientNetV2, NeRF, SegFormer, MixTransformer, (planned...) DeepLabV3+, ConvNeXtV2, YOLO, etc.
Vit Pytorch
⭐
17
PyTorch implementation of the vision transformer
Lithography Hotspot Detection
⭐
17
Detected Hotspots in the Lithography process using Vision Transformers, Convolution Neural Networks and Artificial Neural Networks, and compared the results obtained using ANNs & CNNs
Doprompt
⭐
17
Official implementation of PCS in the paper "Prompt Vision Transformer for Domain Generalization"
Vitasd
⭐
16
[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis
Buildformer
⭐
15
Building Extraction from remote sensing image using Vision Transformer, IEEE Transactions on Geoscience and Remote Sensing, 2022
Documentclip
⭐
14
Ssl Ocr
⭐
12
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
Object Depth Detection Based Hybrid Distance Estimator
⭐
10
We use our VDEmodel to predict the distance between cars using deep learning.
Swindepth
⭐
10
"SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network" (ICRA 2023)
Avit
⭐
9
[MICCAI ISIC Workshop 2023] AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets (an official implementation)
Mimdepth
⭐
9
Image Masking for Robust Self-Supervised Monocular Depth Estimation, accepted at ICRA 2023
V1t
⭐
9
Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"
Droppos
⭐
8
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Lung Sound Classification
⭐
8
This project is about classifying respiratory sounds using Attention and Vision Transformer on the ICBHI dataset.
Sm Vit
⭐
8
Official repository for the paper "Salient Mask-Guided Vision Transformer for Fine-Grained Classification" (VISIGRAPP '23)
Qtclassification
⭐
7
A lightweight and extensible toolbox for image classification
Dss
⭐
7
📳 From training and deployment of ViTs to development of real-time cross-platform mobile apps!
Vit_pytorch
⭐
7
A PyTorch Implementation of ViT (Vision Transformer)
Hierattn
⭐
7
Official implementation of Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention
Face To Bmi Vit
⭐
6
Predict the Body Mass Index with one image of a human face, with state-of-the-art results.
Roodmri
⭐
6
AICONSlab's DL benchmarking platform to OOD data in MRI
Gsoc 22 Tensorflow Resources And Notebooks
⭐
5
GSoC'22 @ TensorFlow Notebooks, Code and much more
Copyright 2018-2024 Awesome Open Source. All rights reserved.