Awesome Open Source
Search results for python vision transformer
166 search results found
Mmdetection
⭐
26,886
OpenMMLab Detection Toolbox and Benchmark
Latex Ocr
⭐
8,088
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Scenic
⭐
2,733
Scenic: A Jax Library for Computer Vision Research and Beyond
Easycv
⭐
1,614
An all-in-one toolkit for computer vision
Cream
⭐
1,446
This is a collection of our NAS and Vision Transformer work.
Eva
⭐
1,430
EVA Series: Visual Representation Fantasies from BAAI
Vit Adapter
⭐
1,003
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Vitpose
⭐
950
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation"
Vrt
⭐
882
VRT: A Video Restoration Transformer (official repository)
Videomae
⭐
864
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Internvideo
⭐
736
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
Efficientvit
⭐
732
EfficientViT is a new family of vision models for efficient high-resolution vision.
One Peace
⭐
714
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Awesome Attention Mechanism In Cv
⭐
686
Awesome List of Attention Modules and Plug&Play Modules in Computer Vision
Dat
⭐
649
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Imagenet21k
⭐
576
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses" (NeurIPS, 2021) paper
How Do Vits Work
⭐
571
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
Fastervit
⭐
539
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Openmixup
⭐
538
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Parseq
⭐
429
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Gcvit
⭐
399
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
Geoseg
⭐
382
UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.
Libai
⭐
371
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Mimdet
⭐
314
[ICCV 2023] You Only Look at One Partial Sequence
Swin2sr
⭐
303
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 1.8 M runs https://replicate.com/mv-lab/swin2sr
Crossformer
⭐
302
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
Transmorph_transformer_for_medical_image_registration
⭐
302
TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)
Hornet
⭐
296
[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
Actionformer_release
⭐
285
Code release for ActionFormer (ECCV 2022)
Vit Explain
⭐
260
Explainability for Vision Transformers
Passl
⭐
234
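PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PV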
Dehazeformer
⭐
229
[IEEE TIP] Vision Transformers for Single Image Dehazing
Semantic Segmentation
⭐
228
SOTA Semantic Segmentation Models in PyTorch
Pytorch Vit
⭐
217
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Visual_token_matching
⭐
213
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
Sam Detr
⭐
211
[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation
V2x Vit
⭐
205
[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"
Interpretdl
⭐
203
InterpretDL: Interpretation of Deep Learning Models, a model interpretability algorithm library based on PaddlePaddle (飞桨).
Cotnet
⭐
201
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Vitae Transformer
⭐
187
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
Vit V Net_for_3d_image_registration_pytorch
⭐
185
Vision Transformer for 3D medical image registration (Pytorch).
Awesome Mim
⭐
178
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Maniqa
⭐
159
[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Mpvit
⭐
158
MPViT: Multi-Path Vision Transformer for Dense Prediction in CVPR 2022
Mobilevit Pytorch
⭐
152
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".
Fq Vit
⭐
151
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
Cls_kd
⭐
147
'NKD and USKD' (ICCV 2023) and 'ViTKD'
Lm4visualencoding
⭐
144
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"
Unicom
⭐
142
universal visual model trained on LAION-400M
Visualization
⭐
142
A collection of visualization functions
Vformer
⭐
138
A modular PyTorch library for vision transformer models
Lamda Pilot
⭐
134
🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox
Greenmim
⭐
129
[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.
Maxvit
⭐
123
PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].
Koclip
⭐
117
KoCLIP: Korean port of OpenAI CLIP, in Flax
Vts Drloc
⭐
116
NeurIPS 2021, Official codes for "Efficient Training of Visual Transformers with Small Datasets".
Swin Transformer V2
⭐
106
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].
Adaptformer
⭐
101
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Tutorial
⭐
94
Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)
Aiatrack
⭐
90
[ECCV'22] The official PyTorch implementation of our ECCV 2022 paper: "AiATrack: Attention in Attention for Transformer Visual Tracking".
Spvit
⭐
89
[TPAMI 2024] This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.
Boosting Crowd Counting Via Multifaceted Attention
⭐
87
Official implementation of the CVPR 2022 paper 'Boosting Crowd Counting via Multifaceted Attention'
Fgvc Pim
⭐
86
Pytorch implementation for "A Novel Plug-in Module for Fine-Grained Visual Classification", targeting the fine-grained visual classification task.
Mediar
⭐
83
(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"
Vilio
⭐
82
🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle
Cf Vit
⭐
82
Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"
Hit Gan
⭐
80
Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).
Gpvit
⭐
80
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Image Classification Pytorch
⭐
80
Learning and Building Convolutional Neural Networks using PyTorch
Rt X
⭐
77
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Qformer
⭐
73
The official repo for [Arxiv'23] "Vision Transformer with Quadrangle Attention"
Simpool
⭐
72
This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"
Resvit
⭐
70
Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis
Patchmix
⭐
68
The official implementation of paper: "Inter-Instance Similarity Modeling for Contrastive Learning"
Sota Backbones
⭐
64
A collection of SOTA Image Classification Models in PyTorch
Clipcap
⭐
64
Using pretrained encoder and language models to generate captions from multimedia inputs.
Vit Anti Oversmoothing
⭐
62
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang
Rel_pose
⭐
59
Official Repository for the 3D 2022 paper "The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs"
Grm
⭐
59
[CVPR'23] The official PyTorch implementation of our CVPR 2023 paper: "Generalized Relation Modeling for Transformer Tracking".
Countr
⭐
59
CounTR: Transformer-based Generalised Visual Counting
Lfm
⭐
55
Official PyTorch implementation of the paper: Flow Matching in Latent Space
Upop
⭐
54
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Vmformer
⭐
54
[Preprint] VMFormer: End-to-End Video Matting with Transformer
Imted
⭐
53
[ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Icolorit
⭐
48
Official PyTorch implementation of "iColoriT: Towards Propagating Local Hint to the Right Region in Interactive Colorization by Leveraging Vision Transformer." (WACV 2023)
Cvt2distilgpt2
⭐
46
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Evo Vit
⭐
46
Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
Sret
⭐
44
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
Nsdp
⭐
43
The official implementation for NeurIPS 2022 Spotlight Neural Shape Deformation Priors
Ecoformer
⭐
42
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
Aps
⭐
41
The official implementation of "Asymmetric Patch Sampling for Contrastive Learning"
Vdtr
⭐
41
Video Deblurring with Transformer
Point2vec
⭐
40
Self-Supervised Representation Learning on Point Clouds (GCPR 2023 | T4V Workshop @ CVPR 2023)
Uvc
⭐
40
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu, Zhangyang Wang
P3m Net
⭐
38
The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"
Efficient Attention
⭐
37
[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling
Mvd
⭐
37
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Sparseformer
⭐
36
the official implementation of SparseFormer
Cae
⭐
35
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
1-100 of 166 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.