Awesome Open Source

Programming Languages

Search results for python vision transformer

vision-transformer x

166 search results found

Mmdetection ⭐ 26,886

OpenMMLab Detection Toolbox and Benchmark

Latex Ocr ⭐ 8,088

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Towhee ⭐ 2,903

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Scenic ⭐ 2,733

Scenic: A Jax Library for Computer Vision Research and Beyond

Easycv ⭐ 1,614

An all-in-one toolkit for computer vision

Cream ⭐ 1,446

This is a collection of our NAS and Vision Transformer work.

EVA Series: Visual Representation Fantasies from BAAI

Vit Adapter ⭐ 1,003

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Vitpose ⭐ 950

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation"

VRT: A Video Restoration Transformer (official repository)

Videomae ⭐ 864

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Internvideo ⭐ 736

InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)

Efficientvit ⭐ 732

EfficientViT is a new family of vision models for efficient high-resolution vision.

One Peace ⭐ 714

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Awesome Attention Mechanism In Cv ⭐ 686

Awesome List of Attention Modules and Plug&Play Modules in Computer Vision

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Imagenet21k ⭐ 576

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

How Do Vits Work ⭐ 571

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Fastervit ⭐ 539

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Openmixup ⭐ 538

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

[ICCV 2023] You Only Look at One Partial Sequence

Swin2sr ⭐ 303

Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 1.8 M runs https://replicate.com/mv-lab/swin2sr

Crossformer ⭐ 302

The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI

Transmorph_transformer_for_medical_image_registration ⭐ 302

TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Actionformer_release ⭐ 285

Code release for ActionFormer (ECCV 2022)

Vit Explain ⭐ 260

Explainability for Vision Transformers

PASSL包含 SimCLR，MoCo v1/v2，BYOL，CLIP，PixPro，simsiam, SwAV, BEiT，MAE 等图像自监督算法以及 Vision Transformer，DEiT，Swin Transformer，CvT，T2T-ViT，MLP-Mixer，XCiT，ConvNeXt，PV 等基础视觉算法

Dehazeformer ⭐ 229

[IEEE TIP] Vision Transformers for Single Image Dehazing

Semantic Segmentation ⭐ 228

SOTA Semantic Segmentation Models in PyTorch

Pytorch Vit ⭐ 217

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Visual_token_matching ⭐ 213

[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

Sam Detr ⭐ 211

[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation

V2x Vit ⭐ 205

[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"

Interpretdl ⭐ 203

InterpretDL: Interpretation of Deep Learning Models，基于『飞桨』的模型可解释性算法库。

This is an official implementation for "Contextual Transformer Networks for Visual Recognition".

Vitae Transformer ⭐ 187

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

Vit V Net_for_3d_image_registration_pytorch ⭐ 185

Vision Transformer for 3D medical image registration (Pytorch).

Awesome Mim ⭐ 178

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

MPViT:Multi-Path Vision Transformer for Dense Prediction in CVPR 2022

Mobilevit Pytorch ⭐ 152

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer

'NKD and USKD' (ICCV 2023) and 'ViTKD'

Lm4visualencoding ⭐ 144

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

universal visual model trained on LAION-400M

Visualization ⭐ 142

a collection of visualization function

Vformer ⭐ 138

A modular PyTorch library for vision transformer models

Lamda Pilot ⭐ 134

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

Greenmim ⭐ 129

[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].

KoCLIP: Korean port of OpenAI CLIP, in Flax

Vts Drloc ⭐ 116

NeurIPS 2021, Official codes for "Efficient Training of Visual Transformers with Small Datasets".

Swin Transformer V2 ⭐ 106

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].

Adaptformer ⭐ 101

[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"

Tutorial ⭐ 94

Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)

Aiatrack ⭐ 90

[ECCV'22] The official PyTorch implementation of our ECCV 2022 paper: "AiATrack: Attention in Attention for Transformer Visual Tracking".

[TPAMI 2024] This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Boosting Crowd Counting Via Multifaceted Attention ⭐ 87

Official Implement of CVPR 2022 paper 'Boosting Crowd Counting via Multifaceted Attention'

Fgvc Pim ⭐ 86

Pytorch implementation for "A Novel Plug-in Module for Fine-Grained Visual Classification". fine-grained visual classification task.

(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"

🥶Vilio: State-of-the-art VL models in PyTorch & PaddlePaddle

Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation

Image Classification Pytorch ⭐ 80

Learning and Building Convolutional Neural Networks using PyTorch

Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

The official repo for [Arxiv'23] "Vision Transformer with Quadrangle Attention"

This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"

Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis

Patchmix ⭐ 68

The official implementation of paper: "Inter-Instance Similarity Modeling for Contrastive Learning"

Sota Backbones ⭐ 64

A collection of SOTA Image Classification Models in PyTorch

Using pretrained encoder and language models to generate captions from multimedia inputs.

Vit Anti Oversmoothing ⭐ 62

[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang

Rel_pose ⭐ 59

Official Repository for the 3D 2022 paper "The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs"

[CVPR'23] The official PyTorch implementation of our CVPR 2023 paper: "Generalized Relation Modeling for Transformer Tracking".

CounTR: Transformer-based Generalised Visual Counting

Official PyTorch implementation of the paper: Flow Matching in Latent Space

[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.

Vmformer ⭐ 54

[Preprint] VMFormer: End-to-End Video Matting with Transformer

[ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection

Icolorit ⭐ 48

Official PyTorch implementation of "iColoriT: Towards Propagating Local Hint to the Right Region in Interactive Colorization by Leveraging Vision Transformer." (WACV 2023)

Cvt2distilgpt2 ⭐ 46

Improving Chest X-Ray Report Generation by Leveraging Warm-Starting

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

The official implementation for NeurIPS 2022 Spotlight Neural Shape Deformation Priors

Ecoformer ⭐ 42

[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"

The official implementation of "Asymmetric Patch Sampling for Contrastive Learning"

Video Deblurring with Transformer

Point2vec ⭐ 40

Self-Supervised Representation Learning on Point Clouds (GCPR 2023 | T4V Workshop @ CVPR 2023)

[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu, Zhangyang Wang

The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"

Efficient Attention ⭐ 37

[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Sparseformer ⭐ 36

the official implementation of SparseFormer

This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"

Related Searches

Python Django (28,897)

Python Machine Learning (20,195)

Python Deep Learning (18,839)

Python Flask (17,643)

Python Jupyter Notebook (17,115)

Python Pytorch (16,110)

Python Dataset (14,793)

Python Tensorflow (14,278)

Python Docker (14,113)

Python Command Line (13,351)

1-100 of 166 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.