Awesome Open Source

Search results for pytorch vision transformer

86 search results found

Mmdetection ⭐ 26,886

OpenMMLab Detection Toolbox and Benchmark

Latex Ocr ⭐ 8,088

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Transformers Tutorials ⭐ 7,486

This repository contains demos I made with the Transformers library by HuggingFace.

Efficient Ai Backbones ⭐ 3,770

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Mmpretrain ⭐ 3,114

OpenMMLab Pre-training Toolbox and Benchmark

Easycv ⭐ 1,614

An all-in-one toolkit for computer vision

Vitpose ⭐ 950

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation"

Videomae ⭐ 864

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

How Do Vits Work ⭐ 571

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Openmixup ⭐ 538

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Transformer_for_medical_image_analysis ⭐ 412

A collection of papers about Transformer in the field of medical image analysis.

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)

Crossformer ⭐ 302

The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Vit Explain ⭐ 260

Explainability for Vision Transformers

Semantic Segmentation ⭐ 228

SOTA Semantic Segmentation Models in PyTorch

Sam Detr ⭐ 211

[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation

Vt Unet ⭐ 210

[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

V2x Vit ⭐ 205

[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"

[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"

Machinelearning Ai ⭐ 184

This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).

Vision Transformer Pytorch ⭐ 164

Pytorch version of Vision Transformer (ViT) with pretrained models. This is part of CASL (https://casl-project.github.io/) and ASYML project.

[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer

'NKD and USKD' (ICCV 2023) and 'ViTKD'

Vformer ⭐ 138

A modular PyTorch library for vision transformer models

Lamda Pilot ⭐ 134

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

Greenmim ⭐ 129

[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].

Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)

Swin Transformer V2 ⭐ 106

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].

Moganet ⭐ 100

[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network

Aiatrack ⭐ 90

[ECCV'22] The official PyTorch implementation of our ECCV 2022 paper: "AiATrack: Attention in Attention for Transformer Visual Tracking".

Ceit Pytorch ⭐ 84

Implementation of Convolutional enhanced image Transformer

(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"

Combining Efficientnet And Vision Transformers For Video Deepfake Detection ⭐ 81

Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.

Image Classification Pytorch ⭐ 80

Learning and Building Convolutional Neural Networks using PyTorch

Pytorch Cifar Model Hub ⭐ 65

Implementation of Conv-based and Vit-based networks designed for CIFAR.

Sota Backbones ⭐ 64

A collection of SOTA Image Classification Models in PyTorch

[CVPR'23] The official PyTorch implementation of our CVPR 2023 paper: "Generalized Relation Modeling for Transformer Tracking".

Vitae Transformer Scene Text Detection ⭐ 51

A comprehensive list of our research works related to scene text detection and spotting, including papers, codes, and citations. Note: The official repo for [IJCV'22] "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection" has been moved to: https://github.com/ViTAE-Transformer/I3CL

Self Supervised Vit Path ⭐ 50

Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)

Cvt2distilgpt2 ⭐ 46

Improving Chest X-Ray Report Generation by Leveraging Warm-Starting

Ecoformer ⭐ 42

[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"

Point2vec ⭐ 40

Self-Supervised Representation Learning on Point Clouds (GCPR 2023 | T4V Workshop @ CVPR 2023)

The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"

Efficient Attention ⭐ 37

[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

Oreilly Hands On Transformers ⭐ 37

Hands on NLP and Computer Vision with Transformers

G Universal Clip ⭐ 33

4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022

Vit Finetune ⭐ 32

Fine-tuning Vision Transformers on various classification datasets

Code of TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation, WACV 2023

Soft Mixture Of Experts ⭐ 30

PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)

Vision Diffmask ⭐ 26

Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.

Flexivit ⭐ 26

PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes

Rethinkvsralignment ⭐ 25

(NIPS 2022) Rethinking Alignment in Video Super-Resolution Transformers

Mintime Multi Identity Size Invariant Timesformer For Video Deepfake Detection ⭐ 25

Code for Video Deepfake Detector from "MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection", pre-print available on Arxiv

Fpvt_bmvc22 ⭐ 25

Code of Pyramid Vision Transformer at BMVC 2022

Retnet_vit Rmt ⭐ 25

An end-to-end masked contrastive video-and-language pre-training framework

Regionproxy ⭐ 23

[CVPR22] Official codebase of Semantic Segmentation by Early Region Proxy.

Kaggle_leaf_disease_classification ⭐ 22

Cassava leaf disease classification with CNNs and Transformers (top-1% Kaggle solution)

Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)

Transoar ⭐ 22

A 3D medical Detection Transformer library. Papers accepted @ MIDL & MELBA.

[MICCAI 2023] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets (an official implementation)

Adversarial Automixup ⭐ 21

Official PyTorch (MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)

Deep Hash Distillation ⭐ 20

Deep Hash Distillation for Image Retrieval - ECCV 2022

This repo contains the code for the CVPR 2023 paper: "CrOC: Cross-View Online Clustering for Dense Visual Representation Learning".

Deepvision ⭐ 18

PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), ResNetV2, EfficientNetV2, NeRF, SegFormer, MixTransformer, (planned...) DeepLabV3+, ConvNeXtV2, YOLO, etc.

Vit Pytorch ⭐ 17

PyTorch implementation of the vision transformer

Protopformer ⭐ 17

ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition

Buildformer ⭐ 15

Building Extraction from remote sensing image using Vision Transformer, IEEE Transactions on Geoscience and Remote Sensing, 2022

Icml 2023 Route Interpret Repeat ⭐ 12

Official repository of ICML 2023 paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

Transfusion Pose ⭐ 12

[BMVC2021] "TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation"

Vicinity Vision Transformer ⭐ 12

[TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".

[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

Object Depth Detection Based Hybrid Distance Estimator ⭐ 10

We use our VDE model. Our goal is to predict the distance between cars using deep learning.

Swindepth ⭐ 10

"SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network" (ICRA 2023)

[MICCAI ISIC Workshop 2023] AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets (an official implementation)

Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"

Rtdetr Pytorch ⭐ 8

This repository provides a PyTorch implementation of RT-DeTR, a state-of-the-art Realtime Detection Transformer for object detection tasks.

Vit_pytorch ⭐ 7

A PyTorch Implementation of ViT (Vision Transformer)

Qtclassification ⭐ 7

A lightweight and extensible toolbox for image classification

Gan Augmented Pet Classifier ⭐ 7

Towards Fine-grained Image Classification with Generative Adversarial Networks and Facial Landmark Detection - Paper Implementation and Supplementary Materials

Vision_transformer_pytorch ⭐ 6

Simple Implementation of Vision Transformer (https://openreview.net/pdf?id=YicbFdNTTy)

Face To Bmi Vit ⭐ 6

Predict the Body Mass Index with one image of a human face, with state-of-the-art results.

Copyright 2018-2024 Awesome Open Source. All rights reserved.