Awesome Open Source

Search results for deep learning vision transformer

88 search results found

Latex Ocr ⭐ 8,088

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Awesome Transformer Attention ⭐ 3,895

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

Mmpretrain ⭐ 3,172

OpenMMLab Pre-training Toolbox and Benchmark

Scenic ⭐ 2,733

Scenic: A Jax Library for Computer Vision Research and Beyond

Transformer Explainability ⭐ 1,596

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Vitpose ⭐ 950

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [Arxiv'22] "ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation"

Voxformer ⭐ 937

Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Vision Centric Bev Perception ⭐ 541

Vision-Centric BEV Perception: A Survey

Fastervit ⭐ 539

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Openmixup ⭐ 538

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

Vitae Transformer Remote Sensing ⭐ 379

A comprehensive list of our research works related to remote sensing, including papers, codes, and citations. Note: The repo for [TGRS'22] "An Empirical Study of Remote Sensing Pretraining" has been moved to: https://github.com/ViTAE-Transformer/RSP

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral)

[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification

Swin2sr ⭐ 303

Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop, ECCV 2022. Try it out (over 1.8 M runs): https://replicate.com/mv-lab/swin2sr

Transmorph_transformer_for_medical_image_registration ⭐ 302

TransMorph: Transformer for Unsupervised Medical Image Registration (PyTorch)

Crossformer ⭐ 302

The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI

Actionformer_release ⭐ 285

Code release for ActionFormer (ECCV 2022)

Alphaclip ⭐ 273

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Vit Explain ⭐ 260

Explainability for Vision Transformers

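PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PV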

Semantic Segmentation ⭐ 228

SOTA Semantic Segmentation Models in PyTorch

Vitae Transformer Matting ⭐ 223

A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, including papers, codes, datasets, demos, and citations. Note: The repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving" has been moved to: https://github.com/ViTAE-Transformer/P3M-Net

Visual_token_matching ⭐ 213

[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

Sam Detr ⭐ 211

[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation

Vt Unet ⭐ 210

[MICCAI2022] This is an official PyTorch implementation for A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

V2x Vit ⭐ 205

[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"

Interpretdl ⭐ 203

InterpretDL: Interpretation of Deep Learning Models, a model interpretability algorithm library based on PaddlePaddle (飞桨).

Seq2seqsharp ⭐ 188

Seq2SeqSharp is a fast and flexible tensor-based deep neural network framework written in .NET (C#). Its highlighted features include automatic differentiation, several network types (Transformer, LSTM, BiLSTM, and so on), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), and multimodal models for text and images.

Vitae Transformer ⭐ 187

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

Machinelearning Ai ⭐ 184

This repository contains all the work that I regularly did and studied from Medium blogs, several research papers, and other Repos (related/unrelated to the research papers).

Awesome Mim ⭐ 178

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

Visualization ⭐ 142

A collection of visualization functions

Lamda Pilot ⭐ 134

🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox

PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022].

Swin Transformer V2 ⭐ 106

PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].

Awesome Transformer In Medical Imaging ⭐ 103

[MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

Tutorial ⭐ 94

Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)

Combining Efficientnet And Vision Transformers For Video Deepfake Detection ⭐ 81

Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.

Image Classification Pytorch ⭐ 80

Learning and Building Convolutional Neural Networks using PyTorch

The official repo for [Arxiv'23] "Vision Transformer with Quadrangle Attention"

This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"

Official Implementation of ResViT: Residual Vision Transformers for Multi-modal Medical Image Synthesis

Sota Backbones ⭐ 64

A collection of SOTA Image Classification Models in PyTorch

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Vitae Transformer Scene Text Detection ⭐ 51

A comprehensive list of our research works related to scene text detection and spotting, including papers, codes, and citations. Note: The official repo for [IJCV'22] "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection" has been moved to: https://github.com/ViTAE-Transformer/I3CL

Self Supervised Vit Path ⭐ 50

Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)

Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

The official repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving"

Oreilly Hands On Transformers ⭐ 37

Hands on NLP and Computer Vision with Transformers

G Universal Clip ⭐ 33

4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022

Vit Finetune ⭐ 32

Fine-tuning Vision Transformers on various classification datasets

Soft Mixture Of Experts ⭐ 30

PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)

Vision Diffmask ⭐ 26

Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.

Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)

Polsarformer ⭐ 22

This code is for the paper "Local Window Attention Transformer for Polarimetric SAR Image Classification" that is published in the IEEE Geoscience and Remote Sensing Letters journal.

Kaggle_leaf_disease_classification ⭐ 22

Cassava leaf disease classification with CNNs and Transformers (top-1% Kaggle solution)

Official repository for "Self-Distilled Vision Transformer for Domain Generalization" (ACCV-2022 ORAL)

Transoar ⭐ 22

A 3D medical Detection Transformer library. Papers accepted @ MIDL & MELBA.

[MICCAI 2023] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets (an official implementation)

Adversarial Automixup ⭐ 21

Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)

Code for ViTAS_Vision Transformer Architecture Search

Deep Hash Distillation ⭐ 20

Deep Hash Distillation for Image Retrieval - ECCV 2022

Celebfaces_attributes_classification ⭐ 19

This repository is related to a project for the Introduction to Numerical Imaging course (i.e., Introduction à l'Imagerie Numérique in French), given by the MVA Masters program at ENS-Paris Saclay. It was built entirely from scratch and contains PyTorch Lightning code to train and then use a neural network for image classification. We used it to create a classifier for semantic attribute classification of faces with the CelebA-HQ dataset.

Deepvision ⭐ 18

PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), ResNetV2, EfficientNetV2, NeRF, SegFormer, MixTransformer, (planned...) DeepLabV3+, ConvNeXtV2, YOLO, etc.

Vit Pytorch ⭐ 17

PyTorch implementation of the vision transformer

Lithography Hotspot Detection ⭐ 17

Detected Hotspots in the Lithography process using Vision Transformers, Convolution Neural Networks and Artificial Neural Networks, and compared the results obtained using ANNs & CNNs

Doprompt ⭐ 17

Official implementation of PCS in essay "Prompt Vision Transformer for Domain Generalization"

[ICASSP 2023] Official Implementation of ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder Facial Diagnosis

Buildformer ⭐ 15

Building Extraction from remote sensing image using Vision Transformer, IEEE Transactions on Geoscience and Remote Sensing, 2022

Documentclip ⭐ 14

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023

Object Depth Detection Based Hybrid Distance Estimator ⭐ 10

We use our VDEmodel. Our purpose is to predict the distance between cars using deep learning.

Swindepth ⭐ 10

"SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network" (ICRA 2023)

[MICCAI ISIC Workshop 2023] AViT: Adapting Vision Transformers for Small Skin Lesion Segmentation Datasets (an official implementation)

Image Masking for Robust Self-Supervised Monocular Depth Estimation, accepted at ICRA 2023

Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"

[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

Lung Sound Classification ⭐ 8

This project is about classifying respiratory sounds using Attention and Vision Transformer on the ICBHI dataset.

Official repository for the paper "Salient Mask-Guided Vision Transformer for Fine-Grained Classification" (VISIGRAPP '23)

Qtclassification ⭐ 7

A lightweight and extensible toolbox for image classification

📳 From training and deployment of ViTs to development of real-time cross-platform mobile apps!

Vit_pytorch ⭐ 7

A PyTorch Implementation of ViT (Vision Transformer)

Official implementation of Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention

Face To Bmi Vit ⭐ 6

Predict the Body Mass Index with one image of a human face, with state-of-the-art results.

AICONSlab's DL benchmarking platform to OOD data in MRI

Gsoc 22 Tensorflow Resources And Notebooks ⭐ 5

GSoC'22 @ TensorFlow Notebooks, Code and much more
