Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pytorch vision and language
pytorch
x
vision-and-language
x
32 search results found
Vl Bert
⭐
680
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Clipbert
⭐
649
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Calvin
⭐
210
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Pytorch_violet
⭐
130
A PyTorch implementation of VIOLET
Hero
⭐
125
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Vldet
⭐
117
[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
Pseudo Q
⭐
116
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Regretful Agent
⭐
116
PyTorch code for CVPR 2019 paper: The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation
Selfmonitoring Agent
⭐
101
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
Ofasys
⭐
79
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models
Vl_adapter
⭐
75
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
Lightningdot
⭐
65
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
Robo Vln
⭐
56
Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
Hulc
⭐
52
Hierarchical Universal Language Conditioned Policies
Sugar Crepe
⭐
40
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
Pytorch_empirical Mvm
⭐
30
A PyTorch implementation of EmpiricalMVM
Vognet Pytorch
⭐
28
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
Lang2seg
⭐
25
Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019
Cross Modal Adapter
⭐
24
[arXiv] Cross-Modal Adapter for Text-Video Retrieval
Mac
⭐
24
An end-to-end masked contrastive video-and-language pre-training framework
Trar Vqa
⭐
23
This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task
Hulc2
⭐
22
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
Vote2cap Detr
⭐
22
Code release for ''End-to-End 3D Dense Captioning with Vote2Cap-DETR'' (CVPR2023)
Pytorch_ldast
⭐
19
A PyTorch implementation of LDAST
Cyclical Visual Captioning
⭐
18
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
Hero_video_feature_extractor
⭐
15
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Pytorch_tvc
⭐
14
A PyTorch implementation of TVC
Spacap3d
⭐
9
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
Vlpd
⭐
8
Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"
Mozuma
⭐
7
Model Zoo for Multimedia Applications
Refcontrast
⭐
5
Understanding Synonymous Referring Expressions via Contrastive Features
Naq
⭐
5
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.
Related Searches
Python Pytorch (18,107)
Deep Learning Pytorch (7,533)
Jupyter Notebook Pytorch (4,892)
Machine Learning Pytorch (2,934)
Dataset Pytorch (1,847)
Pytorch Neural Network (1,631)
Pytorch Computer Vision (1,607)
Pytorch Natural Language Processing (1,426)
Pytorch Neural (1,217)
Pytorch Generative Adversarial Network (1,199)
1-32 of 32 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.