Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for vision and language multimodal deep learning
multimodal-deep-learning
x
vision-and-language
x
11 search results found
Lavis
⭐
7,917
LAVIS - A One-stop Library for Language-Vision Intelligence
Awesome Vision Language Pretraining Papers
⭐
724
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
Awesome Vision And Language Pre Training
⭐
176
Recent Advances in Vision and Language Pre-training (VLP)
Pseudo Q
⭐
116
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
Awesome Vision Language Models For Earth Observation
⭐
105
A curated list of awesome vision and language resources for earth observation.
Hateful_memes Hate_detectron
⭐
41
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge. https://arxiv.org/abs/2012.12975
Visual Spatial Reasoning
⭐
38
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
Vote2cap Detr
⭐
22
Code release for ''End-to-End 3D Dense Captioning with Vote2Cap-DETR'' (CVPR2023)
C3vqg Official
⭐
14
Code for the paper "C3VQG: Category Consistent Cyclic Visual Question Generation".
Gvcci
⭐
7
[IROS 2023] GVCCI: Lifelong Learning of Visual Grounding for Language-Guided Robotic Manipulation
Vision Language Modelling Series
⭐
7
Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations
Related Searches
Python Vision And Language (69)
Python Multimodal Deep Learning (60)
Pytorch Multimodal Deep Learning (37)
Pytorch Vision And Language (34)
1-11 of 11 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.