Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for multimodal learning visual question answering
multimodal-learning
x
visual-question-answering
x
7 search results found
Awesome Multimodal In Medical Imaging
⭐
207
A collection of resources on applications of multi-modal learning in medical imaging.
Mmmu
⭐
167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Frozenbilm
⭐
120
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Just Ask
⭐
101
[ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Upop
⭐
54
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Crossget
⭐
13
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
Vqa
⭐
6
Visual Question Answering System
1-7 of 7 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.