Awesome Open Source
Search results for multimodal vqa
9 search results found
Mmf
⭐
5,414
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Interngpt
⭐
2,976
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
Awesome Visual Question Answering
⭐
541
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
Lrv Instruction
⭐
160
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Vlmevalkit
⭐
137
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks
Sutd Trafficqa
⭐
35
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Dual Mfa Vqa
⭐
33
Co-attending Regions and Detections for VQA.
Omnifusion
⭐
16
OmniFusion: a multimodal model to communicate using text and images
Mplug
⭐
15
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Videonavqa
⭐
14
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Awesome Llm For Robotics Reasoning
⭐
13
LLM for robotics reasoning toward AGI / Awesome repos&surveys / Chain of Thought / LLM / Prompt engineering / Reasoning / Robot / Agent / Planning / Reinforcement Learning / Created by @shure-dev / Check Wiki
Maverics
⭐
8
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).
Related Searches
Python Multimodal (220)
Python Vqa (219)
Deep Learning Multimodal (102)
Artificial Intelligence Multimodal (84)
Pytorch Vqa (61)
Attention Vqa (61)
Dataset Vqa (57)
Jupyter Notebook Vqa (54)
Vqa Visual Question Answering (46)
Deep Learning Vqa (44)
Copyright 2018-2024 Awesome Open Source. All rights reserved.