| facebookresearch/mmf |
5,357 |
|
0 |
0 |
over 2 years ago |
0 |
|
145 |
other |
Python |
| A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) |
| OpenGVLab/InternGPT |
3,204 |
|
0 |
0 |
almost 2 years ago |
0 |
|
18 |
apache-2.0 |
Python |
| InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM) |
| BDBC-KG-NLP/QA-Survey-CN |
1,302 |
|
0 |
0 |
about 3 years ago |
0 |
|
1 |
|
|
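| The natural language processing research team at Beihang University's Advanced Innovation Center for Big Data has compiled a survey of research and applications in intelligent question answering. It covers knowledge-graph-based question answering (KBQA), text-based question answering (TextQA), table-based question answering (TableQA), visual question answering (VisualQA), and machine reading comprehension (MRC), with separate summaries of academic and industrial work for each task type. |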
| NVlabs/prismer |
1,245 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
other |
Python |
| The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". |
| microsoft/Oscar |
995 |
|
0 |
0 |
almost 3 years ago |
0 |
|
137 |
mit |
Python |
| Oscar and VinVL |
| ramprs/grad-cam |
652 |
|
0 |
0 |
over 9 years ago |
0 |
|
3 |
|
Lua |
| [ICCV 2017] Torch code for Grad-CAM |
| jayleicn/ClipBERT |
649 |
|
0 |
0 |
almost 3 years ago |
0 |
|
12 |
mit |
Python |
| [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks. |
| hengyuan-hu/bottom-up-attention-vqa |
606 |
|
0 |
0 |
over 6 years ago |
0 |
|
15 |
gpl-3.0 |
Python |
| An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge. |
| jokieleung/awesome-visual-question-answering |
541 |
|
0 |
0 |
over 3 years ago |
0 |
|
0 |
|
|
| A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas. |
| Cadene/vqa.pytorch |
536 |
|
0 |
0 |
over 6 years ago |
0 |
|
19 |
|
Python |
| Visual Question Answering in PyTorch |