Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python video question answering
python
x
video-question-answering
x
24 search results found
Ask Anything
⭐
2,404
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Internvideo
⭐
736
InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)
Clipbert
⭐
649
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Pytorch_violet
⭐
130
A PyTorch implementation of VIOLET
Frozenbilm
⭐
120
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Alpro
⭐
109
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Tvqaplus
⭐
107
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
Hbi
⭐
75
[CVPR 2023 Highlight] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
Next Qa
⭐
74
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
Emcl
⭐
55
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
Pkol
⭐
43
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
Flipped Vqa
⭐
35
Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
Pytorch_empirical Mvm
⭐
30
A PyTorch implementation of EmpiricalMVM
Vgt
⭐
30
Video Graph Transformer for Video Question Answering (ECCV'22)
Causal Vidqa
⭐
25
[CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The code used in our paper "From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering", CVPR2022.
Igv
⭐
19
This repo contains code for Invariant Grounding for Video Question Answering
Next Gqa
⭐
16
Can I Trust Your Answer? Visually Grounded VideoQA
Tem Adapter
⭐
15
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Hqga
⭐
13
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
Knowit Rock
⭐
12
ROCK model for Knowledge-Based VQA in Videos
Shot2story
⭐
11
A new multi-shot video understanding benchmark Shot2Story20K with detailed shot-level captions and comprehensive video summaries.
Covgt
⭐
10
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
Dramaqa
⭐
8
DramaQA Starter Code (2021)
Lifeqa
⭐
7
Data and PyTorch code for the LifeQA LREC 2020 paper.
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Deep Learning (17,914)
Python Flask (17,643)
Python Dataset (14,962)
Python Pytorch (14,867)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Jupyter Notebook (12,976)
1-24 of 24 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.