Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for vision and language video captioning
video-captioning
x
vision-and-language
x
5 search results found
Xmodaler
⭐
929
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Vidchapters
⭐
93
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
Video_captioning_datasets
⭐
63
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Pytorch_empirical Mvm
⭐
30
A PyTorch implementation of EmpiricalMVM
Vlcap
⭐
26
[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Related Searches
Python Vision And Language (69)
Pytorch Vision And Language (34)
Python Video Captioning (24)
Pre Training Vision And Language (18)
1-5 of 5 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.