Awesome Open Source

Search results for vision and language video captioning

5 search results found

Xmodaler ⭐ 929

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Vidchapters ⭐ 93

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Video_captioning_datasets ⭐ 63

A summary of video-to-text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*.

Pytorch_empirical Mvm ⭐ 30

A PyTorch implementation of EmpiricalMVM

[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
