Awesome Open Source
Search results for image captioning vision and language
image-captioning
vision-and-language
10 search results found
Lavis
⭐ 7,917
LAVIS - A One-stop Library for Language-Vision Intelligence
Prismer
⭐ 1,245
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Oscar
⭐ 995
Oscar and VinVL
Xmodaler
⭐ 929
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Image Captioning
⭐ 188
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Clip Caption Reward
⭐ 104
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Mia
⭐ 42
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
Xmodal Ctx
⭐ 18
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Spatial Reasoning
⭐ 6
Grounding Language Models for Compositional and Spatial Reasoning
Pma Net
⭐ 5
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
Related Searches
Python Image Captioning (175)
Captions Image Captioning (147)
Jupyter Notebook Image Captioning (141)
Deep Learning Image Captioning (115)
Python Vision And Language (69)
Attention Image Captioning (60)
Coco Image Captioning (40)
Pytorch Vision And Language (34)
Copyright 2018-2024 Awesome Open Source. All rights reserved.