Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python image captioning
image-captioning
x
python
x
132 search results found
Interngpt
⭐
2,976
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
Ofa
⭐
2,142
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
A Pytorch Tutorial To Image Captioning
⭐
2,084
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Caption Anything
⭐
1,374
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-A https://huggingface.co/spaces/VIPLab/Caption-Anyth
Prismer
⭐
1,245
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Oscar
⭐
995
Oscar and VinVL
Self Critical.pytorch
⭐
964
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
Xmodaler
⭐
929
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
Omml
⭐
528
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Virtex
⭐
506
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
Scan
⭐
442
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Omninet
⭐
426
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Meshed Memory Transformer
⭐
420
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Neuralmonkey
⭐
385
An open-source tool for sequence learning in NLP built on TensorFlow.
Cv Tricks.com
⭐
384
Repository for all the tutorials and codes shared at cv-tricks.com
Im2latex
⭐
289
Image to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow
Show Control And Tell
⭐
273
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Caption_generator
⭐
233
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
Awesome Foundation And Multimodal Models
⭐
223
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code]
Unsupervised_captioning
⭐
194
Code for Unsupervised Image Captioning
Image Captioning
⭐
188
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Video2description
⭐
153
Video to Text: Natural language description generator for some given video. [Video Captioning]
Aoanet
⭐
149
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Show Adapt And Tell
⭐
142
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Fairseq Image Captioning
⭐
134
Transformer-based image captioning extension for pytorch/fairseq
Magic
⭐
124
Language Models Can See: Plugging Visual Controls in Text Generation
Grit
⭐
119
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Sightseq
⭐
109
🔭 Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Clip Caption Reward
⭐
104
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
Taggui
⭐
103
Tag manager and captioner for image datasets
Pytorch Polygon Rnn
⭐
101
Pytorch implementation of Polygon-RNN(http://www.cs.toronto.edu/polyrnn/poly
L Verse
⭐
100
L-Verse: Bidirectional Generation Between Image and Text
Keras Image Captioning
⭐
100
An implementation of image captioning in Keras
Image Captioning
⭐
98
TensorFlow (TensorLayer) Implementation of Image Captioning
Rstnet
⭐
95
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
Arnet
⭐
89
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Transform And Tell
⭐
85
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
Clip Gpt Captioning
⭐
71
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
S2 Transformer
⭐
70
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
Image Caption Generator
⭐
70
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Image Captioning
⭐
69
Medical Report Generation
⭐
69
A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.
Expansionnet_v2
⭐
68
Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"
Cvpr18 Caption Eval
⭐
66
Learning to Evaluate Image Captioning. CVPR 2018
Clipcap
⭐
64
Using pretrained encoder and language models to generate captions from multimedia inputs.
Show Attend And Tell
⭐
61
[Python 3] Tensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Catr
⭐
60
Image Captioning Using Transformer
Im2txt
⭐
58
Image captioning ready-to-go inference: show and tell model compatible with Tensorflow r1.9
Awesome Remote Image Captioning
⭐
55
A list of awesome remote sensing image captioning resources
Upop
⭐
54
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Stylenet
⭐
53
A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
Knowing When To Look Adaptive Attention
⭐
48
PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning
Updown Baseline
⭐
48
Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".
Cvt2distilgpt2
⭐
46
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Show_and_tell
⭐
43
Show and Tell : A Neural Image Caption Generator
Mia
⭐
42
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
Image_captioning
⭐
40
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset
Neural Image Captioning
⭐
39
Using scene-specific contexts and region-based attention in neural image captioning
Image Captioning
⭐
37
Image Captioning: Implementing the Neural Image Caption Generator with python
Image Caption Generator
⭐
37
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
Redco
⭐
35
MLSys Workshop NeurIPS 2023 - Redco: A Lightweight Tool to Automate Distributed Training and Inference
Punny_captions
⭐
31
An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".
Machine Learning
⭐
31
The projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Cavp
⭐
30
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)
Optimization_of_image_description_metrics_using_policy_gradient_methods
⭐
29
Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods
Aat
⭐
27
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
Skeleton Key
⭐
26
The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition"
Im2latex
⭐
26
Pytorch implemention of Deep CNN Encoder + LSTM Decoder with Attention for Image to Latex
Inverse Dall E For Optical Character Recognition
⭐
24
Inverse DALL-E for Optical Character Recognition
Image Caption
⭐
24
Using LSTM or Transformer to solve Image Captioning in Pytorch
Camel
⭐
21
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
Soft Attention Image Captioning
⭐
20
tensorflow implementation of show, attend and tell (ICML'15)
Modecap
⭐
20
Controllable mage captioning model with unsupervised modes
Recurrent_fusion_network
⭐
19
Source code for "Recurrent Fusion Network for Image Captioning".
Multimodal Meta Learn
⭐
19
Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" (published at ICLR 2023).
Labert
⭐
19
Gramtion
⭐
18
Twitter bot for generating photo descriptions (alt text)
Show Attend And Tell
⭐
18
A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Xmodal Ctx
⭐
18
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Butd_model
⭐
17
A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.
Fluent Cap
⭐
17
code for fluency-guided cross-lingual image captioning
Attn Gan
⭐
17
Pytorch implementation of paper: AttnGAN Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
Captioning_chainer
⭐
16
A fast implementation of Neural Image Caption by Chainer
Img2txt
⭐
16
End-to-end deep learning model for image captioning
Codalab Microsoft Coco Image Captioning Challenge
⭐
15
🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21)
Mplug
⭐
15
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
Image Caption Tf
⭐
15
Image Caption
Bitters
⭐
14
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Sparse Image Captioning
⭐
14
Image captioning with weight pruning in PyTorch
Stt
⭐
14
A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
Compositional Image Captioning
⭐
13
Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Aralikatte and Desmond Elliott
Image Captioning
⭐
13
Image Captioning project for Computer Vision Course at NYU
Capeval
⭐
12
An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)
Interactive Keras Captioning
⭐
12
Interactive multimedia captioning with Keras
Self Critical
⭐
11
PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"
Pytorch Image Captioning
⭐
11
Transformer & CNN Image Captioning model in PyTorch.
Ntu 2022fall Dlcv
⭐
11
Deep Learning for Computer Vision 深度學習於電腦視覺 by Frank Wang 王鈺強
Causalvlr
⭐
11
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning
Image2latex
⭐
11
Image to Latex using Encoder-Decoder architecture
Look And Modify
⭐
10
PyTorch Implementation of our BMVC 2019 Paper Look and Modify: Modification Networks for Image Captioning
Related Searches
Python Machine Learning (19,284)
Python Dataset (14,792)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Natural Language Processing (9,064)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
Python Neural (7,444)
Python Keras (6,821)
1-100 of 132 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.