Awesome Open Source

Search results for python image captioning

132 search results found

Interngpt ⭐ 2,976

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

A Pytorch Tutorial To Image Captioning ⭐ 2,084

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Caption Anything ⭐ 1,374

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-A https://huggingface.co/spaces/VIPLab/Caption-Anyth

Prismer ⭐ 1,245

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Oscar and VinVL

Self Critical.pytorch ⭐ 964

Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.

Xmodaler ⭐ 929

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval, and image captioning.

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Omninet ⭐ 426

Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Meshed Memory Transformer ⭐ 420

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Neuralmonkey ⭐ 385

An open-source tool for sequence learning in NLP built on TensorFlow.

Cv Tricks.com ⭐ 384

Repository for all the tutorials and codes shared at cv-tricks.com

Im2latex ⭐ 289

Image to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow

Show Control And Tell ⭐ 273

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Caption_generator ⭐ 233

A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.

Awesome Foundation And Multimodal Models ⭐ 223

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code]

Unsupervised_captioning ⭐ 194

Code for Unsupervised Image Captioning

Image Captioning ⭐ 188

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

Video2description ⭐ 153

Video to Text: Natural language description generator for some given video. [Video Captioning]

Code for paper "Attention on Attention for Image Captioning". ICCV 2019

Show Adapt And Tell ⭐ 142

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Fairseq Image Captioning ⭐ 134

Transformer-based image captioning extension for pytorch/fairseq

Language Models Can See: Plugging Visual Controls in Text Generation

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Sightseq ⭐ 109

🔭 Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Clip Caption Reward ⭐ 104

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

Tag manager and captioner for image datasets

Pytorch Polygon Rnn ⭐ 101

Pytorch implementation of Polygon-RNN(http://www.cs.toronto.edu/polyrnn/poly

L Verse ⭐ 100

L-Verse: Bidirectional Generation Between Image and Text

Keras Image Captioning ⭐ 100

An implementation of image captioning in Keras

Image Captioning ⭐ 98

TensorFlow (TensorLayer) Implementation of Image Captioning

Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)

CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

Transform And Tell ⭐ 85

[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning

Clip Gpt Captioning ⭐ 71

CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.

S2 Transformer ⭐ 70

[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”

Image Caption Generator ⭐ 70

A neural network to generate captions for an image using a CNN and RNN with beam search.

Image Captioning ⭐ 69

Medical Report Generation ⭐ 69

A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.

Expansionnet_v2 ⭐ 68

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

Cvpr18 Caption Eval ⭐ 66

Learning to Evaluate Image Captioning. CVPR 2018

Using pretrained encoder and language models to generate captions from multimedia inputs.

Show Attend And Tell ⭐ 61

[Python 3] Tensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

Image Captioning Using Transformer

Image captioning ready-to-go inference: show and tell model compatible with Tensorflow r1.9

Awesome Remote Image Captioning ⭐ 55

A list of awesome remote sensing image captioning resources

[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.

Stylenet ⭐ 53

A PyTorch implementation of "StyleNet: Generating Attractive Visual Captions with Styles"

Knowing When To Look Adaptive Attention ⭐ 48

PyTorch implementation of "Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning"

Updown Baseline ⭐ 48

Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".

Cvt2distilgpt2 ⭐ 46

Improving Chest X-Ray Report Generation by Leveraging Warm-Starting

Show_and_tell ⭐ 43

Show and Tell : A Neural Image Caption Generator

Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)

Image_captioning ⭐ 40

Generates captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset.

Neural Image Captioning ⭐ 39

Using scene-specific contexts and region-based attention in neural image captioning

Image Captioning ⭐ 37

Image Captioning: Implementing the Neural Image Caption Generator with Python

Image Caption Generator ⭐ 37

The LSTM model generates captions for input images after extracting features from a pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)

MLSys Workshop NeurIPS 2023 - Redco: A Lightweight Tool to Automate Distributed Training and Inference

Punny_captions ⭐ 31

An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".

Machine Learning ⭐ 31

The projects I do in machine learning with PyTorch, Keras, TensorFlow, scikit-learn, and Python.

Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Network for Fine-Grained Image Captioning (TPAMI 2019)

Optimization_of_image_description_metrics_using_policy_gradient_methods ⭐ 29

TensorFlow implementation of the paper "Optimization of image description metrics using policy gradient methods"

Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019

Skeleton Key ⭐ 26

The implementation of the model in paper "Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition"

Im2latex ⭐ 26

PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX

Inverse Dall E For Optical Character Recognition ⭐ 24

Inverse DALL-E for Optical Character Recognition

Image Caption ⭐ 24

Using LSTM or Transformer to solve Image Captioning in Pytorch

CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022

Soft Attention Image Captioning ⭐ 20

TensorFlow implementation of Show, Attend and Tell (ICML'15)

Controllable image captioning model with unsupervised modes

Recurrent_fusion_network ⭐ 19

Source code for "Recurrent Fusion Network for Image Captioning".

Multimodal Meta Learn ⭐ 19

Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" (published at ICLR 2023).

Gramtion ⭐ 18

Twitter bot for generating photo descriptions (alt text)

Show Attend And Tell ⭐ 18

A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Xmodal Ctx ⭐ 18

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Butd_model ⭐ 17

A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.

Fluent Cap ⭐ 17

Code for fluency-guided cross-lingual image captioning

Attn Gan ⭐ 17

PyTorch implementation of the paper "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

Captioning_chainer ⭐ 16

A fast implementation of Neural Image Caption in Chainer

End-to-end deep learning model for image captioning

Codalab Microsoft Coco Image Captioning Challenge ⭐ 15

🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution (06.30.21)

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

Image Caption Tf ⭐ 15

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

Sparse Image Captioning ⭐ 14

Image captioning with weight pruning in PyTorch

A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.

Compositional Image Captioning ⭐ 13

Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Aralikatte and Desmond Elliott

Image Captioning ⭐ 13

Image Captioning project for Computer Vision Course at NYU

An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)

Interactive Keras Captioning ⭐ 12

Interactive multimedia captioning with Keras

Self Critical ⭐ 11

PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"

Pytorch Image Captioning ⭐ 11

Transformer & CNN Image Captioning model in PyTorch.

Ntu 2022fall Dlcv ⭐ 11

Deep Learning for Computer Vision (深度學習於電腦視覺) by Frank Wang (王鈺強)

Causalvlr ⭐ 11

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning

Image2latex ⭐ 11

Image to Latex using Encoder-Decoder architecture

Look And Modify ⭐ 10

PyTorch Implementation of our BMVC 2019 Paper Look and Modify: Modification Networks for Image Captioning

1-100 of 132 search results
