Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for captioning
captioning
x
30 search results found
Mmf
⭐
5,414
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Capdec
⭐
155
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
Aac Datasets
⭐
74
Audio Captioning datasets for PyTorch.
Fully Convolutional Point Network
⭐
73
Fully-Convolutional Point Networks for Large-Scale Point Clouds
Vistext
⭐
58
VisText is a benchmark dataset for semantically rich chart captioning.
Mtl Aqa
⭐
38
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment
Iperceive
⭐
36
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
Dcase 2020 Baseline
⭐
35
Audio captioning baseline system for DCASE 2020 challenge.
Vsua Captioning
⭐
29
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
Caption By Committee
⭐
27
Using LLMs and pre-trained caption models for super-human performance on image captioning.
Medicalreportgeneration
⭐
27
A Base Tensorflow Project for Medical Report Generation
Vidsitu
⭐
23
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Papernotes
⭐
23
My notes on some Deep Learning papers
Camel
⭐
21
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
Ebu Tt Live Toolkit
⭐
20
Toolkit for supporting the EBU-TT Live specification
X Trans2cap
⭐
20
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Pacscore
⭐
20
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023
Awesome Diverse Captioning
⭐
18
Some papers about *diverse* image (a few videos) captioning
Aoa Pytorch
⭐
13
A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
Aac Metrics
⭐
12
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
S2vt Seq2seq Video Captioning Attention
⭐
12
S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow
Clotho Dataset
⭐
12
Python code for handling the Clotho dataset.
Smart I
⭐
9
Smart-I is an android application aimed at helping the visually impaired using artificial intelligence and cloud computing.
Tennis
⭐
7
A Tennis dataset and models for event detection & commentary generation
Simplesubtitleeditor
⭐
7
SimpleSubtitleEditor for Blender
Medclip
⭐
7
Medical image captioning using OpenAI's CLIP
Indonesian Image Captioning
⭐
7
Indonesian Image Captioning using Attention-based Semantic Compositional Networks
Zerogen
⭐
6
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
Pma Net
⭐
5
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023
R3transformer
⭐
5
Official python implementation of R3-Transformer
1-30 of 30 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.