Awesome Open Source
Search results for natural language processing multimodal deep learning
multimodal-deep-learning
natural-language-processing
15 search results found
Awesome Grounding
⭐ 689
awesome grounding: A curated list of research papers in visual grounding
Nsmusics
⭐ 601
NSMusicS (Nine Songs · Music World: 九歌 · 音乐世界), a multi-platform, multi-mode music application (full-stack development, audio processing, artificial intelligence, natural language processing)
Awesome Emotion Recognition In Conversations
⭐ 216
A comprehensive reading list for Emotion Recognition in Conversations
Mmmu
⭐ 167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Awesome 3d Vision And Language
⭐ 62
A collection of 3D vision and language papers and datasets (e.g., 3D Visual Grounding, 3D Question Answering, and 3D Dense Captioning).
Vg Gplms
⭐ 49
The code repository for the EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".
Visual Spatial Reasoning
⭐ 38
[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.
Slp
⭐ 20
Utilities and modules for speech, language, and multimodal processing using PyTorch and PyTorch Lightning
Multimodal Transformer
⭐ 17
Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset
Concatbert
⭐ 17
Baseline model for multimodal classification based on images and text. The text representation is obtained from a pretrained BERT base model, and the image representation from a pretrained VGG16 model.
Drml
⭐ 16
Official Code Release for Diagnosing and Rectifying Vision Models using Language
Edis
⭐ 15
Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)
Whos Waldo
⭐ 13
Who's Waldo? Linking People Across Text and Images. ICCV 2021.
Job Recommend Competition
⭐ 9
🥇 First-place solution for the KNOW-based job recommendation algorithm competition 🥇
Multimodal Robustness
⭐ 9
Code and resources for the EMNLP 2022 paper "Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions"
Bias In Vision And Language
⭐ 6
Code for the paper "Measuring Social Biases in Grounded Vision and Language Embeddings"
Mm Align
⭐ 5
This repository contains the official implementation of the paper "MM-Align: Learning Optimal Transport-based Alignment Dynamics for Fast and Accurate Inference on Missing Modality Sequences" (EMNLP 2022)
Memsem
⭐ 5
A Multi-modal Framework for Sentiment Analysis of Memes
Related Searches
Python Natural Language Processing (7,915)
Jupyter Notebook Natural Language Processing (4,405)
Machine Learning Natural Language Processing (3,939)
Deep Learning Natural Language Processing (2,414)
Pytorch Natural Language Processing (1,212)
Artificial Intelligence Natural Language Processing (1,010)
Dataset Natural Language Processing (1,010)
Tensorflow Natural Language Processing (909)
Javascript Natural Language Processing (843)
Natural Language Processing Chatbot (726)
Copyright 2018-2024 Awesome Open Source. All rights reserved.