Awesome Open Source
Search results for multimodal deep learning
139 search results found
Mm_alt
⭐
10
[MM 2022 Oral] MM-ALT: A Multimodal Automatic Lyric Transcription System
Clot
⭐
10
Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation"
Log 2023 Gnns Recsys
⭐
9
Presented as tutorial at the Second Learning on Graphs Conference (LoG 2023)
Job Recommend Competition
⭐
9
🥇 1st-place solution for the KNOW-based job recommendation algorithm competition 🥇
Deep Representations Of Visual Descriptions
⭐
9
PyTorch implementation of the CVPR'16 paper "Learning Deep Representations of Fine-Grained Visual Descriptions" by Reed et al.
Multimodal Robustness
⭐
9
Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'
Pegasus
⭐
9
PegasusX: The Future of Multimodal Embeddings 🦄 🦄
Fashion_image_caption
⭐
9
Automated fashion image captioning using BLIP-2. Automatically generates descriptions of clothes on shopping websites, which can help customers without fashion knowledge better understand the features (attributes, style, functionality, etc.) of the items and can increase online sales by attracting more customers.
Deepcu Ijcai19
⭐
8
DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19
Vtc
⭐
8
VTC: Improving Video-Text Retrieval with User Comments
Ofa X
⭐
8
This repository contains the code for the publication "Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language Explanations"
Segresearchtoolkit
⭐
8
A highly efficient research and development toolkit for image segmentation, based on PyTorch.
Blitext
⭐
8
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Mqmc
⭐
8
This repo has the PyTorch implementation and datasets of our WSDM 2023 paper: “Multi-queue Momentum Contrast for Microvideo-Product Retrieval”.
Dailypaperclub
⭐
8
The repository for the exclusive Daily Paper Club hosted at Agora every 10pm NYC time at this discord: https://discord.gg/Gnzh6dnzyz
Mmae
⭐
8
Package for Multimodal Autoencoders in TensorFlow / Keras
Multimodal Dl Framework
⭐
7
An extensible PyTorch framework to experiment with neural-networks-based deep learning algorithms on multiple data modalities for binary classification.
Bifusion
⭐
7
M2h2 Dataset
⭐
7
This repository contains the dataset and baselines described in the paper "M2H2: A Multimodal Multiparty Hindi Dataset For Humor Recognition in Conversations"
Ducho
⭐
7
Accepted at ACM Multimedia 2023 in the Open Source track.
Lemanchot Analysis
⭐
7
LeManchot-Analysis is a system for abnormality detection in coupled visible-thermal images
Image Text Verification
⭐
7
Official repository for the "VERITE: A Robust Benchmark for Multimodal Misinformation Detection Accounting for Unimodal Bias" paper.
Stacked Attention Networks For Visual Question Answering
⭐
7
Implementation of the paper "Stacked Attention Networks for Image Question Answering" in TensorFlow
Gvcci
⭐
7
[IROS 2023] GVCCI: Lifelong Learning of Visual Grounding for Language-Guided Robotic Manipulation
Awesome Video Language Understanding
⭐
7
A Survey on video and language understanding.
Flair 2
⭐
7
Engage in a semantic segmentation challenge for land cover description using multimodal remote sensing earth observation data, delving into real-world scenarios with a dataset comprising 70,000+ aerial imagery patches and 50,000 Sentinel-2 satellite acquisitions.
Cpac
⭐
7
[Bioinformatics 2022] Cross-Modality and Self-Supervised Protein Embedding for Compound-Protein Affinity and Contact Prediction
Vision Language Modelling Series
⭐
7
Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations
Greedy_multimodal_learning
⭐
6
Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks
Taris
⭐
6
Transformer-based online speech recognition system with TensorFlow 2
Bias In Vision And Language
⭐
6
Code for paper "Measuring Social Biases in Grounded Vision and Language Embeddings"
Formal Multimod Rec
⭐
6
Formalizing Multimedia Recommendation through Multimodal Deep Learning, under review at TORS.
Deepfake Detection Challenge Dfad2023
⭐
6
Implementation of solution for the Media Analytics Challenge.
Mm Align
⭐
5
This repository contains the official implementation of the paper: MM-Align: Learning Optimal Transport-based Alignment Dynamics for Fast and Accurate Inference on Missing Modality Sequences (EMNLP 2022)
Platform
⭐
5
Run custom multi-modal AI models fully on-device
Mogonet
⭐
5
MOGONET (Multi-Omics Graph cOnvolutional NETworks) is a multi-omics integrative data analysis framework for classification tasks in biomedical applications.
Multi Modal Recommendation System
⭐
5
Official code for the paper "Towards developing a Multi Modal Video Recommendation system"
Memsem
⭐
5
A Multi-modal Framework for Sentiment Analysis of Memes
Protein_pretrain
⭐
5
Multimodal Pretraining for Unsupervised Protein Representation Learning
101-139 of 139 search results