Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for computer vision multimodal learning
computer-vision
x
multimodal-learning
x
25 search results found
Awesome Multimodal Ml
⭐
5,399
Reading list for research topics in multimodal machine learning
Open_flamingo
⭐
3,115
An open-source framework for training large multimodal models.
Iccv 2023 Papers
⭐
806
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!
Pykale
⭐
415
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
Xpretrain
⭐
382
Multi-modality pre-training
Multibench
⭐
356
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Mmmu
⭐
167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Missing_aware_prompts
⭐
101
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
Ofasys
⭐
79
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models
Learning2dance_cag_2020
⭐
79
PyTorch implementation of our graph convolutional network (GCN) for human motion generation from music. Also with paired dance-music data for training!
Baidubigdata19 Urfc
⭐
72
my solution with 0.67 accuracy
General Gpt
⭐
61
Multiviz
⭐
48
[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models
Cova Web Object Detection
⭐
37
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
Adamml
⭐
36
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
G Universal Clip
⭐
33
4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022
Dapt
⭐
26
Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)
Egopat3d
⭐
25
[CVPR 2022] Egocentric Action Target Prediction in 3D
Valhalla Nmt
⭐
23
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
Mica Movieclip
⭐
22
This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
Isbertblind
⭐
19
This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding" (CVPR 2023)
Multimodal Distillation
⭐
13
Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)
Itra
⭐
7
A codebase for flexible and efficient Image Text Representation Alignment
Diverse_and_specific_image_captioning
⭐
7
Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions.
Kosmosg
⭐
7
My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
Related Searches
Python Computer Vision (6,027)
Deep Learning Computer Vision (3,558)
Machine Learning Computer Vision (2,342)
Jupyter Notebook Computer Vision (1,649)
Computer Vision Opencv (1,326)
Pytorch Computer Vision (1,099)
Convolutional Neural Networks Computer Vision (1,072)
Artificial Intelligence Computer Vision (949)
Tensorflow Computer Vision (905)
Computer Vision Image Processing (837)
1-25 of 25 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.