Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python cross modal
cross-modal
x
python
x
21 search results found
Discoart
⭐
3,773
🪩 Create Disco Diffusion artworks in one line
Multimodal Maestro
⭐
871
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
Scan
⭐
442
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Solc
⭐
109
Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类
Objects That Sound
⭐
62
Implementation of Google Deepmind's paper `Objects that Sound`
Cmg
⭐
52
The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)
Dsran
⭐
50
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
Vltvg
⭐
47
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
Distill Bev
⭐
46
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
Unipt
⭐
36
The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
Aaai17 Cdq
⭐
31
The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"
Duet
⭐
29
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Biot5
⭐
29
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (EMNLP 2023)
Zerovl
⭐
20
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
Cross Modal Hasing Playground
⭐
19
Python implementation of cross-modal hashing algorithms
Sakdn
⭐
19
[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition
Xmodal Ctx
⭐
18
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
Text2pos Cvpr2022
⭐
17
Code, dataset and models for our CVPR 2022 publication "Text2Pos"
Aladin
⭐
16
Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"
Speech To Image Translation Without Text
⭐
10
Code for paper "direct speech-to-image translation"
Dscnet
⭐
9
DSCNet Visible-Infrared Person ReID (TIFS 2022)
Related Searches
Python Dataset (14,792)
Python Machine Learning (14,099)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Algorithms (10,033)
Python Natural Language Processing (9,064)
Python Pytorch (7,877)
Python Testing (7,358)
Python Keras (6,821)
Python Paper (6,577)
1-21 of 21 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.