Awesome Open Source

Programming Languages

Search results for python cross modal

21 search results found

Discoart ⭐ 3,773

🪩 Create Disco Diffusion artworks in one line

Multimodal Maestro ⭐ 871

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Remote Sensing Sar-Optical Land-use Classfication Pytorch Pytorch高分辨率遥感语义分割/地物分割/地物分类

Objects That Sound ⭐ 62

Implementation of Google Deepmind's paper `Objects that Sound`

The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

Distill Bev ⭐ 46

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"

Aaai17 Cdq ⭐ 31

The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations (EMNLP 2023)

[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources

Cross Modal Hasing Playground ⭐ 19

Python implementation of cross-modal hashing algorithms

[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition

Xmodal Ctx ⭐ 18

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Text2pos Cvpr2022 ⭐ 17

Code, dataset and models for our CVPR 2022 publication "Text2Pos"

Official implementation of the paper "ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval"

Speech To Image Translation Without Text ⭐ 10

Code for paper "direct speech-to-image translation"

DSCNet Visible-Infrared Person ReID (TIFS 2022)

Related Searches

Python Dataset (14,792)

Python Machine Learning (14,099)

Python Tensorflow (13,736)

Python Deep Learning (13,092)

Python Algorithms (10,033)

Python Natural Language Processing (9,064)

Python Pytorch (7,877)

Python Testing (7,358)

Python Keras (6,821)

Python Paper (6,577)

1-21 of 21 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.