Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for multimodal multimodality
multimodal
x
multimodality
x
14 search results found
Llava
⭐
12,514
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Internlm Xcomposer
⭐
820
Clip4clip
⭐
663
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Swarms
⭐
376
Build, Deploy, and Scale Reliable Swarms of Autonomous Agents for Workflow Automation. Join our Community: https://discord.gg/DbjBMJTSWD
Cm3leon
⭐
288
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
Clip Guided Diffusion
⭐
267
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Mmmu
⭐
167
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Fusilli
⭐
120
A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evaluation - fusilli's got you covered 🌸
Pali3
⭐
97
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
Andromeda
⭐
92
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
Swarms Pytorch
⭐
67
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
Multi_token
⭐
54
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
Pali
⭐
42
Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"
Kosmos2.5
⭐
34
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Trar Vqa
⭐
23
This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task
Guidance For Multi Omics And Multi Modal Data Integration And Analysis On Aws
⭐
16
This guidance creates a scalable environment in AWS to prepare genomic, clinical, mutation, expression and imaging data for large-scale analysis and perform interactive queries against a data lake. The solution also demonstrates the use of Amazon Omics for multi-modal analysis.
Mvgl
⭐
15
TCyb17: Graph learning for multiview clustering
Pywikimm
⭐
9
Collects a multimodal dataset of Wikipedia articles and their images
Gato
⭐
8
Plug in and play Implementation of "A Generalist Agent" by Deepmind.
Mmca
⭐
8
The open source community's implementation of the all-new Multi-Modal Causal Attention from "DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention"
Dailypaperclub
⭐
8
The repository for the exclusive Daily Paper Club hosted at Agora every 10pm NYC time at this discord: https://discord.gg/Gnzh6dnzyz
Tinygptv
⭐
7
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
Multimodal Tot
⭐
6
Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement
Gats
⭐
6
Implementation of GATS from the paper: "GATS: Gather-Attend-Scatter" in pytorch and zeta
Mlxtransformer
⭐
5
Simple Implementation of a Transformer in the new framework MLX by Apple
Related Searches
Python Multimodal (186)
1-14 of 14 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.