Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for gpt multimodal
gpt
x
multimodal
x
13 search results found
Ai Notes
⭐
4,180
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
Marqo
⭐
3,893
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Visualglm 6b
⭐
3,638
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Interngpt
⭐
2,976
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
Mplug Owl
⭐
1,657
[Official Implementation] mPLUG-Owl & mPLUG-Owl2: Alibaba MLLM Family.
Awesome Llm Reasoning
⭐
1,035
Papers and resources on Reasoning in Language Models (LLMs), including Chain-of-Thought, Instruction-Tuning, Multimodality.
Motiongpt
⭐
1,018
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Data Juicer
⭐
994
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Multimodal Gpt
⭐
971
Multimodal-GPT
Vectordb Recipes
⭐
267
High quality resources & applications for LLMs, multi-modal models and VectorDBs
Easyinstruct
⭐
253
An Easy-to-use Instruction Processing Framework for LLMs.
Lrv Instruction
⭐
160
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Vlmevalkit
⭐
137
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 30+ HF models, 15+ benchmarks
Ll3da
⭐
65
"LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
Gemini Pro Bot
⭐
59
A Python Telegram bot powered by Google's gemini-pro LLM API
Kosmos X
⭐
53
The Next Generation Multi-Modality Superintelligence
Videodb Python
⭐
37
VideoDB Python SDK
Generative_deep_learning_2nd_edition
⭐
16
<만들면서 배우는 생성 AI 2판>의 코드 저장소
Described
⭐
14
Automatically describe images sent by users on popular media platforms, incredibly useful for the visually impaired and for complicated imagery.
Mmc
⭐
13
Shapegpt
⭐
12
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model
Related Searches
Openai Gpt (333)
Chatgpt Gpt (327)
Artificial Intelligence Gpt (270)
Python Gpt (262)
Llm Gpt (260)
Python Multimodal (186)
1-13 of 13 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.