Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for foundation models vision language model
foundation-models
x
vision-language-model
x
7 search results found
Llava
⭐
12,514
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Awesome Japanese Llm
⭐
585
日本語LLMまとめ - Overview of Japanese LLMs
Groundinglmm
⭐
434
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Voxposer
⭐
103
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Attackvlm
⭐
79
Code of the paper: On Evaluating Adversarial Robustness of Large Vision-Language Models
Awesome Multimodal Llm Autonomous Driving
⭐
35
Multimodal Large Language Models for Autonomous Driving [WACV 2024 Survey Paper]
Llava Docker
⭐
32
Docker image for LLaVA: Large Language and Vision Assistant
Related Searches
Python Foundation Models (42)
Python Vision Language Model (20)
Multimodal Foundation Models (10)
Chatgpt Foundation Models (8)
Vision Transformer Foundation Models (8)
Multimodal Vision Language Model (6)
Llm Vision Language Model (6)
Vision Language Model Llava (6)
Foundation Models Embodied Ai (4)
Artificial Intelligence Vision Language Model (4)
1-7 of 7 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.