Awesome Open Source

Programming Languages

Search results for foundation models vision language model

foundation-models x

vision-language-model x

7 search results found

Llava ⭐ 12,514

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Awesome Japanese Llm ⭐ 585

日本語LLMまとめ - Overview of Japanese LLMs

Groundinglmm ⭐ 434

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Voxposer ⭐ 103

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Attackvlm ⭐ 79

Code of the paper: On Evaluating Adversarial Robustness of Large Vision-Language Models

Awesome Multimodal Llm Autonomous Driving ⭐ 35

Multimodal Large Language Models for Autonomous Driving [WACV 2024 Survey Paper]

Llava Docker ⭐ 32

Docker image for LLaVA: Large Language and Vision Assistant

Related Searches

Python Foundation Models (42)

Python Vision Language Model (20)

Multimodal Foundation Models (10)

Chatgpt Foundation Models (8)

Vision Transformer Foundation Models (8)

Multimodal Vision Language Model (6)

Llm Vision Language Model (6)

Vision Language Model Llava (6)

Foundation Models Embodied Ai (4)

Artificial Intelligence Vision Language Model (4)

1-7 of 7 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.