Awesome Open Source
Search results for gpt 4 llava
8 search results found
Llava (⭐ 12,514)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Multimodal Maestro (⭐ 871)
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Video Chatgpt (⭐ 590)
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Lrv Instruction (⭐ 160)
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Llavar (⭐ 133)
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Hallusionbench (⭐ 128)
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Vip Llava (⭐ 81)
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Llava Docker (⭐ 32)
Docker image for LLaVA: Large Language and Vision Assistant
Related Searches
Python Gpt 4 (493)
Chatgpt Gpt 4 (492)
Artificial Intelligence Gpt 4 (340)
Llm Gpt 4 (171)
Javascript Gpt 4 (170)
Chatbot Gpt 4 (148)
Gpt 4 Langchain (94)
Gpt 4 Llama (47)
Gpt 4 Llama2 (20)
Gpt 4 Multimodal (16)