Awesome Open Source

Programming Languages

Search results for gpt 4 llava

8 search results found

Llava ⭐ 12,514

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Multimodal Maestro ⭐ 871

Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥

Video Chatgpt ⭐ 590

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Lrv Instruction ⭐ 160

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Hallusionbench ⭐ 128

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Vip Llava ⭐ 81

ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Llava Docker ⭐ 32

Docker image for LLaVA: Large Language and Vision Assistant

Related Searches

Python Gpt 4 (493)

Chatgpt Gpt 4 (492)

Artificial Intelligence Gpt 4 (340)

Llm Gpt 4 (171)

Javascript Gpt 4 (170)

Chatbot Gpt 4 (148)

Gpt 4 Langchain (94)

Gpt 4 Llama (47)

Gpt 4 Llama2 (20)

Gpt 4 Multimodal (16)

1-8 of 8 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.