LLaVA Instruct — LLM multimodal Dataset

Visual instruction tuning dataset. Contains 158k unique language-image instruction-following samples to make LLMs 'see'.

Dataset Details

ProviderHaotian Liu
Categorymultimodal
Size158k Samples
LicenseCC-BY-NC 4.0
Downloads200k
TagsVision, Image, Multimodal
from datasets import load_dataset
ds = load_dataset("Haotian Liu/llava-instruct")

← All Datasets | Fine-Tuning Guide