Databricks Dolly 15k — LLM instruction Dataset

Roughly 15,000 human-generated prompt/response pairs for instruction tuning, released under CC BY-SA 3.0, which permits commercial use.

Dataset Details

Provider: Databricks
Category: instruction
Size: 15k rows
License: CC-BY-SA 3.0
Downloads: 3M
Tags: Human-Generated, Commercial, General

from datasets import load_dataset

# The dataset's Hugging Face Hub ID is "databricks/databricks-dolly-15k".
ds = load_dataset("databricks/databricks-dolly-15k")
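
Once loaded, each row usually needs to be flattened into a single prompt string before fine-tuning. The following is a minimal sketch of that step, assuming each record exposes `instruction`, `context`, and `response` fields. The `format_example` helper and the `### ...` section headers are illustrative choices, not a template prescribed by Databricks, and the sample record is made up for demonstration.

```python
def format_example(record: dict) -> str:
    """Flatten one instruction record into a single prompt string.

    Assumes the keys "instruction", "context" (optional or empty), and
    "response". The "### ..." headers are a hypothetical template.
    """
    instruction = record["instruction"].strip()
    context = (record.get("context") or "").strip()
    response = record["response"].strip()

    parts = [f"### Instruction:\n{instruction}"]
    if context:
        # Only emit a context section when the record actually has one.
        parts.append(f"### Context:\n{context}")
    parts.append(f"### Response:\n{response}")
    return "\n\n".join(parts)


# Illustrative record (not taken from the dataset).
sample = {
    "instruction": "Name a primary color.",
    "context": "",
    "response": "Red is a primary color.",
    "category": "open_qa",
}

prompt = format_example(sample)
print(prompt)
```

With the dataset loaded as `ds`, and assuming it provides a `train` split, the same helper could be applied to every row with `ds["train"].map(lambda r: {"text": format_example(r)})`.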
