UltraFeedback — LLM preference Dataset

Large-scale, fine-grained preference dataset. It contains 64k prompts with multiple model responses rated by GPT-4. Essential for RLHF and DPO.

Dataset Details

ProviderOpenBMB
Categorypreference
Size64k Rows
LicenseMIT
Downloads1.2M
TagsRLHF, DPO, Alignment
from datasets import load_dataset
ds = load_dataset("OpenBMB/ultrafeedback")

← All Datasets | Fine-Tuning Guide