Awesome LLM Human Preference Datasets

by Community

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

Added 1 June 2026

#awesome-list #datasets #eval #human-preferences #llm #machine-learning #nlp #rlhf

Overview

This is a curated list of human preference datasets for LLM fine-tuning, RLHF, and evaluation. It is maintained as a community resource on GitHub with 391 stars.

Best for

Researchers and developers needing a curated index of human preference datasets.

Use cases

Finding datasets for RLHF fine-tuning of language models.
Locating human preference data for model evaluation.
Discovering benchmark datasets for preference learning research.

Notes

391 stars on GitHub. Last updated 2023-10-04. Licensed MIT.

Pros

Centralized reference for many human preference datasets.
Community driven and openly available on GitHub.
Useful starting point for researchers new to RLHF.

Cons

No guarantee of active maintenance or updates.
List may lack recent datasets or be incomplete.
No tooling or automation, just an index.

Indexed from awesome-llm and enriched against its public facts.

Pairs with

Other entries in the index that connect to this one (4 entries).

O OSS Framework medium

OpenRLHF

Community

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

★ 9,583 updated 1mo ago

O OSS Framework medium

veRL

Community

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

★ 21,691 updated 1mo ago

O OSS Framework medium

Axolotl

Community

Go ahead and axolotl questions

★ 11,997 updated 1mo ago

O OSS Framework medium

lm-evaluation-harness

Community

A framework for few-shot evaluation of language models.

★ 12,772 updated 2mo ago
