Awesome LLM Human Preference Datasets
by Community
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
OSS
Awesome LLM Human Preference Datasets
Added 1 June 2026
Overview
This is a curated list of human preference datasets for LLM fine-tuning, RLHF, and evaluation. It is maintained as a community resource on GitHub with 391 stars.
Best for
Best for
Researchers and developers needing a curated index of human preference datasets.
Use cases
- Finding datasets for RLHF fine-tuning of language models.
- Locating human preference data for model evaluation.
- Discovering benchmark datasets for preference learning research.
Notes
This is a curated list of human preference datasets for LLM fine-tuning, RLHF, and evaluation. It is maintained as a community resource on GitHub with 391 stars.
391 stars on GitHub. Last updated 2023-10-04. Licensed MIT.
Use cases
- Finding datasets for RLHF fine-tuning of language models.
- Locating human preference data for model evaluation.
- Discovering benchmark datasets for preference learning research.
Pros
- Centralized reference for many human preference datasets.
- Community driven and openly available on GitHub.
- Useful starting point for researchers new to RLHF.
Cons
- No guarantee of active maintenance or updates.
- List may lack recent datasets or be incomplete.
- No tooling or automation, just an index.
Indexed from awesome-llm and enriched against its public facts.
Pros
- Centralized reference for many human preference datasets.
- Community driven and openly available on GitHub.
- Useful starting point for researchers new to RLHF.
Cons
- No guarantee of active maintenance or updates.
- List may lack recent datasets or be incomplete.
- No tooling or automation, just an index.
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
Axolotl
Community
Go ahead and axolotl questions
unslothai
Community
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
OpenRLHF
Community
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
veRL
Community
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
lm-evaluation-harness
Community
A framework for few-shot evaluation of language models.