Open source alternatives to OpenRLHF
Five open-source alternatives to OpenRLHF in the index, ranked by GitHub stars and freshness.
veRL
Community
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Best for: ML engineers building custom RL post-training pipelines for LLMs at scale.
ROLL
Community
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Best for: Researchers and engineers working on RL-based LLM alignment and fine-tuning at scale.
TRL
Community
Hugging Face's library for training transformer language models with reinforcement learning.
Best for: Developers fine-tuning language models with reinforcement learning for alignment or behavior optimization.
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Community
Stanford
Best for: Researchers and developers who need a straightforward, stable method to align language models with human preferences without the overhead of reinforcement learning.
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Community
Recent AI-assistant agents, such as ChatGPT, predominantly rely on supervised fine-tuning (SFT) with human annotations and reinforcement learning from human feedback (RLHF) to align ...
Best for: Researchers and developers seeking cost-effective LLM alignment methods.