O Open Source Frameworks medium

TRL

by Community

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Visit Community View repo Submit your build →

OSS

TRL

Added 1 June 2026

Overview

TRL is an open-source framework for training transformer language models with reinforcement learning. It implements algorithms like PPO and DPO to align models with human preferences. The framework integrates with Hugging Face Transformers and supports custom reward models.

Best for

Best for
Developers fine-tuning language models with reinforcement learning for alignment or behavior optimization.

Use cases

Fine-tuning LLMs using reinforcement learning from human feedback (RLHF)
Aligning models to reduce harmful or biased outputs
Optimizing model behavior for specific reward signals or constraints

Notes

Use cases

Fine-tuning LLMs using reinforcement learning from human feedback (RLHF)
Aligning models to reduce harmful or biased outputs
Optimizing model behavior for specific reward signals or constraints

Pros

Built on top of the popular Hugging Face Transformers library
Supports multiple RL algorithms including PPO and DPO
Active community and maintained by Hugging Face

Cons

Requires solid understanding of reinforcement learning concepts
Training is computationally expensive compared to standard fine-tuning
Limited to models compatible with the Hugging Face ecosystem

Indexed from awesome-llm and enriched against its public facts.

Pros

Built on top of the popular Hugging Face Transformers library
Supports multiple RL algorithms including PPO and DPO
Active community and maintained by Hugging Face

Cons

Requires solid understanding of reinforcement learning concepts
Training is computationally expensive compared to standard fine-tuning
Limited to models compatible with the Hugging Face ecosystem

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses1entry

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

Built with1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Alternative to2entries

O OSS Framework medium

OpenRLHF

Community

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

★ 9,583 updated 1mo ago

O OSS Framework medium

veRL

Community

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

★ 21,691 updated 1mo ago

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →