O Open Source Frameworks medium

veRL

by Community

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Visit Community View repo Submit your build →

OSS

veRL

Added 1 June 2026

Overview

veRL is a Python framework for reinforcement learning post-training of large language models. It provides a flexible architecture for running RL workflows at scale, supporting distributed training across multiple GPUs and optimized inference pipelines. The framework handles reward modeling, policy optimization, and generation sampling in a modular design.

Best for

Best for
ML engineers building custom RL post-training pipelines for LLMs at scale

Use cases

Fine-tuning LLMs with RL objectives like RLHF or DPO
Running distributed RL experiments across GPU clusters
Building custom reward models and policy optimization loops

Notes

21,691 stars on GitHub. Last updated 2026-06-01. Licensed Apache-2.0.

Use cases

Fine-tuning LLMs with RL objectives like RLHF or DPO
Running distributed RL experiments across GPU clusters
Building custom reward models and policy optimization loops

Pros

Modular architecture allows swapping components like reward models and optimizers
Optimized for distributed training with efficient GPU utilization
Active community project with 21k+ stars indicating adoption and maintenance

Cons

Requires significant infrastructure and GPU resources to run effectively
Steeper learning curve compared to higher-level fine-tuning APIs
Documentation and examples may be limited relative to mainstream frameworks

Indexed from awesome-llm and enriched against its public facts.

Pros

Modular architecture allows swapping components like reward models and optimizers
Optimized for distributed training with efficient GPU utilization
Active community project with 21k+ stars indicating adoption and maintenance

Cons

Requires significant infrastructure and GPU resources to run effectively
Steeper learning curve compared to higher-level fine-tuning APIs
Documentation and examples may be limited relative to mainstream frameworks

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses3entries

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

vLLM

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 81,619 updated 1mo ago

Built with1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 1mo ago

Pairs with2entries

O OSS Framework medium

DeepSpeed

Community

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

★ 42,436 updated 1mo ago

O OSS Framework medium

vLLM

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 81,619 updated 1mo ago

Alternative to1entry

O OSS Framework medium

OpenRLHF

Community

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

★ 9,583 updated 1mo ago

Pairs with2entries

O OSS Framework medium

Awesome LLM Human Preference Datasets

Community

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

★ 391 updated 2y ago

O OSS Framework medium

DeepSeek-R1

Community

First-generation reasoning models from DeepSeek.

★ 92,010 updated 1y ago

Alternatives3entries

O OSS Framework medium

OpenRLHF

Community

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

★ 9,583 updated 1mo ago

O OSS Framework medium

ROLL

Community

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

★ 3,193 updated 1mo ago

O OSS Framework medium

TRL

Community

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Free 27-page guide

Get the free Developer’s Field Guide

A 27-page field guide to the AI coding workflow with Claude. Claude Code, MCP servers, the prompt patterns that work, and what to delegate. Free.

Enter your work email. We send it straight over, plus a few short notes worth knowing. Unsubscribe any time.

← Back to Open Source Submit your own entry →