
ROLL: Reinforcement Learning Optimization for Large-Scale Learning

🚀 An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models 🚀


ROLL is an efficient and user-friendly RL library designed for Large Language Models (LLMs) utilizing large-scale GPU resources. It significantly enhances LLM performance in key areas such as human preference alignment, complex reasoning, and multi-turn agentic interaction.

Leveraging a multi-role distributed architecture with Ray for flexible resource allocation and heterogeneous task scheduling, ROLL integrates cutting-edge technologies like Megatron-Core, SGLang, and vLLM to accelerate model training and inference.
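To make the multi-role architecture concrete, the sketch below shows the general pattern in plain Ray: each role (rollout generation, policy training) runs as its own Ray actor with its own resources. The `RolloutWorker` and `TrainerWorker` names and their methods are hypothetical illustrations of the pattern, not ROLL's actual classes or API.

```python
import ray

ray.init()  # starts or connects to a Ray cluster

# In a real deployment each role would request GPUs, e.g. @ray.remote(num_gpus=1);
# resource requests are omitted here so the sketch runs on any machine.
@ray.remote
class RolloutWorker:
    """Generates responses; a real worker would call an inference backend
    such as vLLM or SGLang here."""

    def generate(self, prompts):
        return [{"prompt": p, "response": "<generated text>"} for p in prompts]


@ray.remote
class TrainerWorker:
    """Updates the policy; a real worker would run a training backend
    such as Megatron-Core here."""

    def train_step(self, rollouts):
        return {"loss": 0.0, "num_samples": len(rollouts)}


# Each role is an independent Ray actor, so rollout and training can be
# placed on different hardware and scaled independently.
rollout = RolloutWorker.remote()
trainer = TrainerWorker.remote()

batch = ray.get(rollout.generate.remote(["What is reinforcement learning?"]))
print(ray.get(trainer.train_step.remote(batch)))
```

Because the roles are independent actors, a scheduler can colocate or separate them freely, which is what makes flexible resource allocation and heterogeneous task scheduling possible.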

[08/11/2025] 🎉 Our paper is released; see Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning.

[06/09/2025] 🎉 The ROLL tech report is now available! Access the report here.


🚀 Quick Start

Documentation

Getting Started

Installation Guide
Quick Start: Single-Node Deployment Guide
Quick Start: Multi-Node Deployment Guide
ROLL Debugging Guide
Frequently Asked Questions (Q&A)

User Guide

Configuration

ROLL Configuration System Explained
ROLL Configuration Guide
ROLL Resource Configuration
Off-Policy Algorithm Configuration Guide
vLLM Inference Backend Configuration Guide
SGLang Inference Backend Configuration Guide
Megatron Inference and Training Backend Configuration Guide
LoRA Fine-Tuning Configuration Guide
FP8 Quantization Configuration Guide
DeepSpeed Training Backend Configuration Guide

Pipelines

VLM RLVR Pipeline
RLVR Pipeline
DPO Pipeline
Distill Pipeline
Agentic Pipeline
Comprehensive Guide: Using the Agentic Part of ROLL

Algorithms

TOPR (Tapered Off-Policy REINFORCE)
Reward Feedback Learning (Reward FL)
Reinforce++
RAFT++ (Reward rAnked Fine-Tuning)
Proximal Policy Optimization (PPO)
Lite PPO
Group Sequence Policy Optimization (GSPO)
Group Relative Policy Optimization (GRPO) (see the sketch below)
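For orientation, GRPO replaces a learned value baseline with a group-relative one: sample a group of responses per prompt and normalize each response's reward by the group's mean and standard deviation. Below is a minimal sketch of that advantage computation under the standard formulation; the function name and `eps` parameter are illustrative, not ROLL's API.

```python
import torch

def group_relative_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """rewards: (num_prompts, group_size) scalar reward per sampled response.

    Each response's advantage is its reward normalized by the mean and
    standard deviation of its own group, so no value network is needed.
    """
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + eps)

# One prompt with a group of four sampled responses:
print(group_relative_advantages(torch.tensor([[1.0, 0.0, 0.5, 1.0]])))
```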

Agentic

Agentic Engineering Practice Guide
TrajWiseLearning: StarPO (State-Thinking-Actions-Reward Policy Optimization)
StepWiseLearning: GiGPO (Group-in-Group Policy Optimization)
Tool Use Guide

Advanced Features

Agentic Asynchronous Parallel Rollout
ROLL Asynchronous Training Guide
Checkpoint Saving and Restoring Guide
Converting MCoreAdapter Models to Hugging Face Format
GPU Time-Division Multiplexing Control Guide

Trackers and Metrics

Trackers and Metrics

Hardware Support

ROLL x Ascend

Development

Architecture

AgenticPipeline
RLVRPipeline

Developer Guide

How to Support New Models
Custom Environments (Env) (see the sketch after this list)
Prompt Generation Guide
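As a companion to the Custom Environments entry above, here is a minimal, gym-style skeleton of the interaction loop a text environment typically implements: `reset` returns the initial observation and `step` consumes the agent's text action and returns observation, reward, done, and info. This is a generic sketch of the pattern only; ROLL's actual Env base class and signatures are documented in the developer guide.

```python
class EchoEnv:
    """Toy single-turn text environment: rewards the agent for echoing a keyword.
    EchoEnv is a hypothetical name used only for illustration."""

    def reset(self) -> str:
        # Return the initial observation (the prompt the agent sees).
        self.prompt = "Repeat after me: hello"
        return self.prompt

    def step(self, action: str):
        # Score the agent's text action and end the (single-turn) episode.
        reward = 1.0 if "hello" in action.lower() else 0.0
        observation, done, info = "", True, {}
        return observation, reward, done, info


env = EchoEnv()
obs = env.reset()
obs, reward, done, info = env.step("hello")
print(reward, done)  # 1.0 True
```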


We welcome contributions from the community! 🤝