跳到主要内容
ROLL Logo

ROLL: Reinforcement Learning Optimization for Large-Scale Learning

🚀 An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models 🚀

LicenseGitHub issuesRepo starsWeChat QR

ROLL is an efficient and user-friendly RL library designed for Large Language Models (LLMs) utilizing Large Scale GPU resources. It significantly enhances LLM performance in key areas such as human preference alignment, complex reasoning, and multi-turn agentic interaction scenarios.

Leveraging a multi-role distributed architecture with Ray for flexible resource allocation and heterogeneous task scheduling, ROLL integrates cutting-edge technologies like Megatron-Core, SGLang and vLLM to accelerate model training and inference.

[08/11/2025] 🎉 Our Paper released, see Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning. [06/09/2025] 🎉 ROLL tech report is now available! Access the report here.


🚀 快速入门

文档

快速开始

安装指南
配置系统详解
调试指南
Trackers和Metrics
检查点保存与恢复指南
模型转换为 Hugging Face 格式
快速上手:单机版部署指南
快速上手:多节点部署指南
常见问题解答 (Q&A)

使用指南


We welcome contributions from the community! 🤝