ROLL x Ascend

Last updated: 09/28/2025.

We have added support for Huawei Ascend devices in ROLL.

Hardware Support

Atlas 900 A2 PODc

Installation

Basic Environment Setup

| Software | Version |
| --- | --- |
| Python | 3.10 |
| CANN | 8.1.RC1 |

Create Conda Environment

Use the following commands to create a new conda environment in Miniconda:

conda create --name roll python=3.10
conda activate roll
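Before installing anything else, it can be worth confirming that the active interpreter really is the 3.10 build from the new environment, since later wheels (torch_npu, vllm) are resolved against it. A minimal check; the helper name `interpreter_matches` is ours, purely for illustration:

```python
import sys

# The Ascend setup in this guide targets Python 3.10; warn early if the
# active interpreter does not match so the later pip installs resolve
# compatible wheels.
def interpreter_matches(required=(3, 10)) -> bool:
    return sys.version_info[:2] == required

if not interpreter_matches():
    print(f"warning: expected Python 3.10, got {sys.version.split()[0]}")
```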

Install torch & torch_npu

To use torch and torch_npu in ROLL, install them using the commands below:

# Install CPU version of torch
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cpu

# Install torch_npu
pip install torch_npu==2.5.1
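Once both packages are in place, a quick sanity check confirms that `torch_npu` imports cleanly and registers the `npu` device with torch. The guarded imports below are our addition so the script degrades gracefully on machines without the packages installed; `torch.npu.is_available()` is the availability check used in torch_npu's own examples:

```python
def check_npu() -> str:
    """Report whether torch_npu is importable and an Ascend NPU is visible."""
    try:
        import torch
        import torch_npu  # importing registers the 'npu' device with torch
    except ImportError as exc:
        return f"not installed: {exc.name}"
    return "npu available" if torch.npu.is_available() else "npu not visible"

print(check_npu())
```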

Install vllm & vllm-ascend

To use vllm in ROLL, compile and install vllm and vllm-ascend as follows:

# vllm
git clone -b v0.8.4 --depth 1 https://github.com/vllm-project/vllm.git
cd vllm

VLLM_TARGET_DEVICE=empty pip install -v -e .
cd ..
# vllm-ascend
git clone -b v0.8.4rc2 --depth 1 https://github.com/vllm-project/vllm-ascend.git
cd vllm-ascend

export COMPILE_CUSTOM_KERNELS=1
pip install -e .
cd ..
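A similar import check verifies the vllm build before moving on. The guarded import is our addition so the snippet also runs where vllm is absent; `vllm.__version__` is the package's standard version attribute:

```python
def check_vllm() -> str:
    """Confirm vllm imports and report its version."""
    try:
        import vllm
    except ImportError:
        return "vllm not installed"
    return f"vllm {vllm.__version__}"

print(check_vllm())
```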

If you encounter an error like this during the vllm-ascend installation:

RuntimeError: CMake configuration failed: Command '['/pathto/miniconda3/envs/roll/bin/python3.10', '-m', 'pybind11', '--cmake']' returned non-zero exit status 2.

Try modifying lines 151–158 in setup.py under the vllm-ascend directory as follows, then recompile:

try:
    # if pybind11 is installed via pip
    pybind11_cmake_path = (subprocess.check_output(
        [python_executable, "-m", "pybind11",
         "--cmakedir"]).decode().strip())
except subprocess.CalledProcessError as e:
    # else specify pybind11 path installed from source code on CI container
    raise RuntimeError(f"CMake configuration failed: {e}")
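The snippet above fails exactly when `python -m pybind11 --cmakedir` exits non-zero, which typically means pybind11 is not installed as a pip package in the active environment; `pip install pybind11` usually makes the lookup succeed. The probe can be reproduced standalone (the helper name is ours):

```python
import subprocess
import sys

def pybind11_cmake_dir():
    """Ask the pip-installed pybind11 for its CMake config directory,
    mirroring the subprocess call in vllm-ascend's setup.py."""
    try:
        out = subprocess.check_output(
            [sys.executable, "-m", "pybind11", "--cmakedir"],
            stderr=subprocess.DEVNULL)
    except subprocess.CalledProcessError:
        return None  # pybind11 missing: `pip install pybind11` and retry
    return out.decode().strip()

print(pybind11_cmake_dir())
```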

Install ROLL

git clone https://github.com/alibaba/ROLL.git
cd ROLL
pip install -r requirements_common.txt
pip install deepspeed==0.16.0
cd ..

Additional Third-Party Libraries

| Software | Description |
| --- | --- |
| transformers | v4.52.4 |
| flash_attn | not supported |
| transformer-engine[pytorch] | not supported |

  1. transformers v4.52.4 supports enabling --flash_attention_2.
  2. flash_attn acceleration is not supported.
  3. transformer-engine[pytorch] is currently not supported.

pip install transformers==4.52.4

Quick Start: Single-Node Deployment

Before running full workloads, we recommend testing the single-node pipeline to verify your environment and installation. Since Megatron-LM training is not yet supported on Ascend, first change strategy_args in the relevant config files to the deepspeed option.

  1. Run the single-node pipeline via shell:
bash examples/agentic_demo/run_agentic_pipeline_frozen_lake_single_node_demo.sh
  2. Run the agentic pipeline using a config file:
# Make sure you are in the root directory of the ROLL project
# export PYTHONPATH=$(pwd):$PYTHONPATH

python examples/start_agentic_pipeline.py \
--config_path qwen2.5-0.5B-agentic \
--config_name agentic_val_sokoban
  • --config_path – Directory containing your YAML configuration files.
  • --config_name – Filename (without the .yaml extension).
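The two flags combine into a single YAML file path; a small sketch of the convention (the helper `resolve_config` is ours, not part of ROLL):

```python
from pathlib import Path

def resolve_config(config_path: str, config_name: str) -> Path:
    # --config_path is a directory; --config_name is the filename
    # without its .yaml extension.
    return Path(config_path) / f"{config_name}.yaml"

print(resolve_config("examples/qwen2.5-0.5B-agentic",
                     "agentic_val_sokoban").as_posix())
# examples/qwen2.5-0.5B-agentic/agentic_val_sokoban.yaml
```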

Current Support Status

| Feature | Example | Training Backend | Inference Backend | Hardware |
| --- | --- | --- | --- | --- |
| Agentic | examples/qwen2.5-0.5B-agentic/run_agentic_pipeline_sokoban.sh | DeepSpeed | vLLM | Atlas 900 A2 PODc |
| Agentic-Rollout | examples/qwen2.5-0.5B-agentic/run_agentic_rollout_sokoban.sh | DeepSpeed | vLLM | Atlas 900 A2 PODc |
| DPO | examples/qwen2.5-3B-dpo_megatron/run_dpo_pipeline.sh | DeepSpeed | vLLM | Atlas 900 A2 PODc |
| RLVR | examples/qwen2.5-7B-rlvr_megatron/run_rlvr_pipeline.sh | DeepSpeed | vLLM | Atlas 900 A2 PODc |

Disclaimer

The Ascend support provided in ROLL is intended as a reference example. For production use, please consult official channels.