AI & Machine Learning

openrlhf-training

openrlhf-training

High-performance RLHF framework with Ray+vLLM acceleration

Category

AI & Machine Learning

Developer

Updated

Jan

2026

Tags

2

Total

Description

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

Skill File

SKILL.md

1High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

Tags

AiUi

Information

Developerdavila7

CategoryAI & Machine Learning

CreatedJan 15, 2026

UpdatedJan 15, 2026

View Source Documentation

You Might Also Like

add-uint-support

Add Uint Support

Add unsigned integer (uint) type support to PyTorch operators by updating AT_DISPATCH macros

docstring

Docstring

Write docstrings for PyTorch functions and methods following PyTorch conventions

skill-creator

Skill Creator

Guide for creating effective skills

claude-opus-4-5-migration

Claude Opus 4 5 Migration

Migrate prompts and code from Claude Sonnet 4

agent-identifier

Agent Identifier

This skill should be used when the user asks to "create an agent", "add an agent", "write a subag...

command-development

Command Development

This skill should be used when the user asks to "create a slash command", "add a command", "write...