Leandro von Werra
Lists / People

Leandro von Werra

TRL library; RLHF tutorials

Machine learning engineer at Hugging Face, where he leads work on post-training and alignment tooling. He created and maintains TRL (Transformer Reinforcement Learning), the widely adopted open-source library for fine-tuning LLMs with PPO, DPO, and related RL methods. He is also known for accessible tutorials demystifying RLHF for the broader ML community.

@jeremyphowardWikipedia
#67AI Researchers Who Communicate
48/ 100
Authority scoreNotable
1lists#67peak1primary1category
Appears alongside

Top neighbors of Leandro von Werra

8people
Last updated
10 May 2026
Suggest an edit →