Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Iterative training scripts for SPIN and SPPO
#3011 opened Mar 5, 2025 by jkx19 Draft
3 of 5 tasks
Online-dpo-ben
#4252 opened Oct 10, 2025 by burtenshaw Draft
5 tasks
[DRAFT] Refactor DPO
#3906 opened Aug 15, 2025 by qgallouedec Draft
6 tasks
Gold refactor
#4373 opened Oct 29, 2025 by qgallouedec Draft
5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
Updated OpenEnv docs
#4418 opened Oct 31, 2025 by sergiopaniego Draft
8 tasks
Tool call
#4300 opened Oct 18, 2025 by qgallouedec Draft
5 tasks
[WIP] vllm-server-spec-dec-support
#3643 opened Jun 24, 2025 by shirinyamani Loading…
5 tasks
dynamic temperature
#3844 opened Aug 4, 2025 by shirinyamani Draft
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602 opened Jun 16, 2025 by ioverho Loading…
2 of 5 tasks
Test in distributed setting
#3902 opened Aug 15, 2025 by qgallouedec Loading…
5 tasks
wip - env
#4320 opened Oct 22, 2025 by qgallouedec Loading…
5 tasks
GRPO: Pack Responses within the same group.
#3642 opened Jun 24, 2025 by pramodith Draft
4 of 5 tasks
feat: Initial implementation of RePO trainer and components
#3655 opened Jun 26, 2025 by celsowm Loading…
5 tasks
[WIP] PEFT 🤝 Liger DPO
#3065 opened Mar 12, 2025 by SalmanMohammadi Draft
5 tasks
Create "Talks" subsection
#4414 opened Oct 31, 2025 by sergiopaniego Loading…
5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.