Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Updated OpenEnv docs
#4418 opened Oct 31, 2025 by sergiopaniego Draft
8 tasks
Add On-Policy Distillation from thinking labs to paper index.
#4410 opened Oct 30, 2025 by pramodith Loading…
4 of 5 tasks
Gold refactor
#4373 opened Oct 29, 2025 by qgallouedec Draft
5 tasks
[OpenENV] Openenv rollout_func signature proposal
#4344 opened Oct 27, 2025 by kashif Loading…
5 tasks
wip - env
#4320 opened Oct 22, 2025 by qgallouedec Loading…
5 tasks
refactor: simplify parameter freezing in modeling_base.py
#4305 opened Oct 20, 2025 by Ki-Seki Loading…
2 of 5 tasks
[SFT] Log mean token accuracy from Liger kernel
#4302 opened Oct 18, 2025 by kashif Loading…
5 tasks
Tool call
#4300 opened Oct 18, 2025 by qgallouedec Draft
5 tasks
Fix DPO Trainer Bug For Qwen2-VL (Issue 2660)
#4257 opened Oct 11, 2025 by FabianSchuetze Loading…
1 of 3 tasks
Online-dpo-ben
#4252 opened Oct 10, 2025 by burtenshaw Draft
5 tasks
Add support for Python 3.14
#4225 opened Oct 8, 2025 by albertvillanova Loading…
Add trust_remote_code to GRPOConfig
#4186 opened Oct 1, 2025 by muupan Loading…
3 of 4 tasks
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8 Loading…
2 of 5 tasks
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects Loading…
2 of 5 tasks
ProTip! What’s not been updated in a month: updated:<2025-10-02.