generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Allow for saving the PPOTrainer value model (critic model)
#3308
opened Apr 16, 2025 by
AMindToThink
Loading…
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
feat: Add Multi-Token Prediction (MTP) support to SFTTrainer
#4290
opened Oct 15, 2025 by
KLGR123
Loading…
[#3647] Fix: Assign default values in the GKDTrainer's constructor only when …
#3851
opened Aug 5, 2025 by
seungduk-yanolja
Loading…
2 of 5 tasks
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
Loading…
5 tasks
Add tip for logging evaluation metrics during regular evaluations
#4367
opened Oct 29, 2025 by
cam1llynha
Loading…
Remove FSDP1 support: use FSDP2 exclusively
#4260
opened Oct 11, 2025 by
behroozazarkhalili
Loading…
Fix: ignore precompute_ref_log_probs when use_liger_loss=True
#4008
opened Sep 4, 2025 by
ginkyenglee
Loading…
5 tasks
Use explicit tiny-Qwen2ForCausalLM-2.5 model_id param in CI tests
#4331
opened Oct 23, 2025 by
albertvillanova
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.