generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Draft] Add configurable dataset column logging to GRPOTrainer W&B tables
#4045
opened Sep 9, 2025 by
davanstrien
•
Draft
Fix #3982: Fix DPO Trainer support for Gemma 3 vision models
#4022
opened Sep 6, 2025 by
akshay-babbar
Loading…
Fix: undefined
current_gradient_accumulation_steps
#4014
opened Sep 5, 2025 by
ysjprojects
Loading…
2 of 5 tasks
Fix: ignore precompute_ref_log_probs when use_liger_loss=True
#4008
opened Sep 4, 2025 by
ginkyenglee
Loading…
5 tasks
Enable saving and loading precomputed reference log probabilities in …
#3986
opened Sep 1, 2025 by
ginkyenglee
Loading…
3 tasks
[#3647] Fix: Assign default values in the GKDTrainer's constructor only when …
#3851
opened Aug 5, 2025 by
seungduk-yanolja
Loading…
2 of 5 tasks
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758
opened Jul 23, 2025 by
almeidava93
Loading…
2 of 5 tasks
Add basic support for FSDP/Lora when using TRL/VLLM
#3735
opened Jul 14, 2025 by
ojh31
Loading…
5 tasks
feat: Initial implementation of RePO trainer and components
#3655
opened Jun 26, 2025 by
celsowm
Loading…
5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508
opened May 29, 2025 by
shaischaudhry
Loading…
3 of 5 tasks
ProTip!
no:milestone will show everything without a milestone.