generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
🚀 Enhance GRPO VLLM server from sync to async and accelerate training
#3182
opened Mar 30, 2025 by
binary-husky
Loading…
Fixing GRPO
reward_func being a model with DeepSpeed ZeRO-3
#2984
opened Feb 28, 2025 by
jamesbraza
Loading…
Add the metrics completion_length_max and completion_length_min
#2930
opened Feb 22, 2025 by
dignfei
Loading…
4 tasks
Allow for saving the PPOTrainer value model (critic model)
#3308
opened Apr 16, 2025 by
AMindToThink
Loading…
Add
config_init_kwargs option in GRPOConfig
#4069
opened Sep 12, 2025 by
hokuyama0106
Loading…
2 of 5 tasks
ClearML logging of visualization in RewardTrainer evaluation
#3602
opened Jun 16, 2025 by
ioverho
Loading…
2 of 5 tasks
Add basic support for FSDP/Lora when using TRL/VLLM
#3735
opened Jul 14, 2025 by
ojh31
Loading…
5 tasks
Reintroduce
generate method for PPOTrainer
#3374
opened Apr 27, 2025 by
CloseChoice
Loading…
4 tasks done
Allow an user to train from a local dataset
#3470
opened May 19, 2025 by
gogo2464
Loading…
1 of 5 tasks
Update
max_length explanation for VLM trainers
#4220
opened Oct 7, 2025 by
sergiopaniego
Loading…
5 tasks
[Draft] Add configurable dataset column logging to GRPOTrainer W&B tables
#4045
opened Sep 9, 2025 by
davanstrien
•
Draft
[#3647] Fix: Assign default values in the GKDTrainer's constructor only when …
#3851
opened Aug 5, 2025 by
seungduk-yanolja
Loading…
2 of 5 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.