Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

🐯 [Liger] add native liger-kernel ORPO loss
#2482 opened Dec 15, 2024 by kashif Loading…
[Liger] Integrate Liger CPO & SimPO
#2506 opened Dec 20, 2024 by Mecoli1219 Loading…
1 of 6 tasks
Add the metrics completion_length_max and completion_length_min
#2930 opened Feb 22, 2025 by dignfei Loading…
4 tasks
Support ReMax Algorithm
#2955 opened Feb 25, 2025 by liziniu Loading…
3 tasks done
[WIP] Iterative training scripts for SPIN and SPPO
#3011 opened Mar 5, 2025 by jkx19 Draft
3 of 5 tasks
[WIP] PEFT 🤝 Liger DPO
#3065 opened Mar 12, 2025 by SalmanMohammadi Draft
5 tasks
GRPO: Scalable training with one LLM/node
#3186 opened Mar 31, 2025 by jglaser Loading…
3 of 5 tasks
Support iterable datasets in GRPO
#3226 opened Apr 3, 2025 by wilrop Loading…
5 tasks
Add a raw generate API to the vLLM server
#3227 opened Apr 3, 2025 by wilrop Loading…
5 tasks
add vllm support for token ids as input
#3280 opened Apr 11, 2025 by wybryan Loading…
[Feat] Suppport SGLang as rollout engine of GRPO trainer
#3370 opened Apr 27, 2025 by ryang-max Loading…
2 of 8 tasks
Reintroduce generate method for PPOTrainer
#3374 opened Apr 27, 2025 by CloseChoice Loading…
4 tasks done
Reintroducing step method in ppo_trainer
#3410 opened May 3, 2025 by jskaf34 Loading…
2 of 5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464 Loading…
1 of 5 tasks
[GKD] Use vllm for the student model
#3475 opened May 21, 2025 by kashif Loading…
5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.