Pull requests · huggingface/trl

75 Open 2,166 Closed

#3227 opened Apr 3, 2025 by wilrop

5 tasks

#3643 opened Jun 24, 2025 by shirinyamani

5 tasks

#3844 opened Aug 4, 2025 by shirinyamani • Draft

5 tasks

Add config_init_kwargs option in GRPOConfig

#4069 opened Sep 12, 2025 by hokuyama0106

2 of 5 tasks

#3602 opened Jun 16, 2025 by ioverho

2 of 5 tasks

#3735 opened Jul 14, 2025 by ojh31

5 tasks

Reintroduce generate method for PPOTrainer

#3374 opened Apr 27, 2025 by CloseChoice

4 tasks done

#3470 opened May 19, 2025 by gogo2464

1 of 5 tasks

#3902 opened Aug 15, 2025 by qgallouedec

5 tasks

[GSPO]: Refactor _compute_loss

#3835 opened Aug 1, 2025 by pramodith

2 of 5 tasks

Reintroducing step method in ppo_trainer

#3410 opened May 3, 2025 by jskaf34

2 of 5 tasks
