Skip to content

Conversation

@pluesclues
Copy link
Collaborator

@pluesclues pluesclues commented Sep 29, 2025

Need to fix gradient accumulation and loss types due to upstream changes in TRL and transformers.

https://github.com/huggingface/trl/pull/3938/files

huggingface/transformers#38837

Relies:

unslothai/unsloth-zoo#308

@pluesclues pluesclues changed the title Grpo gradient accumulation edits Oct 1, 2025
@pluesclues pluesclues changed the title Grpo gradient accumulation edits (WIP) Oct 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants