Skip to content

Conversation

@HIT-cwh
Copy link
Collaborator

@HIT-cwh HIT-cwh commented Jun 17, 2024

…t shard moe

@pppppM pppppM merged commit bddf85d into InternLM:main Jun 17, 2024
@HIT-cwh HIT-cwh deleted the fix_hf_ckpt_hook_bugs branch June 17, 2024 06:13
llkn-2 pushed a commit to llkn-2/xtuner that referenced this pull request Jul 31, 2024
… withou… (InternLM#774)

fix HFCheckpointHook bugs when training deepseekv2 and mixtral without shard moe
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants