Skip to content

Conversation

@HIT-cwh
Copy link
Collaborator

@HIT-cwh HIT-cwh commented Apr 17, 2024

  1. Support Qwen 1.5 moe (attn, varlen attn, sequence parallel)
  2. Set moe blocks zero3 leaf modules (Reference: Add API to set a module as a leaf node when recursively setting Z3 hooks deepspeedai/DeepSpeed#4966)
@pppppM pppppM merged commit d722775 into InternLM:main Apr 19, 2024
llkn-2 pushed a commit to llkn-2/xtuner that referenced this pull request Jul 31, 2024
* support qwen moe dispatch

* fix qwen and mistral config for auto sp
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants