Skip to content

Conversation

@HIT-cwh
Copy link
Collaborator

@HIT-cwh HIT-cwh commented May 8, 2024

No description provided.

@pppppM pppppM merged commit f7d1aea into InternLM:main Jun 13, 2024
llkn-2 pushed a commit to llkn-2/xtuner that referenced this pull request Jul 31, 2024
* support deepseek v2

* fix dispatch

* refactor deepseek v2

* fix lint

* fix bugs

* fix bugs

* delete useless codes

* refactor deepseek config

* rewrite DeepseekV2PreTrainedModel.from_pretrained

* revert sft.py to main

* delete useless codes

* add deepseek v2 config

* add deepseek readme

* add HFCheckpointHook

* optimize mixtral moe

* fix bugs

* delete useless codes

* delete evalchathook

* fix bugs

* fix bugs

* add moe SUPPORT_MODELS and fix HFCheckpointHook

* add moe SUPPORT_MODELS and fix HFCheckpointHook

* fix bugs

* refactor modeling_deepseek

* update deepseek readme

* support deepseek v2 lite

* fix bugs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants