Skip to content

Conversation

@jinzhen-lin
Copy link
Contributor

@jinzhen-lin jinzhen-lin commented Dec 31, 2025

After #29642 , the ep support for moe marlin kernel is no longer required. Therefore, we can remove this dead code.

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request removes unused is_ep (expert parallelism) logic from the MoE Marlin kernel. The changes are consistent across Python, C++, and CUDA files, simplifying the codebase by removing dead code. This refactoring is a good improvement for maintainability, as it makes the kernel logic more straightforward. The assumption is that expert parallelism details are now handled before the kernel is invoked, which is a sound design choice. The changes appear correct and well-executed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

1 participant