Skip to content
View giangdip2410's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report giangdip2410

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. HyperRouter HyperRouter Public

    Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"

    Python 33 3

  2. CompeteSMoE CompeteSMoE Public

    Code for this paper "CompeteSMoE - Effective Sparse Mixture of Experts Training via Competition"

    Python 6 3

  3. DANet DANet Public

    DANets (a family of neural networks) for tabular data classification/ regression.

    Python 5 6

  4. SimSMoE SimSMoE Public

    Code for this paper "SimSMoE: Toward Efficient Training Mixture of Experts via Solving Representational Collapse".

    Python 5

  5. Brainformer-SMOE Brainformer-SMOE Public

    Brainformer SMOE

    Python 3

  6. VQMoE VQMoE Public

    Code for this paper "On the Role of Discrete Representation in Sparse Mixture of Experts".

    Python 3