Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

This is the codebase for:

Not All Models Suit Expert Offloading:
On Local Routing Consistency of Mixture-of-Expert Models

Requirements

Set up a virtual environment with Python 3.13, and run pip install -r requirements.txt to install dependencies. You will also need scattermoe and smoe to run LLaMA-MoE-v2.
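
As a quick sanity check after installation, the sketch below reports whether the interpreter is recent enough and whether the two extra packages can be imported. It is not part of this codebase: the function name check_environment is hypothetical, it assumes the import names are scattermoe and smoe (matching the package names above), and it treats Python 3.13 as a minimum version rather than an exact requirement.

```python
import importlib.util
import sys

# Assumption: the import names match the package names mentioned in the README.
OPTIONAL_MODULES = ("scattermoe", "smoe")


def check_environment(modules=OPTIONAL_MODULES, min_version=(3, 13)):
    """Report whether Python is recent enough and which modules are importable."""
    # Compare only (major, minor) against the assumed minimum version.
    report = {"python_ok": sys.version_info[:2] >= min_version}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        report[name] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for key, ok in check_environment().items():
        print(f"{key}: {'ok' if ok else 'missing'}")
```

A missing scattermoe or smoe entry only matters if you plan to run LLaMA-MoE-v2.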

Usage

Cite

@misc{liang2025modelssuitexpertoffloading,
      title={Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models}, 
      author={Jingcong Liang and Siyuan Wang and Miren Tian and Yitong Li and Duyu Tang and Zhongyu Wei},
      year={2025},
      eprint={2505.16056},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2505.16056}, 
}
