[draft](eplb): add per-layer expert-load statistics monitor for EP path by JiaoliangYu · Pull Request #1210 · ROCm/ATOM

JiaoliangYu · 2026-06-15T00:51:32Z

Collect per-layer, per-expert token counts from the MORI EP dispatch output (dispatch_recv_token_num) into a windowed ExpertLoadMonitor. Logs avg/max/balancedness and can emit a one-shot offline rebalance plan (hot/cold experts) via the offline_eplb_rebalance utility command.
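To illustrate the windowed collection described above, here is a minimal sketch. It is not the code in this PR: the class body, method names (`record`, `window_counts`, `stats`, `hot_cold`), and the window length are hypothetical. It assumes per-expert token counts are already available as a vector per layer per step (how they are derived from `dispatch_recv_token_num` is not shown in the description), and it assumes balancedness means average load divided by maximum load.

```python
from collections import deque

import numpy as np


class ExpertLoadMonitor:
    """Hypothetical sketch of a windowed per-layer, per-expert token counter."""

    def __init__(self, num_layers: int, num_experts: int, window: int = 100):
        self.num_experts = num_experts
        # One bounded deque per layer; the oldest step drops out automatically.
        self._history = [deque(maxlen=window) for _ in range(num_layers)]

    def record(self, layer: int, recv_token_num) -> None:
        """Store one step of per-expert token counts for a layer."""
        counts = np.asarray(recv_token_num, dtype=np.int64)
        if counts.shape != (self.num_experts,):
            raise ValueError(f"expected {self.num_experts} counts, got {counts.shape}")
        self._history[layer].append(counts)

    def window_counts(self, layer: int) -> np.ndarray:
        """Sum the per-expert counts over the steps currently in the window."""
        steps = list(self._history[layer])
        if not steps:
            return np.zeros(self.num_experts, dtype=np.int64)
        return np.stack(steps).sum(axis=0)

    def stats(self, layer: int) -> dict:
        """Return avg, max, and balancedness (assumed here to be avg / max)."""
        counts = self.window_counts(layer)
        avg = float(counts.mean())
        peak = int(counts.max())
        balancedness = avg / peak if peak > 0 else 1.0
        return {"avg": avg, "max": peak, "balancedness": balancedness}

    def hot_cold(self, layer: int, k: int = 1):
        """Return the k most loaded and k least loaded expert indices."""
        order = np.argsort(self.window_counts(layer), kind="stable")
        return order[::-1][:k].tolist(), order[:k].tolist()


monitor = ExpertLoadMonitor(num_layers=1, num_experts=4, window=2)
monitor.record(0, [8, 2, 2, 4])
monitor.record(0, [8, 0, 2, 6])
layer0_stats = monitor.stats(0)       # window sum is [16, 2, 4, 10]
hot, cold = monitor.hot_cold(0, k=1)  # expert 0 is hottest, expert 1 coldest
```

A bounded `deque` keeps memory fixed per layer and makes the window slide without explicit eviction. Consistent with the stated scope, the sketch is per-rank only and performs no cross-rank reduction.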

Scope is statistics only: per-rank, no cross-rank all-reduce, and no actual expert weight remap/transfer yet. All gated behind ATOM_ENABLE_EPLB_LOAD_STATS (default off).
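The default-off gate could be read as in the sketch below. Only the variable name `ATOM_ENABLE_EPLB_LOAD_STATS` comes from the description; the helper name and the set of accepted truthy values are assumptions, and the actual parsing in ATOM may differ.

```python
import os

# Assumed truthy spellings; the PR description only says "default off".
_TRUTHY = {"1", "true", "yes", "on"}


def eplb_load_stats_enabled(environ=None) -> bool:
    """Return True only when the flag is explicitly set to a truthy value."""
    env = os.environ if environ is None else environ
    value = env.get("ATOM_ENABLE_EPLB_LOAD_STATS", "0")
    return value.strip().lower() in _TRUTHY
```

Passing a mapping instead of reading `os.environ` directly keeps the check easy to test without mutating the process environment.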

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

JiaoliangYu wants to merge 1 commit into ROCm:main from JiaoliangYu:feat/eplb-expert-load-pass
