add `num_worst_token` #59

inkcherry · 2025-09-03T03:54:45Z

Motivation

Make `max_token_recv_per_rank` adjustable, which reduces memory overhead in some balancing scenarios.
FYI @zhenhuang12

TianDi101 · 2025-09-03T08:52:08Z

@inkcherry Thanks for this PR! @isytwu Could you please help review this? The idea is very similar to what we have discussed before to reduce memory usage.

isytwu · 2025-09-03T09:21:52Z

include/mori/ops/dispatch_combine/dispatch_combine.hpp


  inline __host__ __device__ int MaxNumTokensToRecv() const {
+    if (numWorstToken != 0) {
+      return numWorstToken;


I wonder whether `return worldSize * numWorstToken` would be better?

isytwu · 2025-09-04 (edited)

Perhaps using `min(numWorstToken, worldSize * MaxNumTokensToRecvPerRank())` would guard against users passing overly large values. Also, should `MaxNumTokensToSend()` be changed to take `numWorstToken` into account as well?
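For illustration, the clamping suggested above could look roughly like the sketch below. This is not the code in the PR: `SketchConfig` and its `maxTokensPerRank` field are hypothetical stand-ins, the fallback of `worldSize * MaxNumTokensToRecvPerRank()` is assumed from the review comment, and the `__host__ __device__` qualifiers are omitted so the snippet builds with a plain C++ compiler.

```cpp
#include <algorithm>

// Hypothetical stand-in for the real config struct. Only numWorstToken,
// worldSize, MaxNumTokensToRecvPerRank() and MaxNumTokensToRecv() are names
// taken from the discussion; everything else is assumed for illustration.
struct SketchConfig {
  int worldSize = 1;
  int maxTokensPerRank = 0;  // assumed per-rank receive bound
  int numWorstToken = 0;     // 0 means "not set by the user"

  int MaxNumTokensToRecvPerRank() const { return maxTokensPerRank; }

  int MaxNumTokensToRecv() const {
    // Assumed fallback bound: every rank sends its per-rank maximum.
    const int upperBound = worldSize * MaxNumTokensToRecvPerRank();
    if (numWorstToken != 0) {
      // Clamp so an overly large user value cannot exceed the fallback bound.
      return std::min(numWorstToken, upperBound);
    }
    return upperBound;
  }
};
```

With this shape, a user-supplied `numWorstToken` can only shrink the receive buffer bound, never grow it past the value used when the option is unset.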

add num_worst_token

c0a9974

TianDi101 requested a review from isytwu September 3, 2025 08:49

isytwu reviewed Sep 3, 2025
