-
-
Notifications
You must be signed in to change notification settings - Fork 10.6k
[Kernel] some optimizations for dense marlin and moe marlin #16850
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
simon-mo
merged 54 commits into
vllm-project:main
from
jinzhen-lin:marlin-kernel-optimization
May 5, 2025
Merged
Changes from all commits
Commits
Show all changes
54 commits
Select commit
Hold shift + click to select a range
7db8212
marlin optimization
jinzhen-lin 48874db
fix
jinzhen-lin e13c8a1
fix
jinzhen-lin dd8a5e1
fix
jinzhen-lin fd16948
fix
jinzhen-lin ac5dc47
fix
jinzhen-lin cb8229c
fix
jinzhen-lin 8bac124
fix moe performance bad cases
jinzhen-lin 649701b
fix dense marlin performance bad cases
jinzhen-lin 5fa7f33
some fix
jinzhen-lin eb3f2ed
fix
jinzhen-lin 15daa36
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin fb85636
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 72a6ded
fix
jinzhen-lin 367b5d9
remove kU8
jinzhen-lin 90e1063
fix name
jinzhen-lin f42ac97
fix and add comment
jinzhen-lin 110bbb8
fix
jinzhen-lin 30dbb98
fix
jinzhen-lin 720b900
fix
jinzhen-lin 02d33ed
fix
jinzhen-lin c9adb76
fix
jinzhen-lin 855efb0
fix
jinzhen-lin 63c23a9
fix
jinzhen-lin 4887c4d
fix
jinzhen-lin 21047a3
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 1349629
fix 'variable "xxx" was declared but never referenced' warning
jinzhen-lin 31f65ce
rerun
jinzhen-lin e2d255a
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 5f0370a
fix
jinzhen-lin d872ceb
fix
jinzhen-lin a538f5b
rerun
jinzhen-lin d29a12e
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 3dbd866
fix 'variable "xxx" was declared but never referenced'
jinzhen-lin fcac83b
fix
jinzhen-lin f55339e
fix
jinzhen-lin 2656b1b
update
jinzhen-lin 1dd2f2b
update
jinzhen-lin 0523d57
fix
jinzhen-lin a3345bb
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 2df71dc
fix
jinzhen-lin bdb9f10
add comment
jinzhen-lin ade7fcd
fix
jinzhen-lin a86e539
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 305c0bd
fix
jinzhen-lin aa36125
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin da5d871
rerun
jinzhen-lin e1cec3c
rerun
jinzhen-lin 645f16f
fix
jinzhen-lin 37c4c43
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin 8444f78
fix
jinzhen-lin 77addcd
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin dd5cce2
fix
jinzhen-lin 7dd8299
Merge remote-tracking branch 'origin/main' into marlin-kernel-optimiz…
jinzhen-lin File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
kernel_*.cu |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.