Conversation

rwightman (Collaborator)

Switching to RMS Norm

Fixes for correct export

Tinkering with 'mobilenetv5' details, fixing some issues with MSFA

A few tweaks and comments to example MNV5 impl

Update RmsNorm2d modules to use own 2d eager kernel instead of torch rms_norm w/ permute (sketched below, after the commit list)

Fix propagation of act_layer to RmsNormAct*, use ConvNormAct for stem instead of just Conv2d

Fixes from weights conversion

Plumbing norm_layer through to MultiQueryAttention2d

impl forward_features for Transformers compatibility

Adding forward_* APIs to MobileNetV5Encoder

cleanup

cleanup, model entrypoint rename

Large redundant with 300m

Update input size for configs

Fix stem conv layer name

fix: always norm in MSFA

Always call final MSFA norm layer

Remove some FIXMEs, fix MSFA docstring. Remove use_layer_scale and rely on layer scale values == None; not currently used in any case.

Adds a TSV describing the full params from an Orbax checkpoint
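A minimal sketch of the RmsNorm2d change mentioned in the commit list (own 2d eager kernel on NCHW tensors instead of torch rms_norm with permutes). The names `RmsNorm2dSketch` and `rms_norm_2d_via_permute` are illustrative only, not the actual timm code, and the comparison path assumes PyTorch >= 2.4 for `F.rms_norm`:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class RmsNorm2dSketch(nn.Module):
    """Illustrative RMSNorm over the channel dim of an NCHW tensor.

    Sketch of the idea only (normalize directly on dim=1 in eager mode),
    not the actual timm RmsNorm2d implementation.
    """

    def __init__(self, num_channels: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(num_channels))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        dtype = x.dtype
        # mean of squares over channels, computed in fp32, no permutes needed
        v = x.float().pow(2).mean(dim=1, keepdim=True)
        x = x.float() * torch.rsqrt(v + self.eps)
        x = x * self.weight.view(1, -1, 1, 1)
        return x.to(dtype)


def rms_norm_2d_via_permute(x, weight, eps: float = 1e-6):
    # The permute-based alternative the commit moves away from (PyTorch >= 2.4).
    x = x.permute(0, 2, 3, 1)                       # NCHW -> NHWC
    x = F.rms_norm(x, (x.shape[-1],), weight, eps)  # normalize over channels
    return x.permute(0, 3, 1, 2)                    # NHWC -> NCHW
```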

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

rwightman merged commit 1f69a52 into main on Jun 26, 2025
26 checks passed
@JulienMaille

Cool, do you have any plans to pretrain it?

@rwightman (Collaborator, Author)

@JulienMaille possibly yes, I may do something at a 'base' or slightly smaller size as a reference.

Nearer term, I will bring over the image-encoder-only weights as timm models for fine-tuning. Currently the model def is just being used as the encoder for the full weights as loaded in transformers. There wasn't time to coordinate the other bits before release.
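For reference, a rough sketch of what encoder-only fine-tune usage might look like once those weights land in timm. The model name, pretrained flag, and 768x768 input size below are assumptions rather than confirmed release details; forward_features is the API this PR wires up:

```python
import timm
import torch

# Hypothetical model name; check timm.list_models('mobilenetv5*') once weights are released.
model = timm.create_model('mobilenetv5_300m', pretrained=False)
model.eval()

x = torch.randn(1, 3, 768, 768)  # assumed input size, per the config updates in this PR
with torch.no_grad():
    feats = model.forward_features(x)  # forward_features added for Transformers compatibility
print(feats.shape)
```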

@rwightman (Collaborator, Author)

It should be pointed out that the '300m' size is the only official Google size, as used in Gemma 3n. The 'base' definition here is my own scale-down, used for testing and validating the architecture. I did some initial epochs of pretraining, etc. as sanity checks, though I might tweak or change that model def before any final weights are trained on my end.
