fix qwen text config #41158

zucchini-nlp · 2025-09-25T10:23:19Z

What does this PR do?

Fixes #41020 and ensures consistency in the Qwen-VL text config. Side note: previously we had a flat dict structure in Qwen and, for BC, passed kwargs both to super() and to text_config. This caused confusion in TRL, which apparently resets some text attributes manually when training.

In this PR, Qwen sets and gets text-related attributes only through the text config. The attributes are available from the nested config as config.text_config.vocab_size and, for BC, from the root as config.vocab_size.
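A minimal sketch of the delegation idea described above, for readers who want to see the pattern in isolation. The classes `TextConfigSketch` and `VLConfigSketch` are hypothetical stand-ins written for illustration; this is not the code from the PR, and the real config classes in transformers carry much more logic.

```python
class TextConfigSketch:
    """Hypothetical stand-in for the nested text config."""

    def __init__(self, vocab_size=32000, hidden_size=1024):
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size


class VLConfigSketch:
    """Hypothetical root config that forwards text attributes to text_config."""

    def __init__(self, **text_kwargs):
        # Bypass our own __setattr__ so the sub-config itself is stored on the root.
        object.__setattr__(self, "text_config", TextConfigSketch(**text_kwargs))

    def __setattr__(self, key, value):
        text_config = self.__dict__.get("text_config")
        if text_config is not None and key in vars(text_config):
            # Text attributes are stored only on the nested config.
            setattr(text_config, key, value)
            return
        super().__setattr__(key, value)

    def __getattr__(self, key):
        # Called only when normal lookup fails, i.e. the key is not on the root.
        text_config = self.__dict__.get("text_config")
        if text_config is not None and key in vars(text_config):
            return getattr(text_config, key)
        raise AttributeError(key)


config = VLConfigSketch(vocab_size=1000)
config.vocab_size = 2000  # written through the root, stored on text_config
```

With this shape there is a single source of truth: `config.vocab_size` and `config.text_config.vocab_size` cannot drift apart, because the root never keeps its own copy.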

HuggingFaceDocBuilderDev · 2025-09-25T10:32:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2025-09-25T12:37:41Z

Test failures are not related

Cyrilvallez

Arf, very sad we did not use subconfigs when the model was added 🥲

Cyrilvallez · 2025-09-29T13:29:15Z

src/transformers/models/qwen2_5_vl/configuration_qwen2_5_vl.py

+        super().__init__(**kwargs)
+


I believe this should stay after setting the subconfigs, no? Otherwise the __setattr__ won't work, as it won't see the subconfigs?

Ah, I added it for extra kwargs that users can pass in a flat dict. Otherwise those get serialized in the text config, because we pass all kwargs to the text config.

In normal usage it has no effect; it matters only if someone passes an uncommon kwarg that is not supposed to be in the text config. I think supporting correct attention is more important than a rare edge case, so I'll revert it.
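The ordering concern raised in this thread can be illustrated with a small sketch. It assumes a hypothetical base class (`BaseSketch`) whose `__init__` sets leftover kwargs as attributes, and a hypothetical `super_first` flag added only to compare the two orderings side by side. It shows the reviewer's point in isolation and makes no claim about what the merged code ended up doing.

```python
class TextConfigSketch:
    """Hypothetical nested text config with a default vocab_size."""

    def __init__(self):
        self.vocab_size = 1000


class BaseSketch:
    """Hypothetical base class whose __init__ sets leftover kwargs as attributes."""

    def __init__(self, **kwargs):
        for key, value in kwargs.items():
            setattr(self, key, value)


class RootSketch(BaseSketch):
    def __init__(self, super_first, **kwargs):
        if super_first:
            super().__init__(**kwargs)  # text_config does not exist yet
            object.__setattr__(self, "text_config", TextConfigSketch())
        else:
            object.__setattr__(self, "text_config", TextConfigSketch())
            super().__init__(**kwargs)  # __setattr__ can now forward

    def __setattr__(self, key, value):
        text_config = self.__dict__.get("text_config")
        if text_config is not None and key in vars(text_config):
            setattr(text_config, key, value)
            return
        super().__setattr__(key, value)


early = RootSketch(super_first=True, vocab_size=5)
late = RootSketch(super_first=False, vocab_size=5)
```

In the `early` case the value lands on the root and the text config keeps its default, so the two views disagree. In the `late` case the value is forwarded and only the text config holds it.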

Cyrilvallez · 2025-09-29T13:48:50Z

src/transformers/models/qwen2_5_vl/configuration_qwen2_5_vl.py

+                return setattr(text_config, key, value)
+
+        return super().__setattr__(key, value)


no return for setattr!
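One reading of this comment is that Python's built-in `setattr` always returns `None`, so `return setattr(...)` suggests a meaningful return value where there is none. The sketch below compares the two forms using hypothetical helper names; it is an illustration, not the code that was merged.

```python
class Holder:
    """Hypothetical object standing in for a text config."""


def forward_with_return(target, key, value):
    # Form flagged in review: the returned value is always None.
    return setattr(target, key, value)


def forward_then_return(target, key, value):
    # Same behaviour, but the early exit is explicit.
    setattr(target, key, value)
    return


holder = Holder()
first = forward_with_return(holder, "vocab_size", 10)
second = forward_then_return(holder, "hidden_size", 20)
```

Both helpers set the attribute and both return `None`; the second form just makes that explicit.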


github-actions · 2025-09-30T17:16:21Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: glm4v, glm4v_moe, qwen2_5_vl, qwen2_vl

* fix qwen text config * fix tests * fix one more test * address comments

fix qwen text config

8fb3d73

zucchini-nlp added 2 commits September 25, 2025 13:56

fix tests

4bd019e

fix one more test

0ea8854

zucchini-nlp requested a review from Cyrilvallez September 25, 2025 12:38

Cyrilvallez approved these changes Sep 29, 2025

View reviewed changes

address comments

9ac57de

zucchini-nlp enabled auto-merge (squash) September 30, 2025 11:19

zucchini-nlp added 3 commits September 30, 2025 14:02

Merge remote-tracking branch 'upstream/main' into qwen-text-config-co…

65a9309

…nsistency

woraround the edge case

ad26f44

nope, could not make it with super call at the end

41ea0f2

zucchini-nlp merged commit f22cb1e into huggingface:main Sep 30, 2025
25 checks passed

zucchini-nlp added a commit to zucchini-nlp/transformers that referenced this pull request Sep 30, 2025

fix qwen text config (huggingface#41158)

dc47be1

* fix qwen text config * fix tests * fix one more test * address comments
