Skip to content

Pull requests: huggingface/transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Trainer: Pass num_items_in_batch to compute_loss in prediction_step
#41183 opened Sep 26, 2025 by pramodith Loading…
3 of 5 tasks
Update GLM-4.1V MMRope implementation
#41182 opened Sep 26, 2025 by zRzRzRzRzRzRzR Loading…
download and use HF Hub Cache
#41181 opened Sep 26, 2025 by ydshieh Loading…
Fix Latex typesetting in documentation
#41177 opened Sep 26, 2025 by cyyever Loading…
1 of 5 tasks
Optimize rope_deltas propagation logic in Qwen2.5-VL
#41176 opened Sep 26, 2025 by Xqle Loading…
5 tasks
Bump hfh prerelease version
#41175 opened Sep 26, 2025 by Wauplin Loading…
🚨 [v5] Delete feature extractors used for vision
#41174 opened Sep 26, 2025 by zucchini-nlp Loading…
Rope for Qwen2--5-vl
#41173 opened Sep 26, 2025 by zucchini-nlp Loading…
Fix typsetting and content of llm_tutorial_optimization.md
#41172 opened Sep 26, 2025 by cyyever Loading…
1 of 5 tasks
[v5] Remove model_parallel deprecated feature
#41166 opened Sep 25, 2025 by SunMarc Loading…
[Trainer] deprecate num_train_tokens
#41165 opened Sep 25, 2025 by SunMarc Loading…
[DistilBert] Refactor Attention
#41163 opened Sep 25, 2025 by vasqu Loading…
Add Sinhala (සිංහල) translation of README
#41162 opened Sep 25, 2025 by Ranjuna120 Loading…
5 tasks
Remove old sagemaker api support
#41161 opened Sep 25, 2025 by SunMarc Loading…
[docs] update tips syntax
#41160 opened Sep 25, 2025 by mishig25 Loading…
Support setting total_train_batch_size.
#41159 opened Sep 25, 2025 by zhengchenyu Loading…
5 tasks
fix qwen text config
#41158 opened Sep 25, 2025 by zucchini-nlp Loading…
Fix white space in documentation
#41157 opened Sep 25, 2025 by cyyever Loading…
1 of 5 tasks
Fix inaccurate train_tokens_per_second when resuming from checkpoint
#41156 opened Sep 25, 2025 by lilin-1 Loading…
2 of 5 tasks
add rotary kernel support to Qwen3 model
#41147 opened Sep 25, 2025 by kaixuanliu Loading…
Fix typing of train_args
#41142 opened Sep 25, 2025 by cyyever Draft
ProTip! Adding no:label will show everything without a label.