Pull requests: huggingface/trl
Account for token_type_ids in DataCollatorForVisionLanguageModeling
#4190
opened Oct 1, 2025 by
qgallouedec
🧺 [5/N] Refactor _generate in GRPO/RLOO: Insert images in the prompt
#4155
opened Sep 26, 2025 by
qgallouedec
🧺 [4/N] Refactor _generate in GRPO/RLOO: Move forward_kwargs outside generation method
#4154
opened Sep 26, 2025 by
qgallouedec
🧺 [3/N] Refactor _generate in GRPO/RLOO: Rely on generator for prompt truncation
#4153
opened Sep 26, 2025 by
qgallouedec
🧺 [2/N] Refactor _generate in GRPO/RLOO: Use prompt_ids from generation
#4152
opened Sep 26, 2025 by
qgallouedec
update guided decoding param to structured outputs
#4117
opened Sep 22, 2025 by
jiqing-feng
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091
opened Sep 15, 2025 by
ycma8
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084
opened Sep 15, 2025 by
sergiopaniego
5 tasks
Add config_init_kwargs option in GRPOConfig
#4069
opened Sep 12, 2025 by
hokuyama0106
2 of 5 tasks
[Draft] Add configurable dataset column logging to GRPOTrainer W&B tables
#4045
opened Sep 9, 2025 by
davanstrien
Draft
Fix #3982: Fix DPO Trainer support for Gemma 3 vision models
#4022
opened Sep 6, 2025 by
akshay-babbar
Fix: undefined current_gradient_accumulation_steps
#4014
opened Sep 5, 2025 by
ysjprojects
2 of 5 tasks
Fix: ignore precompute_ref_log_probs when use_liger_loss=True
#4008
opened Sep 4, 2025 by
ginkyenglee
5 tasks
Enable saving and loading precomputed reference log probabilities in …
#3986
opened Sep 1, 2025 by
ginkyenglee
3 tasks