-
Notifications
You must be signed in to change notification settings - Fork 25.5k
Pull requests: huggingface/transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
WIP - [GroundingDino] Fix grounding dino loss
#31828
opened Jul 7, 2024 by
EduardoPach
Loading…
1 task
Avoid failure
TFBlipModelTest::test_pipeline_image_to_text
#31827
opened Jul 7, 2024 by
ydshieh
Loading…
New option called
"best"
for args.save_strategy
.
trainer
#31817
opened Jul 6, 2024 by
seanswyi
Loading…
3 of 5 tasks
Bump certifi from 2023.7.22 to 2024.7.4 in /examples/research_projects/decision_transformer
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
#31813
opened Jul 6, 2024 by
dependabot
bot
Loading…
Fix incorrect accelerator device handling for MPS in
TrainingArguments
#31812
opened Jul 5, 2024 by
andstor
Loading…
1 of 5 tasks
Fix pipeline tests - don't set torch_dtype on non-torch pipelines
#31809
opened Jul 5, 2024 by
amyeroberts
Loading…
Push sharded checkpoint to hub when
push_to_hub=True
in TrainingArguments
#31808
opened Jul 5, 2024 by
SunMarc
Loading…
Fix multi-model training with deepspeed nvme-offloading
#31800
opened Jul 5, 2024 by
xu-song
Loading…
2 of 5 tasks
fix-_is_package_available-unify-behavior-for-available-failing-import
#31798
opened Jul 4, 2024 by
Laz4rz
Loading…
3 tasks
Fix Bug: Gemma2 the past_key_value.update() function has added a new parameter "sliding_window" to support the _sliding_update function.
#31786
opened Jul 4, 2024 by
kkk935208447
Loading…
2 of 4 tasks
Fixes to alternating SWA layers in Gemma2
#31775
opened Jul 3, 2024 by
turboderp
Loading…
1 of 5 tasks
[whisper] compile compatibility with long-form decoding
#31772
opened Jul 3, 2024 by
sanchit-gandhi
Loading…
Speedup model loading (by ~10x) and .generate() on CPU (by ~10x)!
#31771
opened Jul 3, 2024 by
muellerzr
Loading…
1 of 5 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.