Skip to content

Pull requests: huggingface/transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Updated Megatron conversion script for gpt2 checkpoints
#38969 opened Jun 22, 2025 by LckyLke Loading…
Add ModernBERT Decoder Models - ModernBERT, but trained with CLM!
#38967 opened Jun 22, 2025 by orionw Loading…
4 of 5 tasks
[docs] Typos - Single GPU efficient training features
#38964 opened Jun 21, 2025 by casinca Loading…
1 of 5 tasks
Update test_candidate_generator.py
#38962 opened Jun 21, 2025 by Natakarani Loading…
5 tasks
feat: add number token loss implementation
#38960 opened Jun 21, 2025 by happybear-21 Loading…
Updated the model card for wav2vec2-phoneme
#38959 opened Jun 21, 2025 by AshAnand34 Loading…
1 task done
Updated model card for wav2vec2-conformer
#38958 opened Jun 21, 2025 by AshAnand34 Loading…
1 task done
Update wav2vec2-bert model card
#38957 opened Jun 21, 2025 by AshAnand34 Loading…
1 task done
Updating model card for wav2vec2
#38956 opened Jun 20, 2025 by AshAnand34 Loading…
1 task done
docs: Musicgen melody model card
#38955 opened Jun 20, 2025 by AshAnand34 Loading…
1 task done
docs: created musicgen model card
#38953 opened Jun 20, 2025 by AshAnand34 Loading…
1 task done
[WIP] FSDP2 + TP + (other)
#38949 opened Jun 20, 2025 by S1ro1 Draft
[Attention] Small fix on output attentions
#38948 opened Jun 20, 2025 by vasqu Loading…
Totally rewrite how pipelines load preprocessors
#38947 opened Jun 20, 2025 by Rocketknight1 Loading…
Internvl fix
#38946 opened Jun 20, 2025 by remi-or Loading…
[tests] remove TF tests (uses of require_tf)
#38944 opened Jun 20, 2025 by gante Loading…
Break tie in Expectations and gemma3 fixes
#38943 opened Jun 20, 2025 by remi-or Loading…
Decouple device_map='auto' and tp_plan='auto'
#38942 opened Jun 20, 2025 by SunMarc Loading…
Remove script datasets in tests
#38940 opened Jun 20, 2025 by lhoestq Loading…
Use newer typing notation
#38934 opened Jun 20, 2025 by cyyever Loading…
[qwen] refactor attentions for vision/audio
#38930 opened Jun 20, 2025 by zucchini-nlp Loading…
Enable XPU on torchao doc
#38929 opened Jun 20, 2025 by jiqing-feng Draft
ProTip! What’s not been updated in a month: updated:<2025-05-22.