| COMMIT |
1.00 |
Fix vllm cis (#45139) |
|
Commit message contains explicit AI assi |
2026-04-08 |
| COMMIT |
1.00 |
Update tiny model creation script (#45241) |
|
Commit message contains explicit AI assi |
2026-04-04 |
| COMMIT |
1.00 |
fix: prefer registered config over remote code in AutoConfig |
|
Commit message contains explicit AI assi |
2026-03-31 |
| COMMIT |
1.00 |
Fix stupid test fetcher (#45140) |
|
Commit message contains explicit AI assi |
2026-03-31 |
| COMMIT |
1.00 |
[Bugfix] Remove incorrect torchvision requirement from PIL b |
|
Commit message contains explicit AI assi |
2026-03-30 |
| PR |
1.00 |
Fix #45305 + add regression test GAS |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
Drop unused Gemma4TextAttention weights when sharing KV Cach |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
WIP: Add support for Granite4VisionForConditionalGeneration |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
Fix apply_chat_template crash on tool_call messages without |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
Add cuda compatibility check for using `grouped_mm` |
|
PR body explicitly mentions AI collabora |
2026-03-25 |
| PR |
1.00 |
feat: make timesfm2_5 onnx export compatible |
|
PR body explicitly mentions AI collabora |
2026-04-04 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
Fix MoE routers returning probabilities instead of logits |
|
PR body explicitly mentions AI collabora |
2026-03-30 |
| PR |
1.00 |
add Qianfan-OCR model definition |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
fix: liger unnecessarily materializes logits in VRAM during |
|
PR body explicitly mentions AI collabora |
2026-04-06 |
| PR |
1.00 |
Add Qwen3.5 GGUF loading support |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
resize_token_embeddings does not effect to output_embeddings |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
Fix UnboundLocalError in invert_attention_mask by adding pro |
|
PR body explicitly mentions AI collabora |
2026-04-05 |
| PR |
1.00 |
docs maintenance for transformers repository 979e8 |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
0.30 |
Fix: ObjectDetectionPipeline batch inference only returns fi |
|
Mix of technical and explanatory; no sus |
2026-04-03 |
| PR |
0.30 |
feat[vLLM × v5]: Add vLLM compatibility for audio models |
|
Contains domain-specific notes and symbo |
2026-04-08 |
| PR |
0.20 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Technical, concise, slightly formal but |
2026-03-30 |
| PR |
0.20 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Enumerates tests, specifics, and parenth |
2026-03-14 |
| PR |
0.20 |
Refactor CLIP-like models |
|
Casual style, issue references, and proj |
2026-03-04 |
| PR |
0.20 |
Fix mutable default arguments in quantization config classes |
|
Fairly formal, but technical specifics a |
2026-04-07 |
| PR |
0.20 |
Fix resize failure caused by zero-sized masks in PP-DocLayou |
|
Clear technical context, but slightly mo |
2026-04-07 |
| PR |
0.20 |
[gemma4] Dissociate kv states sharing from the Cache |
|
Technical and succinct, direct to the ch |
2026-04-08 |
| PR |
0.15 |
FSDP2 native support in transformers |
|
Terse informal notes and abbreviations; |
2026-02-17 |
| PR |
0.13 |
fix: skip `clean_up_tokenization` for BPE tokenizers in `Pre |
|
Describes bug and context in detail, wit |
2026-03-21 |
| COMMIT |
0.10 |
Fix `SmolVLM` video processor `resize` using wrong interpola |
|
Technical, detailed explanation with dom |
2026-04-06 |
| COMMIT |
0.10 |
empty (#45261) |
|
Casual tone, technical context, some com |
2026-04-06 |
| PR |
0.10 |
Load adapter with TP |
|
Brief, domain-specific answer with jargo |
2026-03-31 |
| PR |
0.10 |
fix(qwen3_moe): correct return type annotation on Qwen3MoeSp |
|
Brief, technical, and domain-specific la |
2026-04-09 |
| PR |
0.10 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Focused, domain-specific fix; language i |
2026-04-09 |
| PR |
0.10 |
[Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel |
|
Brief, context-aware, and domain-specifi |
2026-04-03 |
| PR |
0.10 |
[docs] training on specific hardware |
|
Structured changelog, domain terms, not |
2026-03-17 |
| PR |
0.10 |
[docs] training performance |
|
Bullet points, domain-specific; concise, |
2026-02-27 |
| PR |
0.10 |
TP refactor for FSDP + TP integration |
|
Terse, uses domain-specific abbreviation |
2026-03-26 |
| PR |
0.10 |
Use torchvision `decode_image` to load images in the torchv |
|
Concise, domain-specific; headings templ |
2026-04-02 |
| PR |
0.10 |
Fix ByteLevel-BPE tokenizers silently breaking in `LlamaToke |
|
Strong technical jargon and brevity; no |
2026-04-09 |
| PR |
0.10 |
fix(ernie4_5_vl_moe): resolve three config loading failures |
|
Terse, error-focused, uses abbreviations |
2026-04-07 |
| PR |
0.10 |
Fix ty for transformers cli |
|
Domain abbreviations, direct writing; no |
2026-04-02 |
| PR |
0.10 |
Gemma4 resizing per layer inputs |
|
Terse, includes shorthand and references |
2026-04-08 |
| PR |
0.10 |
Add Videoprism |
|
Direct, brief; reviewer comment is also |
2025-08-04 |
| PR |
0.10 |
Fix softmaxing router logits |
|
Clear, specific about errors; tone natur |
2026-04-08 |
| PR |
0.10 |
Fix broken HQQ support |
|
Terse bullet points, informal language; |
2026-03-31 |
| PR |
0.10 |
Modular playground |
|
Uses technical shorthand and bullet poin |
2026-02-04 |
| PR |
0.10 |
Add RF-DETR |
|
Terse, direct, domain-specific; no AI ge |
2025-03-21 |
| PR |
0.10 |
Fix conversion mappings for vlms |
|
Uses terse phrases, issues, and informal |
2026-04-09 |
| PR |
0.10 |
[PoC] HF exporters |
|
Casual edits, emoji use, and PR referenc |
2025-11-03 |
| PR |
0.10 |
fix: dont download artifacts from the test hub |
|
Direct fix, minimal language, friendly r |
2026-04-08 |
| PR |
0.10 |
Refactor OwlViT to modular Transformers |
|
Highly technical, uses jargon and shorth |
2026-03-27 |
| PR |
0.10 |
Trainer: set skip_logits for loss-only eval when liger enabl |
|
Technical context, jargon, precise langu |
2026-03-25 |
| PR |
0.10 |
Copy the template resolution logic from the base apply_chat_ |
|
Uses shorthand and informal expressions, |
2026-03-30 |
| PR |
0.10 |
Fix `Wav2Vec2Config.vocab_size` type to allow `None` |
|
Domain-specific detail, concise; some te |
2026-03-30 |
| PR |
0.10 |
Add THD support in ESM |
|
Technical explanation with a typo ('Rota |
2026-02-19 |
| PR |
0.10 |
feat/rfc/poc: Agnostic GPU |
|
Domain-focused, includes abbreviation RF |
2026-04-04 |
| PR |
0.10 |
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate |
|
Technical, precise edit summary; likely |
2026-04-03 |
| PR |
0.10 |
Fix AttributeError in _patch_mistral_regex when fix_mistral_ |
|
Contains typos and specific error contex |
2026-04-08 |
| PR |
0.10 |
docs: document known limitations of _can_set_attn/experts_im |
|
Technical changelog, standard structure, |
2026-04-09 |
| PR |
0.10 |
[Gemma4] Fix chat template and stop tokens for OpenAI tool c |
|
Jargon, bullet points, technical referen |
2026-04-05 |
| PR |
0.10 |
[gemma4] Remove all shared weights, and silently skip them d |
|
Casual tone, references, and technical c |
2026-04-09 |
| PR |
0.10 |
Remove references to torchao's AffineQuantizedTensor |
|
Uses clear summary with domain context; |
2026-04-08 |
| PR |
0.10 |
Fix "AttributeError: NewTokenizer has no attribute special_a |
|
Detailed human bug explanation and struc |
2026-04-07 |
| PR |
0.10 |
[docs] modular transformers |
|
Casual tone with technical abbreviations |
2026-04-08 |
| PR |
0.10 |
Make the cli a top-level package |
|
Technical, concise, domain-specific phra |
2026-04-02 |
| PR |
0.10 |
chore: added circleci python script to ruff and ty checkers |
|
Short, focused, direct; includes repo-sp |
2026-04-09 |
| PR |
0.10 |
Pretrained-config bug(45072/huggingfacebug) |
|
References specific issues and Pydantic; |
2026-03-31 |
| PR |
0.10 |
fix: leak in tokenizer registry for `test_processors` |
|
Clear technical explanation; informal, n |
2026-04-08 |
| PR |
0.10 |
Fix Gemma4 `use_cache=False` producing bad logits |
|
Problem/solution summary with technical |
2026-04-05 |
| PR |
0.10 |
fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni |
|
Technical explanations, commit reference |
2026-04-02 |
| PR |
0.10 |
Add heterogeneous config support (per-layer configuration) |
|
Brief, domain-specific, and concise with |
2026-04-09 |
| PR |
0.10 |
Add heterogeneous model support (per-layer config and modeli |
|
Short, technical, informal; lacks AI hal |
2026-04-09 |
| PR |
0.05 |
refactor: display test duration |
|
Brief, technical content with example; i |
2026-04-09 |
| PR |
0.05 |
http retries on audio file downloads |
|
Uses domain terms, concise phrasing, not |
2026-03-30 |
| PR |
0.05 |
cohere_asr: fix bug for model_parallel_beam_search test case |
|
Technical, direct; bugfix description ty |
2026-04-03 |
| PR |
0.05 |
skip 2 invalid test cases for pi0 model |
|
Informal tone and chat; domain-specific |
2026-03-26 |
| PR |
0.05 |
musicflamingo: add test support for Intel XPU device |
|
Casual review ask, abbreviations; no AI |
2026-04-03 |
| COMMIT |
0.00 |
Load adapter with TP (#45155) |
|
Terse, uses technical abbreviations, no |
2026-04-09 |
| COMMIT |
0.00 |
[docs] tp training (#44613) |
|
Very minimal, domain-specific shorthand, |
2026-04-09 |
| COMMIT |
0.00 |
[docs] training performance (#44342) |
|
Short, informal phrases typical of human |
2026-04-09 |
| COMMIT |
0.00 |
[docs] optimizers, hyperparam search, training features (#44 |
|
Informal list format, domain terms, not |
2026-04-09 |
| COMMIT |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Domain-specific wording, includes co-aut |
2026-04-09 |
| COMMIT |
0.00 |
Use torchvision `decode_image` to load images in the torchv |
|
Succinct, technical, includes co-author, |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Fix device map auto (#45347) |
|
Brief, uses domain terminology, not AI g |
2026-04-09 |
| COMMIT |
0.00 |
Refactor CLIP-like models (#44431) |
|
Terse, casual edits, clear human voice t |
2026-04-09 |
| COMMIT |
0.00 |
refactor: display test duration (#45344) |
|
Minimal, domain-focused, typical of huma |
2026-04-09 |
| COMMIT |
0.00 |
http retries on audio file downloads (#45126) |
|
Technical, brief, informal, no AI-like l |
2026-04-09 |
| COMMIT |
0.00 |
Fix `Wav2Vec2Config.vocab_size` type to allow `None` (#45108 |
|
Succinct commit, human style, includes d |
2026-04-09 |
| COMMIT |
0.00 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Terse, technical, informal and direct co |
2026-04-09 |
| COMMIT |
0.00 |
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate |
|
Technical jargon and fix history, not AI |
2026-04-09 |
| COMMIT |
0.00 |
Add THD support in ESM (#44145) |
|
Sequence of technical steps and signed-o |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Remove all shared weights, and silently skip them d |
|
Short, informal phrases and abbreviation |
2026-04-09 |
| COMMIT |
0.00 |
Fix conversion mappings for vlms (#45340) |
|
Terse and informal with typos, not AI-ge |
2026-04-09 |
| COMMIT |
0.00 |
Fix resize failure caused by zero-sized masks in PP-DocLayou |
|
Casual tone, short phrasing, domain lang |
2026-04-09 |
| COMMIT |
0.00 |
chore: added circleci python script to ruff and ty checkers |
|
Single-sentence changes, abbreviations, |
2026-04-09 |
| COMMIT |
0.00 |
tweak checkers output on errors (#45163) |
|
Succinct, with human-like test fix expla |
2026-04-09 |
| COMMIT |
0.00 |
fix: leak in tokenizer registry for `test_processors` (#4531 |
|
Short, typo ('reigstry'), human casual s |
2026-04-09 |
| COMMIT |
0.00 |
chore: remove test_hub for now (#45337) |
|
Concise, informal commit message typical |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Dissociate kv states sharing from the Cache (#45312 |
|
Terse commit history, domain-specific an |
2026-04-09 |
| COMMIT |
0.00 |
Fix `text-to-speech` pipeline crash when generation config c |
|
Direct, domain-specific commit message w |
2026-04-08 |
| COMMIT |
0.00 |
[docs] pipeline cleanup (#44954) |
|
Extremely terse; typical human doc updat |
2026-04-08 |
| COMMIT |
0.00 |
Add MoE to Gemma4 TP plan (#45219) |
|
Concise, domain-language, signed off by |
2026-04-08 |
| COMMIT |
0.00 |
Fix export for gemma4 and add Integration tests (#45285) |
|
Choppy informal commit breakdowns; clear |
2026-04-08 |
| COMMIT |
0.00 |
[docs] static model rules (#45232) |
|
Minimal, iterative commit style; clearly |
2026-04-08 |
| COMMIT |
0.00 |
fix(security): prevent untrusted users from triggering TRL C |
|
Technical detail, informal, and concise |
2026-04-07 |
| COMMIT |
0.00 |
Fix missing image processors backends (#45165) |
|
Very terse commit summary and message, c |
2026-04-07 |
| COMMIT |
0.00 |
[AMD CI] Fix Qwen2 expectations (#45284) |
|
Terse and informal phrasing; human style |
2026-04-07 |
| COMMIT |
0.00 |
Add `hasattr(torch.backends.cudnn, "conv")` to `conftest.py` |
|
Concise commit message; domain-specific |
2026-04-06 |
| COMMIT |
0.00 |
Fix `Qwen2IntegrationTest` (#45268) |
|
Single word summary; strongly human. |
2026-04-06 |
| COMMIT |
0.00 |
doc: fix TokenizersBackend.convert_to_native_format docstrin |
|
Minimal, template-style docstring fix; n |
2026-04-06 |
| COMMIT |
0.00 |
Fix unexpected TF32 being enabled in testing (#45252) |
|
Minimal, terse commit message; human typ |
2026-04-05 |
| COMMIT |
0.00 |
Fix tf32 issue: set `torch.backends.cudnn.conv.fp32_precisio |
|
Commit message is terse, with typical de |
2026-04-05 |
| COMMIT |
0.00 |
Nvidia CI with `torch 2.11` (#45243) |
|
Short, domain-specific, and informal com |
2026-04-04 |
| COMMIT |
0.00 |
Update `get_test_info.py` (related to tiny model creation) ( |
|
Brief, technical, and uses domain abbrev |
2026-04-04 |
| COMMIT |
0.00 |
More fix for tiny model creation (#45228) |
|
Commit message and bullet points are ter |
2026-04-03 |
| COMMIT |
0.00 |
remove unnecessary entries in some auto model mappings (#452 |
|
Extremely terse commit note; clear human |
2026-04-03 |
| COMMIT |
0.00 |
fix: hf-doc-builder insallation was failing (#45225) |
|
Brief and informal phrasing; no AI trait |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Add per-request logits processors (#45026) |
|
Terse, minimal commit message; typical h |
2026-04-03 |
| COMMIT |
0.00 |
[docs] formatting (#45196) |
|
Sparse, technical note with nonstandard |
2026-04-03 |
| COMMIT |
0.00 |
fix `test_register_result_handler` (#45188) |
|
Very brief commit message, lacks AI hall |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Tweaks to update and minor fixes (#45179) |
|
Bullet style, typos, and brevity point t |
2026-04-03 |
| COMMIT |
0.00 |
Fix pypi release (#45210) |
|
Minimal, direct style, some typos, clear |
2026-04-03 |
| COMMIT |
0.00 |
update to dev version 5.6.0-dev0 |
|
Simple version update note, extremely br |
2026-04-03 |
| COMMIT |
0.00 |
fix(docs): correct gemma4 docs and examples (#45197) |
|
Standard commit message with co-author t |
2026-04-02 |
| COMMIT |
0.00 |
Add Turkish (tr) translation for Get Started section (#45158 |
|
Human style, domain-specific, and inform |
2026-04-02 |
| COMMIT |
0.00 |
[docs] transformers serve (#45174) |
|
Very brief and informal commit messages, |
2026-04-02 |
| COMMIT |
0.00 |
casually dropping the most capable open weights on the plane |
|
Casual, informal tone; no AI signatures; |
2026-04-02 |
| COMMIT |
0.00 |
Internalise the NomicBERT model (#43067) |
|
Structured sequential work, domain-speci |
2026-04-02 |
| COMMIT |
0.00 |
Fix resized LM head weights being overwritten by post_init ( |
|
Technically detailed, with typos and dir |
2026-04-02 |
| COMMIT |
0.00 |
[Qwen3.5 MoE] Add _tp_plan to ForConditionalGeneration (#451 |
|
Technical, domain-specific, and succinct |
2026-04-02 |
| COMMIT |
0.00 |
Fix TypeError: 'NoneType' object is not iterable in Generati |
|
Very brief and specific to bug; human co |
2026-04-02 |
| COMMIT |
0.00 |
fix(models): Fix dtype mismatch in SwitchTransformers and Ti |
|
Uses human-like shorthand and changelog |
2026-04-02 |
| COMMIT |
0.00 |
Generalize gemma vision mask to videos (#45185) |
|
Short, informal, with direct responses t |
2026-04-02 |
| COMMIT |
0.00 |
[misc] fix qwen35 tests: correct the text model type and ski |
|
Concise commit message, domain-specific, |
2026-04-02 |
| COMMIT |
0.00 |
🔒 Pin GitHub Actions to commit SHAs (#45180) |
|
Explicit, terse commit titles; no AI hal |
2026-04-02 |
| COMMIT |
0.00 |
Use doc-builder runnable example for GLM-ASR (#44277) |
|
Informal language and typographical erro |
2026-04-02 |
| COMMIT |
0.00 |
CI] Small T5 expectations updated (#45138) |
|
Extremely terse commit message; clearly |
2026-04-02 |
| COMMIT |
0.00 |
fix: correct type annotations across config classes for @str |
|
Contains domain-specific detail, terse s |
2026-04-01 |
| COMMIT |
0.00 |
Fix explicit local code resolution for tokenizers and image |
|
Technical, uses signing trailer but no A |
2026-04-01 |
| COMMIT |
0.00 |
Fix T5Attention shape mismatch under Tensor Parallelism (#45 |
|
Technical explanation, review references |
2026-04-01 |
| COMMIT |
0.00 |
[refactor] Serving into proper modules (#44796) |
|
Commit history is terse, informal, and t |
2026-04-01 |
| COMMIT |
0.00 |
Re-add regex substitutions to the response parsing spec (#45 |
|
Informal changelog, abbreviations, and t |
2026-04-01 |
| COMMIT |
0.00 |
fix bug for janus model image generation (#45044) |
|
Informal, domain-specific, signed by hum |
2026-04-01 |
| COMMIT |
0.00 |
Fix incorrect TrainingArguments example in training.md (#451 |
|
Brief commit messages, domain-specific, |
2026-03-31 |
| COMMIT |
0.00 |
Add parse_response to Processor, make it a bit more official |
|
Direct, minimal phrasing; no AI indicato |
2026-03-31 |
| COMMIT |
0.00 |
DeepGEMM (#44832) |
|
Informal, technical jargon, no sign of A |
2026-03-31 |
| COMMIT |
0.00 |
🚨 [Cache] Native mamba & hybrid cache (#44950) |
|
Casual commit style, lots of terse fixes |
2026-03-31 |
| COMMIT |
0.00 |
[serving] Fix continuous batching JSON response serializatio |
|
Detailed but technical, regression test |
2026-03-31 |
| COMMIT |
0.00 |
refactoring: speedup static checks with disk cache (#44992) |
|
Terse, to-the-point, refactoring context |
2026-03-31 |
| COMMIT |
0.00 |
:rotating_light: [`LightGlue`] Remove remote code execution |
|
Minimal commit messages, no AI hallmarks |
2026-03-31 |
| COMMIT |
0.00 |
[CB] Add warmup feature (#45112) |
|
Brief, iterative commits, domain terms, |
2026-03-31 |
| COMMIT |
0.00 |
feature: added import complexity checker (#45013) |
|
Simple feature summary, short updates, h |
2026-03-31 |
| COMMIT |
0.00 |
Fix tests for `janus` model (#44739) |
|
Issue/PR style with signed-off trailer, |
2026-03-31 |
| COMMIT |
0.00 |
CB improvements for serving (#45063) |
|
All items are terse commit messages typi |
2026-03-30 |
| COMMIT |
0.00 |
Add Music Flamingo (#43538) |
|
Human-typical terse, technical commit me |
2026-03-30 |
| COMMIT |
0.00 |
[docs] continuous batching (#44896) |
|
Brief, informal, technical wording; no A |
2026-03-30 |
| COMMIT |
0.00 |
Fix few issues in Qwen_3_Omni_Moe (#44848) |
|
Commit messages are typical, concise dev |
2026-03-30 |
| COMMIT |
0.00 |
Fix PP test_ocr_queries (#45123) |
|
Short technical fix message, human style |
2026-03-30 |
| COMMIT |
0.00 |
Fix TypeError in rope validation when ignore_keys is a list |
|
Technical explanation with specific deta |
2026-03-30 |
| COMMIT |
0.00 |
refactor: added cache in check_repo (#45012) |
|
Follows conventional commit style; infor |
2026-03-30 |
| COMMIT |
0.00 |
Remove unused TensorFlow env var (#45065) |
|
Very terse, typical human commit style, |
2026-03-27 |
| COMMIT |
0.00 |
fix: add identity reverse_op to dequantize ops for save_pret |
|
Technical, domain-specific, informal, hu |
2026-03-27 |
| COMMIT |
0.00 |
Fix when RoPE params are in kwargs (#45049) |
|
Concise, technical phrasing, typical of |
2026-03-27 |
| COMMIT |
0.00 |
chore: update update_metdata.yml (#45054) |
|
— |
2026-03-27 |
| COMMIT |
0.00 |
[`FA`] Fix BC support for a few versions + add deprecation c |
|
Commit is terse, informal, and uses doma |
2026-03-27 |
| COMMIT |
0.00 |
fix(testing): Fix Parakeet, Evolla, Pi0, and Phi-3 test fail |
|
Commit and co-authored-by trailer show h |
2026-03-27 |
| COMMIT |
0.00 |
Allow advanced users to override `model_type` in `AutoConfig |
|
Brief technical summary; concise, domain |
2026-03-27 |
| COMMIT |
0.00 |
Fix llama4 bnb mode (#44588) |
|
Commit log style; terse, technical, no A |
2026-03-27 |
| COMMIT |
0.00 |
Fix failing `SmolLM3IntegrationTest` (#45048) |
|
Minimal content; repetitive test referen |
2026-03-27 |
| COMMIT |
0.00 |
fix tests/quantization/fp_quant_integration/test_fp_quant.py |
|
Technical jargon, short notes, domain-sp |
2026-03-27 |
| PR |
0.00 |
Update `trackio` integration to use Buckets and "freeze" Spa |
|
Template content only, no free-text from |
2026-04-08 |
| PR |
0.00 |
[docs] tp training |
|
Very brief, to-the-point; lacking AI hal |
2026-03-11 |
| PR |
0.00 |
Close file handler |
|
Very brief, domain-specific; lacks AI-li |
2026-04-02 |
| PR |
0.00 |
[docs] optimizers, hyperparam search, training features |
|
Brief, domain-specific, non-formal chang |
2026-02-26 |
| PR |
0.00 |
Add Deepseek-OCR-2 model |
|
Direct, technical, and references specif |
2026-03-27 |
| PR |
0.00 |
[docs] zero + sequence parallelism |
|
Technical summarization, clear manual st |
2026-03-11 |
| PR |
0.00 |
Add AudioFlamingoNext model |
|
Terse changelog, references issue and sp |
2026-03-18 |
| PR |
0.00 |
fix: restore mypy type checking for PreTrainedConfig subclas |
|
Explains technical bug fix with domain-s |
2026-04-04 |
| PR |
0.00 |
Fix: NotebookProgressCallback crash when evaluating with the |
|
Bug fix with issue reference, human-like |
2026-03-23 |
| PR |
0.00 |
Logger has `[transformers]` prefix in non-verbose mode |
|
Casual phrasing, brief and references co |
2026-04-08 |
| PR |
0.00 |
[WIP][Fix] GLM 5 set `apply_rotary_pos_emb` to `is_neox_styl |
|
Uses technical jargon, WIP marker, and i |
2026-03-26 |
| PR |
0.00 |
Fix gemma4 has flash-attention incompatbile head-dim=512 |
|
Contains informal tone and direct admiss |
2026-04-02 |
| PR |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Lists concise technical changes in non-f |
2026-03-16 |
| PR |
0.00 |
[docs] model testing |
|
Informal tone, minimal capitalisation; c |
2026-03-31 |
| PR |
0.00 |
Fix Double Application of Softmax for Router Logits in MoE m |
|
Contains only a technical title; no free |
2026-04-09 |
| PR |
0.00 |
[gemma4] Fix device map auto |
|
Technical phrasing, domain details, info |
2026-04-09 |
| PR |
0.00 |
fix bug for videomt model device mismatch |
|
Informal and terse, uses shorthand and d |
2026-04-03 |
| PR |
0.00 |
Add GGUF support to Gemma4 (31B & 26B-A4B) text |
|
Most text is template, no AI-style conte |
2026-04-07 |
| PR |
0.00 |
nomic_bert: make the test suitable for general device. |
|
Very brief, direct, and informal with '@ |
2026-04-03 |
| PR |
0.00 |
Use `_keys_to_ignore_on_load_unexpected/missing` recursively |
|
Message is only a technical commit title |
2026-04-09 |
| PR |
0.00 |
user friendly error when loading audio from video |
|
Minimal, informal, domain-jargon; review |
2026-04-03 |
| PR |
0.00 |
tweak checkers output on errors |
|
Terse description and informal language; |
2026-04-01 |
| PR |
0.00 |
[AMD CI] Fix torch.compile/export failures on AMD CI due to |
|
Very technical, bug-focused, references |
2026-04-07 |
| PR |
0.00 |
Fix AttributeError in AssistantToTargetTranslator.unmap_inpu |
|
Minimal free-text, template section, no |
2026-04-08 |
| PR |
0.00 |
Add HyperCLOVAX SEED Think 14B |
|
Rich technical details and formatting; h |
2026-03-23 |
| PR |
0.00 |
Fix Zamba2MambaMixer ignoring use_mamba_kernels=False |
|
Bug explanation and code context with in |
2026-03-19 |
| PR |
0.00 |
chore: remove test_hub for now |
|
Extremely terse summary; informal/human |
2026-04-09 |
| PR |
0.00 |
throw error when conversion required |
|
Short, informal, specific bugfix context |
2026-03-27 |