| COMMIT |
1.00 |
Add Neuron to auto-compile hardware list (#44757) |
|
Commit message contains explicit AI assi |
2026-04-16 |
| COMMIT |
1.00 |
Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex |
|
Commit message contains explicit AI assi |
2026-04-13 |
| COMMIT |
1.00 |
fix(serving): resolve rust tokenizer from ProcessorMixin in |
|
Commit message contains explicit AI assi |
2026-04-13 |
| COMMIT |
1.00 |
fix(qwen3_moe): correct return type annotation on Qwen3MoeSp |
|
Explicit mention of Claude Code and 'Bui |
2026-04-13 |
| COMMIT |
1.00 |
docs: fix 5 docstring errors in Gemma3nTextConfig (typos, gr |
|
Mentions 'Built by Rudrendu Paul, develo |
2026-04-13 |
| PR |
1.00 |
Add ctsm model |
|
PR body explicitly mentions AI collabora |
2026-04-17 |
| PR |
1.00 |
Add Xiaomi MiMo-V2 |
|
PR body explicitly mentions AI collabora |
2026-03-31 |
| PR |
1.00 |
.4021378118068288:94a295563cf6b5aa7d67bd0f2c0cd7a7_69e2df734 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
.4021378118068288:e3506c3c5a98ec3a50332c6102362804_69e2e5884 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
.4021378118068288:4da7ed27ccaa5f974fe4a552e2b67bb6_69e2eea84 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
.4021378118068288:aafb9167aaa6b321205f754209b0cbcb_69e2f2564 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
.4021378118068288:1e40fd96a800b4038c914120b0aa85c2_69e2faf34 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
.4021378118068288:98296403e0cd6dedb7b420b80d0fe80b_69e2ce7c4 |
|
PR body explicitly mentions AI collabora |
2026-04-18 |
| PR |
1.00 |
resize_token_embeddings does not effect to output_embeddings |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
SonicMoe |
|
PR body explicitly mentions AI collabora |
2026-04-14 |
| PR |
1.00 |
Add EXAONE 4.5 implementations |
|
PR body explicitly mentions AI collabora |
2026-04-16 |
| PR |
1.00 |
revert sha commit pointing to main for transformers_amd_ci_ |
|
PR body explicitly mentions AI collabora |
2026-04-17 |
| PR |
1.00 |
add Qianfan-OCR model definition |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
add expert parallelism for gemma-4-26B-A4B-it |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
fix(testing_utils): guard get_device_capability with torch.c |
|
PR body explicitly mentions AI collabora |
2026-04-16 |
| PR |
0.30 |
Proposal to add Qwen3-ASR support [WIP] |
|
Some formal phrasing like 'This PR adds' |
2026-02-08 |
| PR |
0.20 |
Parakeet tdt |
|
Technical phrasing and references; no AI |
2026-02-20 |
| PR |
0.20 |
chore(qa): split out mlinter |
|
Uses bullet points and domain terms; no |
2026-04-16 |
| PR |
0.20 |
Add option to export encoder hidden states for Granite-spee |
|
Concise, domain-specific; no strong AI s |
2026-03-03 |
| PR |
0.20 |
Adding support for Nandi Models |
|
Informal thanks and custom shout-outs; l |
2026-03-29 |
| PR |
0.20 |
Add IndexCache support for GLM5 DSA |
|
Concise but more formal, some AI-like ma |
2026-04-14 |
| PR |
0.20 |
Add Neuron to auto-compile hardware list |
|
Concise, technical, but slightly more fo |
2026-03-16 |
| PR |
0.20 |
Drop `content=None` from messages in `apply_chat_template` |
|
Focused on technical fix, uses technical |
2026-04-14 |
| PR |
0.20 |
[`fix`] Always early return for non-Mistral models in _patch |
|
Mix of template structure and technical |
2026-04-14 |
| PR |
0.11 |
fix(auto): Map deepseek_v2 and deepseek_v3 to LlamaTokenizer |
|
Technical motivation/problem section, do |
2026-03-17 |
| COMMIT |
0.10 |
[Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel |
|
Uses technical detail and casual, non-fo |
2026-04-14 |
| COMMIT |
0.10 |
fix: prevent accelerate from splitting vision encoder by set |
|
Human, domain-focused writing and inform |
2026-04-14 |
| COMMIT |
0.10 |
[Doc] MoE routing capture and replay recipe (#44925) |
|
Informal style with partial sentences an |
2026-04-14 |
| PR |
0.10 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Domain-specific and concise with abbrevi |
2026-04-18 |
| PR |
0.10 |
[Doc] Fix 'tokenized' -> 'tokenizer' typo in streamer docstr |
|
Direct, technical, and uses natural chan |
2026-04-18 |
| PR |
0.10 |
Add qwen3 tts |
|
Use of abbreviations, concise bullets, a |
2026-03-07 |
| PR |
0.10 |
Fix position_ids docstring in modeling_flash_attention_utils |
|
Specific references, natural summary, no |
2026-03-09 |
| PR |
0.10 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Concise, technical, truncated but no AI |
2026-03-30 |
| PR |
0.10 |
feat(models): Make MimiModel encoding padding-aware to ensur |
|
Structured, technical, and natural engin |
2026-01-20 |
| PR |
0.10 |
fix(models): Bamba model fails with torch.compile when using |
|
Terse, domain-specific, includes informa |
2026-01-28 |
| PR |
0.10 |
fix(models): Fix suno/bark-small CPU offload device mismatch |
|
Concise, technical explanation, domain e |
2026-01-29 |
| PR |
0.10 |
fix(tokenizer): Avert special token property overwrites in b |
|
Bug reference, short and technical, no A |
2026-01-31 |
| PR |
0.10 |
fix(models): Unpack BitNet packed weights to fix CI failure |
|
Brief, domain-specific summary; no AI-st |
2026-02-03 |
| PR |
0.10 |
fix(testing): Fix BLOOM tokenizer, CLAP audio features, and |
|
Direct test references, concise; not AI- |
2026-02-06 |
| PR |
0.10 |
fix(models): Apply STE in Dac.from_latents to match the forw |
|
Succinct and technical, lacks AI hallmar |
2026-02-07 |
| PR |
0.10 |
fix(testing): Fix LayoutXLM tokenization test and LightOnOCR |
|
References specific tests, technical, no |
2026-02-13 |
| PR |
0.10 |
fix(testing): Update stale device override test in GraniteSp |
|
Brief, technical, minimal completion, no |
2026-02-17 |
| PR |
0.10 |
fix(models): Fix LayoutLMv2 NER crash and broken batched tru |
|
Succinct, technical summary; not AI-gene |
2026-02-20 |
| PR |
0.10 |
fix(utils): Make torch_compilable_check compatible with torc |
|
Includes reasoning, but phrasing remains |
2026-02-24 |
| PR |
0.10 |
model: Add DEIMv2 to Transformers |
|
Direct, informal, includes notebook link |
2026-02-27 |
| PR |
0.10 |
fix(tokenizer): Only strip Fast from class names in AutoToke |
|
Very technical, includes links, no AI-st |
2026-03-04 |
| PR |
0.10 |
fix(testing): Fix MoonshineEncoder UnboundLocalError and Flo |
|
Concise, technical, references code; not |
2026-03-06 |
| PR |
0.10 |
fix(models, testing): Fix Llama4 vision rotary meta tensor i |
|
Technical, concise, uses domain-specific |
2026-03-10 |
| PR |
0.10 |
fix(models): Forward timm model kwargs to timm.create_model |
|
Technical, uses YAML block & technical c |
2026-03-11 |
| PR |
0.10 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Terse, direct, references CI/test detail |
2026-03-14 |
| PR |
0.10 |
fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures |
|
Direct, contains domain references and l |
2026-03-16 |
| PR |
0.10 |
fix(models): Fix Perceiver interpolate_pos_encoding interpol |
|
Direct, references commit hashes, human |
2026-03-20 |
| PR |
0.10 |
fix(models): Fix dtype mismatch in SwitchTransformers and Ti |
|
Contains commit references, technical fo |
2026-03-27 |
| PR |
0.10 |
fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni |
|
In-depth and technical, uses specific ja |
2026-04-02 |
| PR |
0.10 |
feat[vLLM × v5]: Add vLLM compatibility for audio models |
|
Brief, domain-focused, concise technical |
2026-04-08 |
| PR |
0.10 |
fix(testing): Fix Parakeet, Evolla, Pi0, and Phi-3 test fail |
|
Technical summary, grouped failures, hum |
2026-03-25 |
| PR |
0.10 |
Ignore CLIP position_ids in unexpected key loading report |
|
Technical and concise, natural terminolo |
2026-04-12 |
| PR |
0.10 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Direct technical language and domain-spe |
2026-04-18 |
| PR |
0.10 |
[loading] Clean way to add/remove full parts in checkpoint n |
|
Brief, relevant, and informal phrasing; |
2026-04-15 |
| PR |
0.10 |
Blockwise mask fn as opt arg in all masking functions |
|
Informal, uses unfinished sentences, dom |
2026-04-16 |
| PR |
0.10 |
Update quants tests |
|
Brief, informal, and uses abbreviations |
2026-04-16 |
| PR |
0.10 |
[`Conversion Mapping`] Small fixups |
|
Technical list format; lacks AI-like pat |
2026-04-16 |
| PR |
0.10 |
Make Gemma4ClippableLinear inherit from nn.Linear for PEFT/L |
|
Concise, technical, and context-specific |
2026-04-12 |
| PR |
0.10 |
Fix conversion mappings for vlms |
|
Contains domain context and references; |
2026-04-09 |
| PR |
0.10 |
typing: rule 15 - checks for tie_word_embeddings presence |
|
Terse wording; insertion of rule in tech |
2026-03-25 |
| PR |
0.10 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Technical focus with casual parenthetica |
2026-04-09 |
| PR |
0.10 |
Fix model parallel issue for altclip model and ChineseClip m |
|
Technical, targets specific test failure |
2026-04-17 |
| PR |
0.10 |
Fix response api support |
|
Contains informal tone and abbreviations |
2026-04-15 |
| PR |
0.10 |
Fix EP: RouterParallel shape, tp_plan property, grouped_mm s |
|
Technical content, specific bug explanat |
2026-04-16 |
| PR |
0.10 |
Generic Sequence Classifier works for multimodal models |
|
Casual language, uses 'ig', references d |
2026-03-13 |
| PR |
0.10 |
Allow loading Qwen Thinker 'base' models without generative |
|
Technical explanation, casual tone, doma |
2026-04-15 |
| PR |
0.08 |
fix: return empty tuple from import_protobuf_decode_error wh |
|
Clear technical explanation, some formal |
2026-04-16 |
| PR |
0.05 |
Refactor OwlViT to modular Transformers |
|
Domain-specific terms and terse language |
2026-03-27 |
| PR |
0.05 |
Fix: modular image processors |
|
Concise, references a specific PR; direc |
2026-04-17 |
| PR |
0.05 |
[WIP] Major processing refactor |
|
Brief, technical summary, uses domain te |
2026-04-17 |
| PR |
0.05 |
Add GGUF loading support for Qwen3-Next (qwen3_next) archite |
|
Technical, contains abbreviations and do |
2026-02-17 |
| PR |
0.05 |
Fix: propagate quantization_config to text sub-config for co |
|
Specific terminology, concise issue desc |
2026-04-17 |
| PR |
0.05 |
Add CLIP-like models in conversion to VLMs |
|
Terse, issue links, technical context, n |
2026-04-10 |
| PR |
0.02 |
Align gemma3n cache sharing to gemma4 |
|
Short, references specific PRs, domain-s |
2026-04-17 |
| PR |
0.01 |
fix(x_clip): fix 8 failed test cases |
|
Very terse, minimal description; human e |
2026-04-13 |
| PR |
0.01 |
Remove redundant condition checks in `get_image_size` method |
|
Extremely terse, direct, no AI-like phra |
2026-04-15 |
| COMMIT |
0.00 |
revert sha commit pointing to main for transformers_amd_ci_ |
|
Brief, technical, terse revert commit ty |
2026-04-17 |
| COMMIT |
0.00 |
Fix ZeRO-3 from_pretrained: load registered buffers in _load |
|
Technical language, specific errors, dir |
2026-04-17 |
| COMMIT |
0.00 |
Remove redundant condition checks in `get_image_size` method |
|
Terse bullet points, technical, informal |
2026-04-17 |
| COMMIT |
0.00 |
add Qianfan-OCR model definition (#45280) |
|
Informal, iterative fix messages, domain |
2026-04-17 |
| COMMIT |
0.00 |
Add check-auto in repo-consistency and fix sorting (#45481) |
|
Colloquial wording, speculative ('maybe? |
2026-04-17 |
| COMMIT |
0.00 |
Fix typos in src/transformers/utils/output_capturing.py (#45 |
|
Concise typo fix, no free-text, purely a |
2026-04-17 |
| COMMIT |
0.00 |
typing: rule 15 - checks for tie_word_embeddings presence (# |
|
Brief update notes, technical context, c |
2026-04-17 |
| COMMIT |
0.00 |
[CB] Fix capture of max_seqlen (#45323) |
|
Informal commit titles, multiple granula |
2026-04-17 |
| COMMIT |
0.00 |
Fix response api support (#45463) |
|
Commit messages are terse, informal, sho |
2026-04-16 |
| COMMIT |
0.00 |
Minor update (#45484) |
|
Minimal human-written commit; contains C |
2026-04-16 |
| COMMIT |
0.00 |
Allow loading Qwen Thinker 'base' models without generative |
|
Technical explanation, domain language, |
2026-04-16 |
| COMMIT |
0.00 |
[`fix`] Always early return for non-Mistral models in _patch |
|
Human-style summary and messages, includ |
2026-04-16 |
| COMMIT |
0.00 |
Fix spurious position_ids warnings for at least 40 architect |
|
Structured, technical commit explanation |
2026-04-16 |
| COMMIT |
0.00 |
[`fix`] Make Qwen2_5OmniProcessor warning a lot less noisy v |
|
Multiple edits and reverts; informal sty |
2026-04-16 |
| COMMIT |
0.00 |
Dynamic auto mapping (#45018) |
|
Terse, messy progression; highly informa |
2026-04-16 |
| COMMIT |
0.00 |
[serve] Forward `tool_calls`/`tool_call_id` in processor inp |
|
Commit message is concise, technical, an |
2026-04-15 |
| COMMIT |
0.00 |
[docs] vlm addition (#45271) |
|
Short, informal commit messages with min |
2026-04-15 |
| COMMIT |
0.00 |
fix: dont download artifacts from the test hub (#45319) |
|
Commit log has informal, domain-specific |
2026-04-15 |
| COMMIT |
0.00 |
refactor(qa): extend extras so ty can run on server modules |
|
Brief, domain-specific commit message wi |
2026-04-15 |
| COMMIT |
0.00 |
fix(clipseg): fix 2 failing tests (#45403) |
|
Technical, informal bullet points, signe |
2026-04-15 |
| COMMIT |
0.00 |
[docs] @auto_docstring decorator (#45130) |
|
Concise, informal commit messages typica |
2026-04-15 |
| COMMIT |
0.00 |
Fix Sam3Processor missing input_boxes_labels for padded None |
|
Detailed technical explanation using dom |
2026-04-15 |
| COMMIT |
0.00 |
Multimodal serve support (#45220) |
|
Informal, domain-based commit history, h |
2026-04-15 |
| COMMIT |
0.00 |
better grad acc tests (#45434) |
|
Terse commit message with domain abbrevi |
2026-04-15 |
| COMMIT |
0.00 |
avoid wrap 4bit-quantized model into DP (#45407) |
|
Signed by human; no AI indicators in con |
2026-04-15 |
| COMMIT |
0.00 |
Add example for iterative chatting with MLLMs (#45398) |
|
Commit message is terse and has co-autho |
2026-04-15 |
| COMMIT |
0.00 |
Gemma4 resizing per layer inputs (#45324) |
|
Commit message is short, uses domain jar |
2026-04-15 |
| COMMIT |
0.00 |
Add `step3_vl` to `MODELS_WITH_INCORRECT_HUB_TOKENIZER_CLASS |
|
Commit is a standard changelog with clea |
2026-04-15 |
| COMMIT |
0.00 |
Update workflow references to new commit hash (#45442) |
|
Standard commit message; minimal human w |
2026-04-14 |
| COMMIT |
0.00 |
[Doc] Correct checkpoint path in Dinov2 model_docs (#45430) |
|
Commit is concise and fixes a specific t |
2026-04-14 |
| COMMIT |
0.00 |
Fix ty for transformers cli (#45190) |
|
Casual, terse commit messages typical of |
2026-04-14 |
| COMMIT |
0.00 |
fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni |
|
Natural, technical commit flow; includes |
2026-04-14 |
| COMMIT |
0.00 |
Fix Qwen2.5VL temporal grid positions (#45400) |
|
Commit messages are brief, casual, and h |
2026-04-14 |
| COMMIT |
0.00 |
[`fix`] PEFT integration fixes preventing save/load & integr |
|
Technical, concise, with human co-author |
2026-04-14 |
| COMMIT |
0.00 |
Fix the response schema for the gemma4 converter (#45411) |
|
Short, direct PR summary; no AI-style ph |
2026-04-14 |
| COMMIT |
0.00 |
Fix `apply_chat_template` crash on `tool_call` messages with |
|
Terse commit messages with domain terms; |
2026-04-13 |
| COMMIT |
0.00 |
Add SAM3-LiteText (#44320) |
|
Very terse, incremental commit messages; |
2026-04-13 |
| COMMIT |
0.00 |
Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is |
|
Detailed explanation with domain context |
2026-04-13 |
| COMMIT |
0.00 |
[AMD CI] Fix torch.compile/export failures on AMD CI due to |
|
Brief, informal commit messages; typical |
2026-04-13 |
| COMMIT |
0.00 |
[inference_fusion] convert conv3d patch embed to linear (#45 |
|
Informal commit messages with technical |
2026-04-13 |
| COMMIT |
0.00 |
Fix #45305 + add regression test GAS (#45349) |
|
Terse, informal tone with inline technic |
2026-04-13 |
| COMMIT |
0.00 |
Update `trackio` integration to use Buckets and "freeze" Spa |
|
Mostly placeholder commit messages, no A |
2026-04-13 |
| COMMIT |
0.00 |
Fix: NotebookProgressCallback crash when evaluating with the |
|
Technical bugfixes and test updates in b |
2026-04-13 |
| COMMIT |
0.00 |
Less unnecessary RoPE warnings (#45289) |
|
Terse, domain-specific commit; no AI sig |
2026-04-13 |
| COMMIT |
0.00 |
[`Tokenizers`] Move gpt sw3 tokenizer out (#45404) |
|
Brief, technical fix; no AI phrasing pre |
2026-04-13 |
| COMMIT |
0.00 |
Fix unintended Hub metadata calls from _patch_mistral_regex |
|
Detailed, technical with domain slang; h |
2026-04-13 |
| COMMIT |
0.00 |
Fix MoE routers returning probabilities instead of logits (# |
|
Technical fix, informal style; clearly h |
2026-04-13 |
| COMMIT |
0.00 |
Fix NaN weights on non-rank-0 FSDP processes (#45050) |
|
Short, domain-specific; lacks AI tone. |
2026-04-13 |
| COMMIT |
0.00 |
remove cache file from tree (#45392) |
|
Very terse commit about cache file; huma |
2026-04-13 |
| COMMIT |
0.00 |
[docs] training on specific hardware (#44799) |
|
Commit messages are terse and minimal, t |
2026-04-10 |
| COMMIT |
0.00 |
[docs] zero + sequence parallelism (#44605) |
|
Very brief, domain-specific shorthand; n |
2026-04-10 |
| COMMIT |
0.00 |
Fix vlm weight mappings (#45358) |
|
Informal tone and shorthand imply human |
2026-04-10 |
| COMMIT |
0.00 |
Copy the template resolution logic from the base apply_chat_ |
|
Direct, informal commit log—no AI phrasi |
2026-04-10 |
| COMMIT |
0.00 |
add kwargs to all methods in the CallbackHandler class (#453 |
|
No text beyond the conventional PR title |
2026-04-10 |
| COMMIT |
0.00 |
Close file handler (#45187) |
|
Terse, technical fix with human co-autho |
2026-04-10 |
| COMMIT |
0.00 |
fix: restore mypy type checking for PreTrainedConfig subclas |
|
Technical summary with explicit changelo |
2026-04-10 |
| COMMIT |
0.00 |
`cohere_asr`: fix device issue for `test_model_parallel_beam |
|
Domain-specific fixes, Signed-off-by and |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in Gemma3ForConditionalGeneration and Gem |
|
Standard patch title and human co-author |
2026-04-10 |
| COMMIT |
0.00 |
fix bug for videomt model device mismatch (#45204) |
|
Domain-specific, includes Signed-off-by; |
2026-04-10 |
| COMMIT |
0.00 |
fix gemma4 gradient accumulation loss and last token incorre |
|
Terse commit messages and domain abbrevi |
2026-04-10 |
| COMMIT |
0.00 |
Logger has `[transformers]` prefix in non-verbose mode (#453 |
|
Very short, casual commit messages sugge |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in AssistantToTargetTranslator.unmap_inpu |
|
Technical problem explanation, domain de |
2026-04-10 |
| COMMIT |
0.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
Technical and precise, contains domain-s |
2026-04-10 |
| COMMIT |
0.00 |
musicflamingo: add test support for Intel XPU device (#45212 |
|
Terse, domain-specific, and includes sig |
2026-04-10 |
| COMMIT |
0.00 |
nomic_bert: make the test suitable for general device. (#452 |
|
Minimal, with only template sign-off and |
2026-04-10 |
| COMMIT |
0.00 |
Skip invalid flash-attn tests for `pi0` model (#45011) |
|
Terse, informal commit messages and incl |
2026-04-10 |
| COMMIT |
0.00 |
Add cuda compatibility check for using `grouped_mm` (#45001) |
|
Terse commit subject; technical co-autho |
2026-04-10 |
| COMMIT |
0.00 |
Load adapter with TP (#45155) |
|
Terse, uses technical abbreviations, no |
2026-04-09 |
| COMMIT |
0.00 |
[docs] tp training (#44613) |
|
Very minimal, domain-specific shorthand, |
2026-04-09 |
| COMMIT |
0.00 |
[docs] training performance (#44342) |
|
Short, informal phrases typical of human |
2026-04-09 |
| COMMIT |
0.00 |
[docs] optimizers, hyperparam search, training features (#44 |
|
Informal list format, domain terms, not |
2026-04-09 |
| COMMIT |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Domain-specific wording, includes co-aut |
2026-04-09 |
| COMMIT |
0.00 |
Use torchvision `decode_image` to load images in the torchv |
|
Succinct, technical, includes co-author, |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Fix device map auto (#45347) |
|
Brief, uses domain terminology, not AI g |
2026-04-09 |
| COMMIT |
0.00 |
Refactor CLIP-like models (#44431) |
|
Terse, casual edits, clear human voice t |
2026-04-09 |
| COMMIT |
0.00 |
refactor: display test duration (#45344) |
|
Minimal, domain-focused, typical of huma |
2026-04-09 |
| COMMIT |
0.00 |
http retries on audio file downloads (#45126) |
|
Technical, brief, informal, no AI-like l |
2026-04-09 |
| COMMIT |
0.00 |
Fix `Wav2Vec2Config.vocab_size` type to allow `None` (#45108 |
|
Succinct commit, human style, includes d |
2026-04-09 |
| COMMIT |
0.00 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Terse, technical, informal and direct co |
2026-04-09 |
| COMMIT |
0.00 |
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate |
|
Technical jargon and fix history, not AI |
2026-04-09 |
| COMMIT |
0.00 |
Add THD support in ESM (#44145) |
|
Sequence of technical steps and signed-o |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Remove all shared weights, and silently skip them d |
|
Short, informal phrases and abbreviation |
2026-04-09 |
| COMMIT |
0.00 |
Fix conversion mappings for vlms (#45340) |
|
Terse and informal with typos, not AI-ge |
2026-04-09 |
| COMMIT |
0.00 |
Fix resize failure caused by zero-sized masks in PP-DocLayou |
|
Casual tone, short phrasing, domain lang |
2026-04-09 |
| COMMIT |
0.00 |
chore: added circleci python script to ruff and ty checkers |
|
Single-sentence changes, abbreviations, |
2026-04-09 |
| COMMIT |
0.00 |
tweak checkers output on errors (#45163) |
|
Succinct, with human-like test fix expla |
2026-04-09 |
| COMMIT |
0.00 |
fix: leak in tokenizer registry for `test_processors` (#4531 |
|
Short, typo ('reigstry'), human casual s |
2026-04-09 |
| COMMIT |
0.00 |
chore: remove test_hub for now (#45337) |
|
Concise, informal commit message typical |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Dissociate kv states sharing from the Cache (#45312 |
|
Terse commit history, domain-specific an |
2026-04-09 |
| COMMIT |
0.00 |
Fix `text-to-speech` pipeline crash when generation config c |
|
Direct, domain-specific commit message w |
2026-04-08 |
| COMMIT |
0.00 |
[docs] pipeline cleanup (#44954) |
|
Extremely terse; typical human doc updat |
2026-04-08 |
| COMMIT |
0.00 |
Add MoE to Gemma4 TP plan (#45219) |
|
Concise, domain-language, signed off by |
2026-04-08 |
| PR |
0.00 |
TP refactor for FSDP + TP integration |
|
Very terse, includes TODOs and questions |
2026-03-26 |
| PR |
0.00 |
Add Molmo2 |
|
Only title/template present, insufficien |
2026-01-23 |
| PR |
0.00 |
Add V-JEPA 2.1 inference support |
|
Template plus technical, domain-specific |
2026-04-17 |
| PR |
0.00 |
:rotating_light: [`Kernels`] Fix kernel function registratio |
|
Template plus normal technical summary, |
2026-04-13 |
| PR |
0.00 |
[serve] Update tool call to switch to `parse_response` |
|
Uses PR template; free-text section is i |
2026-04-16 |
| PR |
0.00 |
Fix ZeRO-3 from_pretrained: load registered buffers in _load |
|
PR template used; technical diagnostic s |
2026-04-13 |
| PR |
0.00 |
Add check-auto in repo-consistency and fix sorting |
|
Single punctuation mark; no content to j |
2026-04-16 |
| PR |
0.00 |
throw error when conversion required |
|
Short, direct, uses explicit references, |
2026-03-27 |
| PR |
0.00 |
chore: bump doc-builder SHA for main doc build workflow |
|
Brief, domain-specific commit message; n |
2026-04-16 |
| PR |
0.00 |
Fix typos in src/transformers/utils/output_capturing.py |
|
Brief, specific, and informal; clear hum |
2026-04-06 |
| PR |
0.00 |
[CB] Fix capture of max_seqlen |
|
Domain-specific fix, concise and technic |
2026-04-08 |
| PR |
0.00 |
generation/stopping_criteria: short-circuit StoppingCriteria |
|
Terse, technical explanation with jargon |
2026-04-12 |
| PR |
0.00 |
Draft commit |
|
Extremely brief and informal; no AI indi |
2026-04-15 |
| PR |
0.00 |
[Don't merge] Call CI workflow |
|
Template; free-text is minimal and unpol |
2026-04-16 |
| PR |
0.00 |
Minor update |
|
Extremely brief, no AI hallmarks. |
2026-04-16 |
| PR |
0.00 |
Dynamic auto mapping |
|
Brief, domain-specific explanation; no A |
2026-03-26 |
| PR |
0.00 |
Fix spurious position_ids warnings for at least 40 architect |
|
PR content is missing; only template is |
2026-04-14 |
| PR |
0.00 |
[`fix`] Make Qwen2_5OmniProcessor warning a lot less noisy v |
|
Human tone with domain-specific referenc |
2026-04-15 |
| PR |
0.00 |
Fix MPS SDPA output shape when value head dim differs from q |
|
Direct, technical description with domai |
2026-04-16 |
| PR |
0.00 |
fix(tokenization): re-raise ImportError to allow RuntimeErro |
|
Technical summary, some truncation, no s |
2026-04-15 |
| PR |
0.00 |
Add expert parallelism (EP) support for Qwen3 MoE + fix Grou |
|
Lists features/fixes in concise, domain- |
2026-04-14 |