| Type | Score | Title | | Rationale | Date |
|---|---|---|---|---|---|
| COMMIT | 1.00 | Add Neuron to auto-compile hardware list (#44757) | | Commit message contains explicit AI assi | 2026-04-16 |
| COMMIT | 1.00 | Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex | | Commit message contains explicit AI assi | 2026-04-13 |
| COMMIT | 1.00 | fix(serving): resolve rust tokenizer from ProcessorMixin in | | Commit message contains explicit AI assi | 2026-04-13 |
| COMMIT | 1.00 | fix(qwen3_moe): correct return type annotation on Qwen3MoeSp | | Explicit mention of Claude Code and 'Bui | 2026-04-13 |
| COMMIT | 1.00 | docs: fix 5 docstring errors in Gemma3nTextConfig (typos, gr | | Mentions 'Built by Rudrendu Paul, develo | 2026-04-13 |
| PR | 1.00 | Add Xiaomi MiMo-V2 | | PR body explicitly mentions AI collabora | 2026-03-31 |
| PR | 1.00 | SonicMoe | | PR body explicitly mentions AI collabora | 2026-04-14 |
| PR | 1.00 | Add EXAONE 4.5 implementations | | PR body explicitly mentions AI collabora | 2026-04-16 |
| PR | 1.00 | revert sha commit pointing to main for transformers_amd_ci_ | | PR body explicitly mentions AI collabora | 2026-04-17 |
| PR | 1.00 | Add ctsm model | | PR body explicitly mentions AI collabora | 2026-04-17 |
| PR | 1.00 | add Qianfan-OCR model definition | | PR body explicitly mentions AI collabora | 2026-04-07 |
| PR | 1.00 | add expert parallelism for gemma-4-26B-A4B-it | | PR body explicitly mentions AI collabora | 2026-04-07 |
| PR | 1.00 | resize_token_embeddings does not effect to output_embeddings | | PR body explicitly mentions AI collabora | 2026-04-08 |
| PR | 1.00 | Extract dynamic vision/audio tensors into standalone pure fu | | PR body explicitly mentions AI collabora | 2026-04-13 |
| PR | 1.00 | fix(testing_utils): guard get_device_capability with torch.c | | PR body explicitly mentions AI collabora | 2026-04-16 |
| PR | 1.00 | Fix Sam3Processor missing input_boxes_labels for padded None | | PR body explicitly mentions AI collabora | 2026-04-01 |
| PR | 1.00 | Pass packed boundary metadata to Qwen3.5 linear-attention fa | | PR body explicitly mentions AI collabora | 2026-03-26 |
| PR | 1.00 | Fix EtaLogitsWarper on fully masked logits | | PR body explicitly mentions AI collabora | 2026-04-13 |
| PR | 0.60 | 🚨 Refactor ViT to updated standards | | Uses phrase 'This PR aims at', formal to | 2025-10-17 |
| PR | 0.30 | Proposal to add Qwen3-ASR support [WIP] | | Some formal phrasing like 'This PR adds' | 2026-02-08 |
| PR | 0.20 | Parakeet tdt | | Technical phrasing and references; no AI | 2026-02-20 |
| PR | 0.20 | chore(qa): split out mlinter | | Uses bullet points and domain terms; no | 2026-04-16 |
| PR | 0.20 | Add option to export encoder hidden states for Granite-spee | | Concise, domain-specific; no strong AI s | 2026-03-03 |
| PR | 0.20 | Adding support for Nandi Models | | Informal thanks and custom shout-outs; l | 2026-03-29 |
| PR | 0.20 | Add IndexCache support for GLM5 DSA | | Concise but more formal, some AI-like ma | 2026-04-14 |
| PR | 0.20 | Add Neuron to auto-compile hardware list | | Concise, technical, but slightly more fo | 2026-03-16 |
| PR | 0.20 | Drop `content=None` from messages in `apply_chat_template` | | Focused on technical fix, uses technical | 2026-04-14 |
| PR | 0.20 | [`fix`] Always early return for non-Mistral models in _patch | | Mix of template structure and technical | 2026-04-14 |
| PR | 0.20 | model: Add DEIMv2 to Transformers | | Short, casual bullet points and emoji; n | 2026-02-27 |
| PR | 0.13 | fix(gemma3, gemma4): default token_type_ids to zeros for tex | | Domain-specific error referencing, infor | 2026-04-03 |
| PR | 0.12 | Add /v1/completions endpoint (OpenAI legacy completions API) | | Domain-specific, natural explanation; no | 2026-03-10 |
| PR | 0.11 | fix(auto): Map deepseek_v2 and deepseek_v3 to LlamaTokenizer | | Technical motivation/problem section, do | 2026-03-17 |
| COMMIT | 0.10 | [Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel | | Uses technical detail and casual, non-fo | 2026-04-14 |
| COMMIT | 0.10 | fix: prevent accelerate from splitting vision encoder by set | | Human, domain-focused writing and inform | 2026-04-14 |
| COMMIT | 0.10 | [Doc] MoE routing capture and replay recipe (#44925) | | Informal style with partial sentences an | 2026-04-14 |
| PR | 0.10 | [loading] Clean way to add/remove full parts in checkpoint n | | Short and technical, no AI style detecte | 2026-04-15 |
| PR | 0.10 | Blockwise mask fn as opt arg in all masking functions | | Informal, uses unfinished sentences, dom | 2026-04-16 |
| PR | 0.10 | Update quants tests | | Brief, informal, and uses abbreviations | 2026-04-16 |
| PR | 0.10 | [`Conversion Mapping`] Small fixups | | Technical list format; lacks AI-like pat | 2026-04-16 |
| PR | 0.10 | Make Gemma4ClippableLinear inherit from nn.Linear for PEFT/L | | Concise, technical, and context-specific | 2026-04-12 |
| PR | 0.10 | Fix conversion mappings for vlms | | Contains domain context and references; | 2026-04-09 |
| PR | 0.10 | typing: rule 15 - checks for tie_word_embeddings presence | | Terse wording; insertion of rule in tech | 2026-03-25 |
| PR | 0.10 | fix(testing_utils): guard get_device_capability with torch.c | | Technical focus with casual parenthetica | 2026-04-09 |
| PR | 0.10 | Fix model parallel issue for altclip model and ChineseClip m | | Technical, targets specific test failure | 2026-04-17 |
| PR | 0.10 | Fix response api support | | Contains informal tone and abbreviations | 2026-04-15 |
| PR | 0.10 | Fix EP: RouterParallel shape, tp_plan property, grouped_mm s | | Technical content, specific bug explanat | 2026-04-16 |
| PR | 0.10 | Generic Sequence Classifier works for multimodal models | | Casual language, uses 'ig', references d | 2026-03-13 |
| PR | 0.10 | Allow loading Qwen Thinker 'base' models without generative | | Technical explanation, casual tone, doma | 2026-04-15 |
| PR | 0.10 | Gemma4 training with text-only samples | | Uses domain terms like PG and concise ex | 2026-04-15 |
| PR | 0.10 | [docs] vlm addition | | Terse, domain-specific and informal; cle | 2026-04-06 |
| PR | 0.10 | [docs] model testing | | Terse and informal; rewrite focus and ca | 2026-03-31 |
| PR | 0.10 | [docs] @auto_docstring decorator | | Succinct technical bullet points; straig | 2026-03-30 |
| PR | 0.10 | Multimodal serve support | | Informal summary with domain jargon and | 2026-04-03 |
| PR | 0.08 | fix: return empty tuple from import_protobuf_decode_error wh | | Clear technical explanation, some formal | 2026-04-16 |
| PR | 0.05 | Refactor OwlViT to modular Transformers | | Domain-specific terms and terse language | 2026-03-27 |
| PR | 0.05 | Fix: modular image processors | | Concise, references a specific PR; direc | 2026-04-17 |
| PR | 0.05 | [WIP] Major processing refactor | | Brief, technical summary, uses domain te | 2026-04-17 |
| PR | 0.05 | Add GGUF loading support for Qwen3-Next (qwen3_next) archite | | Technical, contains abbreviations and do | 2026-02-17 |
| PR | 0.05 | Fix: propagate quantization_config to text sub-config for co | | Specific terminology, concise issue desc | 2026-04-17 |
| PR | 0.05 | Add CLIP-like models in conversion to VLMs | | Terse, issue links, technical context, n | 2026-04-10 |
| PR | 0.05 | chore(typing): added modeling_utils to ty | | Brief technical update, informal, typos; | 2026-04-14 |
| PR | 0.05 | fix(clipseg): fix 2 failing tests | | Brief, informal wording; domain abbrevia | 2026-04-13 |
| PR | 0.05 | better grad acc tests | | Informal, promise to colleague; domain p | 2026-04-14 |
| PR | 0.05 | feat: bump min safetensors version to `0.8.0-rc.0` | | Brief, informal update; domain terminolo | 2026-04-14 |
| PR | 0.02 | Align gemma3n cache sharing to gemma4 | | Short, references specific PRs, domain-s | 2026-04-17 |
| PR | 0.01 | fix(x_clip): fix 8 failed test cases | | Very terse, minimal description; human e | 2026-04-13 |
| PR | 0.01 | Remove redundant condition checks in `get_image_size` method | | Extremely terse, direct, no AI-like phra | 2026-04-15 |
| COMMIT | 0.00 | revert sha commit pointing to main for transformers_amd_ci_ | | Brief, technical, terse revert commit ty | 2026-04-17 |
| COMMIT | 0.00 | Fix ZeRO-3 from_pretrained: load registered buffers in _load | | Technical language, specific errors, dir | 2026-04-17 |
| COMMIT | 0.00 | Remove redundant condition checks in `get_image_size` method | | Terse bullet points, technical, informal | 2026-04-17 |
| COMMIT | 0.00 | add Qianfan-OCR model definition (#45280) | | Informal, iterative fix messages, domain | 2026-04-17 |
| COMMIT | 0.00 | Add check-auto in repo-consistency and fix sorting (#45481) | | Colloquial wording, speculative ('maybe? | 2026-04-17 |
| COMMIT | 0.00 | Fix typos in src/transformers/utils/output_capturing.py (#45 | | Concise typo fix, no free-text, purely a | 2026-04-17 |
| COMMIT | 0.00 | typing: rule 15 - checks for tie_word_embeddings presence (# | | Brief update notes, technical context, c | 2026-04-17 |
| COMMIT | 0.00 | [CB] Fix capture of max_seqlen (#45323) | | Informal commit titles, multiple granula | 2026-04-17 |
| COMMIT | 0.00 | Fix response api support (#45463) | | Commit messages are terse, informal, sho | 2026-04-16 |
| COMMIT | 0.00 | Minor update (#45484) | | Minimal human-written commit; contains C | 2026-04-16 |
| COMMIT | 0.00 | Allow loading Qwen Thinker 'base' models without generative | | Technical explanation, domain language, | 2026-04-16 |
| COMMIT | 0.00 | [`fix`] Always early return for non-Mistral models in _patch | | Human-style summary and messages, includ | 2026-04-16 |
| COMMIT | 0.00 | Fix spurious position_ids warnings for at least 40 architect | | Structured, technical commit explanation | 2026-04-16 |
| COMMIT | 0.00 | [`fix`] Make Qwen2_5OmniProcessor warning a lot less noisy v | | Multiple edits and reverts; informal sty | 2026-04-16 |
| COMMIT | 0.00 | Dynamic auto mapping (#45018) | | Terse, messy progression; highly informa | 2026-04-16 |
| COMMIT | 0.00 | [serve] Forward `tool_calls`/`tool_call_id` in processor inp | | Commit message is concise, technical, an | 2026-04-15 |
| COMMIT | 0.00 | [docs] vlm addition (#45271) | | Short, informal commit messages with min | 2026-04-15 |
| COMMIT | 0.00 | fix: dont download artifacts from the test hub (#45319) | | Commit log has informal, domain-specific | 2026-04-15 |
| COMMIT | 0.00 | refactor(qa): extend extras so ty can run on server modules | | Brief, domain-specific commit message wi | 2026-04-15 |
| COMMIT | 0.00 | fix(clipseg): fix 2 failing tests (#45403) | | Technical, informal bullet points, signe | 2026-04-15 |
| COMMIT | 0.00 | [docs] @auto_docstring decorator (#45130) | | Concise, informal commit messages typica | 2026-04-15 |
| COMMIT | 0.00 | Fix Sam3Processor missing input_boxes_labels for padded None | | Detailed technical explanation using dom | 2026-04-15 |
| COMMIT | 0.00 | Multimodal serve support (#45220) | | Informal, domain-based commit history, h | 2026-04-15 |
| COMMIT | 0.00 | better grad acc tests (#45434) | | Terse commit message with domain abbrevi | 2026-04-15 |
| COMMIT | 0.00 | avoid wrap 4bit-quantized model into DP (#45407) | | Signed by human; no AI indicators in con | 2026-04-15 |
| COMMIT | 0.00 | Add example for iterative chatting with MLLMs (#45398) | | Commit message is terse and has co-autho | 2026-04-15 |
| COMMIT | 0.00 | Gemma4 resizing per layer inputs (#45324) | | Commit message is short, uses domain jar | 2026-04-15 |
| COMMIT | 0.00 | Add `step3_vl` to `MODELS_WITH_INCORRECT_HUB_TOKENIZER_CLASS | | Commit is a standard changelog with clea | 2026-04-15 |
| COMMIT | 0.00 | Update workflow references to new commit hash (#45442) | | Standard commit message; minimal human w | 2026-04-14 |
| COMMIT | 0.00 | [Doc] Correct checkpoint path in Dinov2 model_docs (#45430) | | Commit is concise and fixes a specific t | 2026-04-14 |
| COMMIT | 0.00 | Fix ty for transformers cli (#45190) | | Casual, terse commit messages typical of | 2026-04-14 |
| COMMIT | 0.00 | fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni | | Natural, technical commit flow; includes | 2026-04-14 |
| COMMIT | 0.00 | Fix Qwen2.5VL temporal grid positions (#45400) | | Commit messages are brief, casual, and h | 2026-04-14 |
| COMMIT | 0.00 | [`fix`] PEFT integration fixes preventing save/load & integr | | Technical, concise, with human co-author | 2026-04-14 |
| COMMIT | 0.00 | Fix the response schema for the gemma4 converter (#45411) | | Short, direct PR summary; no AI-style ph | 2026-04-14 |
| COMMIT | 0.00 | Fix `apply_chat_template` crash on `tool_call` messages with | | Terse commit messages with domain terms; | 2026-04-13 |
| COMMIT | 0.00 | Add SAM3-LiteText (#44320) | | Very terse, incremental commit messages; | 2026-04-13 |
| COMMIT | 0.00 | Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is | | Detailed explanation with domain context | 2026-04-13 |
| COMMIT | 0.00 | [AMD CI] Fix torch.compile/export failures on AMD CI due to | | Brief, informal commit messages; typical | 2026-04-13 |
| COMMIT | 0.00 | [inference_fusion] convert conv3d patch embed to linear (#45 | | Informal commit messages with technical | 2026-04-13 |
| COMMIT | 0.00 | Fix #45305 + add regression test GAS (#45349) | | Terse, informal tone with inline technic | 2026-04-13 |
| COMMIT | 0.00 | Update `trackio` integration to use Buckets and "freeze" Spa | | Mostly placeholder commit messages, no A | 2026-04-13 |
| COMMIT | 0.00 | Fix: NotebookProgressCallback crash when evaluating with the | | Technical bugfixes and test updates in b | 2026-04-13 |
| COMMIT | 0.00 | Less unnecessary RoPE warnings (#45289) | | Terse, domain-specific commit; no AI sig | 2026-04-13 |
| COMMIT | 0.00 | [`Tokenizers`] Move gpt sw3 tokenizer out (#45404) | | Brief, technical fix; no AI phrasing pre | 2026-04-13 |
| COMMIT | 0.00 | Fix unintended Hub metadata calls from _patch_mistral_regex | | Detailed, technical with domain slang; h | 2026-04-13 |
| COMMIT | 0.00 | Fix MoE routers returning probabilities instead of logits (# | | Technical fix, informal style; clearly h | 2026-04-13 |
| COMMIT | 0.00 | Fix NaN weights on non-rank-0 FSDP processes (#45050) | | Short, domain-specific; lacks AI tone. | 2026-04-13 |
| COMMIT | 0.00 | remove cache file from tree (#45392) | | Very terse commit about cache file; huma | 2026-04-13 |
| COMMIT | 0.00 | [docs] training on specific hardware (#44799) | | Commit messages are terse and minimal, t | 2026-04-10 |
| COMMIT | 0.00 | [docs] zero + sequence parallelism (#44605) | | Very brief, domain-specific shorthand; n | 2026-04-10 |
| COMMIT | 0.00 | Fix vlm weight mappings (#45358) | | Informal tone and shorthand imply human | 2026-04-10 |
| COMMIT | 0.00 | Copy the template resolution logic from the base apply_chat_ | | Direct, informal commit log; no AI phrasi | 2026-04-10 |
| COMMIT | 0.00 | add kwargs to all methods in the CallbackHandler class (#453 | | No text beyond the conventional PR title | 2026-04-10 |
| COMMIT | 0.00 | Close file handler (#45187) | | Terse, technical fix with human co-autho | 2026-04-10 |
| COMMIT | 0.00 | fix: restore mypy type checking for PreTrainedConfig subclas | | Technical summary with explicit changelo | 2026-04-10 |
| COMMIT | 0.00 | `cohere_asr`: fix device issue for `test_model_parallel_beam | | Domain-specific fixes, Signed-off-by and | 2026-04-10 |
| COMMIT | 0.00 | Fix AttributeError in Gemma3ForConditionalGeneration and Gem | | Standard patch title and human co-author | 2026-04-10 |
| COMMIT | 0.00 | fix bug for videomt model device mismatch (#45204) | | Domain-specific, includes Signed-off-by; | 2026-04-10 |
| COMMIT | 0.00 | fix gemma4 gradient accumulation loss and last token incorre | | Terse commit messages and domain abbrevi | 2026-04-10 |
| COMMIT | 0.00 | Logger has `[transformers]` prefix in non-verbose mode (#453 | | Very short, casual commit messages sugge | 2026-04-10 |
| COMMIT | 0.00 | Fix AttributeError in AssistantToTargetTranslator.unmap_inpu | | Technical problem explanation, domain de | 2026-04-10 |
| COMMIT | 0.00 | Fix Qwen2.5-VL temporal RoPE scaling applied to still images | | Technical and precise, contains domain-s | 2026-04-10 |
| COMMIT | 0.00 | musicflamingo: add test support for Intel XPU device (#45212 | | Terse, domain-specific, and includes sig | 2026-04-10 |
| COMMIT | 0.00 | nomic_bert: make the test suitable for general device. (#452 | | Minimal, with only template sign-off and | 2026-04-10 |
| COMMIT | 0.00 | Skip invalid flash-attn tests for `pi0` model (#45011) | | Terse, informal commit messages and incl | 2026-04-10 |
| COMMIT | 0.00 | Add cuda compatibility check for using `grouped_mm` (#45001) | | Terse commit subject; technical co-autho | 2026-04-10 |
| COMMIT | 0.00 | Load adapter with TP (#45155) | | Terse, uses technical abbreviations, no | 2026-04-09 |
| COMMIT | 0.00 | [docs] tp training (#44613) | | Very minimal, domain-specific shorthand, | 2026-04-09 |
| COMMIT | 0.00 | [docs] training performance (#44342) | | Short, informal phrases typical of human | 2026-04-09 |
| COMMIT | 0.00 | [docs] optimizers, hyperparam search, training features (#44 | | Informal list format, domain terms, not | 2026-04-09 |
| COMMIT | 0.00 | Remove unused parameters and improve add_tensor_parallel_hoo | | Domain-specific wording, includes co-aut | 2026-04-09 |
| COMMIT | 0.00 | Use torchvision `decode_image` to load images in the torchv | | Succinct, technical, includes co-author, | 2026-04-09 |
| COMMIT | 0.00 | [gemma4] Fix device map auto (#45347) | | Brief, uses domain terminology, not AI g | 2026-04-09 |
| COMMIT | 0.00 | Refactor CLIP-like models (#44431) | | Terse, casual edits, clear human voice t | 2026-04-09 |
| COMMIT | 0.00 | refactor: display test duration (#45344) | | Minimal, domain-focused, typical of huma | 2026-04-09 |
| COMMIT | 0.00 | http retries on audio file downloads (#45126) | | Technical, brief, informal, no AI-like l | 2026-04-09 |
| COMMIT | 0.00 | Fix `Wav2Vec2Config.vocab_size` type to allow `None` (#45108 | | Succinct commit, human style, includes d | 2026-04-09 |
| COMMIT | 0.00 | fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes | | Terse, technical, informal and direct co | 2026-04-09 |
| COMMIT | 0.00 | [Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate | | Technical jargon and fix history, not AI | 2026-04-09 |
| COMMIT | 0.00 | Add THD support in ESM (#44145) | | Sequence of technical steps and signed-o | 2026-04-09 |
| COMMIT | 0.00 | [gemma4] Remove all shared weights, and silently skip them d | | Short, informal phrases and abbreviation | 2026-04-09 |
| COMMIT | 0.00 | Fix conversion mappings for vlms (#45340) | | Terse and informal with typos, not AI-ge | 2026-04-09 |
| COMMIT | 0.00 | Fix resize failure caused by zero-sized masks in PP-DocLayou | | Casual tone, short phrasing, domain lang | 2026-04-09 |
| COMMIT | 0.00 | chore: added circleci python script to ruff and ty checkers | | Single-sentence changes, abbreviations, | 2026-04-09 |
| COMMIT | 0.00 | tweak checkers output on errors (#45163) | | Succinct, with human-like test fix expla | 2026-04-09 |
| COMMIT | 0.00 | fix: leak in tokenizer registry for `test_processors` (#4531 | | Short, typo ('reigstry'), human casual s | 2026-04-09 |
| COMMIT | 0.00 | chore: remove test_hub for now (#45337) | | Concise, informal commit message typical | 2026-04-09 |
| COMMIT | 0.00 | [gemma4] Dissociate kv states sharing from the Cache (#45312 | | Terse commit history, domain-specific an | 2026-04-09 |
| COMMIT | 0.00 | Fix `text-to-speech` pipeline crash when generation config c | | Direct, domain-specific commit message w | 2026-04-08 |
| COMMIT | 0.00 | [docs] pipeline cleanup (#44954) | | Extremely terse; typical human doc updat | 2026-04-08 |
| COMMIT | 0.00 | Add MoE to Gemma4 TP plan (#45219) | | Concise, domain-language, signed off by | 2026-04-08 |
| PR | 0.00 | Add V-JEPA 2.1 inference support | | Template plus technical, domain-specific | 2026-04-17 |
| PR | 0.00 | :rotating_light: [`Kernels`] Fix kernel function registratio | | Template plus normal technical summary, | 2026-04-13 |
| PR | 0.00 | [serve] Update tool call to switch to `parse_response` | | Uses PR template; free-text section is i | 2026-04-16 |
| PR | 0.00 | Fix ZeRO-3 from_pretrained: load registered buffers in _load | | PR template used; technical diagnostic s | 2026-04-13 |
| PR | 0.00 | Add check-auto in repo-consistency and fix sorting | | Single punctuation mark; no content to j | 2026-04-16 |
| PR | 0.00 | throw error when conversion required | | Short, direct, uses explicit references, | 2026-03-27 |
| PR | 0.00 | chore: bump doc-builder SHA for main doc build workflow | | Brief, domain-specific commit message; n | 2026-04-16 |
| PR | 0.00 | Fix typos in src/transformers/utils/output_capturing.py | | Brief, specific, and informal; clear hum | 2026-04-06 |
| PR | 0.00 | [CB] Fix capture of max_seqlen | | Domain-specific fix, concise and technic | 2026-04-08 |
| PR | 0.00 | generation/stopping_criteria: short-circuit StoppingCriteria | | Terse, technical explanation with jargon | 2026-04-12 |
| PR | 0.00 | Draft commit | | Extremely brief and informal; no AI indi | 2026-04-15 |
| PR | 0.00 | [Don't merge] Call CI workflow | | Template; free-text is minimal and unpol | 2026-04-16 |
| PR | 0.00 | Minor update | | Extremely brief, no AI hallmarks. | 2026-04-16 |
| PR | 0.00 | TP refactor for FSDP + TP integration | | Terse todo list, informal notes, full of | 2026-03-26 |
| PR | 0.00 | Dynamic auto mapping | | Brief, domain-specific explanation; no A | 2026-03-26 |
| PR | 0.00 | Fix spurious position_ids warnings for at least 40 architect | | PR content is missing; only template is | 2026-04-14 |
| PR | 0.00 | [`fix`] Make Qwen2_5OmniProcessor warning a lot less noisy v | | Human tone with domain-specific referenc | 2026-04-15 |
| PR | 0.00 | Fix MPS SDPA output shape when value head dim differs from q | | Direct, technical description with domai | 2026-04-16 |
| PR | 0.00 | fix(tokenization): re-raise ImportError to allow RuntimeErro | | Technical summary, some truncation, no s | 2026-04-15 |
| PR | 0.00 | Add expert parallelism (EP) support for Qwen3 MoE + fix Grou | | Lists features/fixes in concise, domain- | 2026-04-14 |
| PR | 0.00 | Support for BharatGen's Param2MoE model architecture | | Detailed model intro but with domain-rel | 2026-02-10 |
| PR | 0.00 | audio tester class | | Concise, domain-specific explanation wit | 2026-04-13 |
| PR | 0.00 | skip test_flash_attn_2_can_dispatch_composite_models tests f | | Casual tone and direct mention to review | 2026-04-16 |
| PR | 0.00 | Fix: propagate interpolate_pos_encoding through Pixio model | | Filled with domain context and in-progre | 2026-04-16 |
| PR | 0.00 | fix(testing_utils): guard get_device_capability() with torch | | Highly technical, structured like typica | 2026-04-14 |
| PR | 0.00 | Add Gemma4ForSequenceClassification | | Technical terms, references, and concise | 2026-04-14 |
| PR | 0.00 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | | Domain-specific, precise feature additio | 2026-03-30 |
| PR | 0.00 | fix: return empty tuple when protobuf not available | | Uses domain terms, technical details, an | 2026-04-16 |
| PR | 0.00 | [docs] contributing | | Direct and informal, typical of a human | 2026-04-15 |
| PR | 0.00 | Add AudioFlamingoNext model | | PR content uses a bulleted changelog and | 2026-03-18 |
| PR | 0.00 | [serve] Forward `tool_calls`/`tool_call_id` in processor inp | | PR content is abbreviated and technical | 2026-04-13 |
| PR | 0.00 | [WIP] Add CharacterBERT model | | PR content section is empty or template, | 2023-10-05 |
| PR | 0.00 | [docs] modular transformers | | Terse, technical bullet points and abbre | 2026-04-08 |
| PR | 0.00 | avoid wrap 4bit-quantized model into DP | | Abbreviations, mention of users, and inf | 2026-04-13 |
| PR | 0.00 | fix: dont download artifacts from the test hub | | Brief bullet points; domain-specific, no | 2026-04-08 |
| PR | 0.00 | Improve nested `base_model_prefix` handling in weight conver | | Direct reference to specific code files; | 2026-04-13 |
| PR | 0.00 | refactor(qa): extend extras so ty can run on server modules | | Casual style and use of technical slang; | 2026-04-15 |
| PR | 0.00 | chore(sec): added a handful of security checks | | Direct, short change note style; no sign | 2026-04-15 |
| PR | 0.00 | do not index past decoded chars with special tokens | | Extremely terse, no AI hallmarks detecte | 2026-04-14 |
| PR | 0.00 | refactor: replace wildcard imports with explicit imports in | | Technical, domain-specific, informal; no | 2026-04-15 |
| PR | 0.00 | [GGUF] Reduce peak RAM usage by casting dequantized tensors | | Direct, domain-specific; no AI-like phra | 2026-04-12 |