| COMMIT |
1.00 |
Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex |
|
Commit message contains explicit AI assi |
2026-04-13 |
| COMMIT |
1.00 |
fix(serving): resolve rust tokenizer from ProcessorMixin in |
|
Commit message contains explicit AI assi |
2026-04-13 |
| COMMIT |
1.00 |
fix(qwen3_moe): correct return type annotation on Qwen3MoeSp |
|
Explicit mention of Claude Code and 'Bui |
2026-04-13 |
| COMMIT |
1.00 |
docs: fix 5 docstring errors in Gemma3nTextConfig (typos, gr |
|
Mentions 'Built by Rudrendu Paul, develo |
2026-04-13 |
| COMMIT |
1.00 |
Fix vllm cis (#45139) |
|
Commit message contains explicit AI assi |
2026-04-08 |
| COMMIT |
1.00 |
Update tiny model creation script (#45241) |
|
Commit message contains explicit AI assi |
2026-04-04 |
| PR |
1.00 |
Add Xiaomi MiMo-V2 |
|
PR body explicitly mentions AI collabora |
2026-03-31 |
| PR |
1.00 |
Fix `apply_chat_template` crash on `tool_call` messages with |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
Fix EtaLogitsWarper on fully masked logits |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
Fix #45305 + add regression test GAS |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
resize_token_embeddings does not effect to output_embeddings |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
Add support for Voxtral-4B-TTS-2603 to transformers |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
add Qianfan-OCR model definition |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
docs: fix 5 docstring errors in Gemma3nTextConfig (typos, gr |
|
PR body explicitly mentions AI collabora |
2026-04-11 |
| PR |
1.00 |
Fix failing `XCLIPModelIntegrationTest` |
|
PR body explicitly mentions AI collabora |
2026-03-27 |
| PR |
1.00 |
Fix MoE routers returning probabilities instead of logits |
|
PR body explicitly mentions AI collabora |
2026-03-30 |
| PR |
1.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
0.20 |
[Doc] MoE routing capture and replay recipe |
|
Technical language and abbrevs; no AI-st |
2026-03-22 |
| PR |
0.20 |
Proposal: Agent-first CLI |
|
Technical, mentions proposal; no AI phra |
2026-04-03 |
| PR |
0.20 |
fix(generation): handle CUDA multinomial limit in beam searc |
|
Technical bugfix summary, informal phras |
2026-04-11 |
| PR |
0.20 |
Add universal phone recognition model - PhoneticXeus |
|
Mentions SOTA model and tasks; incomplet |
2026-04-10 |
| PR |
0.15 |
feat: add Gemma4ForSequenceClassification |
|
Direct, technical addition; straight to |
2026-04-07 |
| PR |
0.15 |
fix(processing): guard message content access in apply_chat_ |
|
Concise domain-specific summary with hum |
2026-04-12 |
| PR |
0.15 |
[PoC] HF exporters |
|
Casual tone, mentions context with domai |
2025-11-03 |
| PR |
0.14 |
Fix torch only support for fast Processors |
|
Direct explanation, domain-specific, hum |
2025-12-11 |
| PR |
0.14 |
[SqueezeBert] Migrate to standardized output collection deco |
|
Uses domain terms and concise changelog, |
2026-02-15 |
| PR |
0.13 |
fix: prevent accelerate from splitting vision encoder by set |
|
Direct explanation, no AI-style formal p |
2025-12-26 |
| PR |
0.11 |
[Trainer] Support multi-loss component logging |
|
Structured changelog with domain jargon, |
2026-04-06 |
| COMMIT |
0.10 |
Fix `SmolVLM` video processor `resize` using wrong interpola |
|
Technical, detailed explanation with dom |
2026-04-06 |
| COMMIT |
0.10 |
empty (#45261) |
|
Casual tone, technical context, some com |
2026-04-06 |
| PR |
0.10 |
Dynamic auto mapping (PoC) |
|
Slightly verbose but personal thought pr |
2026-03-26 |
| PR |
0.10 |
FSDP2 native support in transformers |
|
Informal tone and technical shorthand in |
2026-02-17 |
| PR |
0.10 |
🚨 Distributed training API |
|
Uses code snippets and technical context |
2026-03-25 |
| PR |
0.10 |
Fix flash_attention_3 detection and import for hopper wheel |
|
Direct and technical phrasing; domain-sp |
2026-04-12 |
| PR |
0.10 |
TP refactor for FSDP + TP integration |
|
Informal, uses domain abbreviations and |
2026-03-26 |
| PR |
0.10 |
Refactor core_model_loading to support FSDP shard-on-read lo |
|
Human-like TODO list, abbreviations, and |
2026-03-24 |
| PR |
0.10 |
fix(qwen3_moe): correct return type annotation on Qwen3MoeSp |
|
Short, specific, and technical correctio |
2026-04-09 |
| PR |
0.10 |
Fix: NotebookProgressCallback crash when evaluating with the |
|
Direct description, project reference, c |
2026-03-23 |
| PR |
0.10 |
Parakeet tdt |
|
Technical, domain-specific, includes ref |
2026-02-20 |
| PR |
0.10 |
generation/stopping_criteria: short-circuit StoppingCriteria |
|
Technical explanation, project conventio |
2026-04-12 |
| PR |
0.10 |
Fix NaN weights on non-rank-0 FSDP processes |
|
Technical references, concise and inform |
2026-03-27 |
| PR |
0.10 |
Add PolarQuant backend to QuantizedCache (Hadamard-rotated L |
|
Technical summary, detailed, and uses pr |
2026-04-10 |
| PR |
0.10 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Direct technical fix summary, domain-spe |
2026-04-09 |
| PR |
0.10 |
Make Gemma4ClippableLinear inherit from nn.Linear for PEFT/L |
|
Terse, technical explanation with clear |
2026-04-12 |
| PR |
0.10 |
Less unnecessary RoPE warnings |
|
Terse, domain-specific and issue-linked; |
2026-04-07 |
| PR |
0.10 |
[`Tokenizers`] Move gpt sw3 tokenizer out |
|
Terse, domain-specific context; informal |
2026-04-13 |
| PR |
0.10 |
audio tester class |
|
Concise and domain-specific; incomplete |
2026-04-13 |
| PR |
0.10 |
Add CLIP-like models in conversion to VLMs |
|
Succinct, issue links, domain abbreviati |
2026-04-10 |
| PR |
0.10 |
Modular playground |
|
Bulleted, terse, and casual use of abbre |
2026-02-04 |
| PR |
0.10 |
[WIP][Fix] GLM 5 set `apply_rotary_pos_emb` to `is_neox_styl |
|
Technical, uses comments and abbreviatio |
2026-03-26 |
| PR |
0.10 |
Add heterogeneous model support (per-layer config and modeli |
|
Succinct, technical, uses abbreviation; |
2026-04-09 |
| PR |
0.10 |
Fix unintended Hub metadata calls from _patch_mistral_regex |
|
Technical, references a specific functio |
2026-01-29 |
| PR |
0.10 |
Add heterogeneous config support (per-layer configuration) |
|
Technical, terse, sentence abruptly ends |
2026-04-09 |
| PR |
0.10 |
Add example for iterative chatting with MLLMs |
|
Mentions another user, informal grammar, |
2026-04-13 |
| PR |
0.10 |
Gemma4 resizing per layer inputs |
|
Uses domain jargon and brief project ref |
2026-04-08 |
| PR |
0.10 |
fix: check CUDA availability before calling get_device_capab |
|
Technical, concise, domain-specific and |
2026-04-11 |
| PR |
0.09 |
Add deepseek 3.2 exp |
|
Python code and terse description indica |
2025-10-01 |
| PR |
0.06 |
remove cache file from tree |
|
Brief, informal, domain-specific, human- |
2026-04-13 |
| PR |
0.05 |
[Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel |
|
Informal, concise tone with domain conte |
2026-04-03 |
| PR |
0.05 |
Fix the response schema for the gemma4 converter |
|
Casual, direct explanation and repo-spec |
2026-04-13 |
| PR |
0.05 |
Module Fusion API |
|
Technical jargon, list formatting, not A |
2026-03-24 |
| PR |
0.05 |
fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni |
|
Specific technical references, informal |
2026-04-02 |
| PR |
0.05 |
Add SAM3-LiteText |
|
Domain-specific paper reference and natu |
2026-02-27 |
| PR |
0.05 |
Adds type checking to `src/transformers/*py` |
|
Very brief explanation, technical contex |
2026-04-13 |
| PR |
0.05 |
Require input_ids for repetition penalty |
|
Concise, context-aware phrasing, human e |
2026-04-13 |
| PR |
0.05 |
Generic Sequence Classifier works for multimodal models |
|
Casual tone and technical references, na |
2026-03-13 |
| PR |
0.05 |
Fix Qwen2.5VL temporal grid positions |
|
Informal style, technical aside, emotico |
2026-04-13 |
| PR |
0.05 |
Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is |
|
Reopened issue, terse summary, domain-sp |
2026-04-13 |
| PR |
0.05 |
Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is |
|
Technical explanation and informal voice |
2026-04-13 |
| PR |
0.05 |
perceptron: Isaac-0.1 implementation |
|
Concise summary, technical context, clea |
2025-09-18 |
| PR |
0.05 |
[AMD CI] Fix torch.compile/export failures on AMD CI due to |
|
Casual language and explicit technical d |
2026-04-07 |
| PR |
0.05 |
fix(config): add deepstack_visual_indexes to Qwen3_5MoeVisio |
|
Uses domain-specific terminology and con |
2026-04-11 |
| PR |
0.05 |
Add RF-DETR |
|
Brief, domain-focused, and jargon-filled |
2025-03-21 |
| PR |
0.05 |
[inference_fusion] convert conv3d patch embed to linear |
|
Technical performance discussion in info |
2026-03-27 |
| PR |
0.05 |
Fix ZeRO-3 from_pretrained: load registered buffers in _load |
|
Clear technical detail and informal, bri |
2026-04-13 |
| PR |
0.05 |
Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex |
|
Uses domain-specific terms with terse hu |
2026-04-10 |
| PR |
0.05 |
Fix: ObjectDetectionPipeline batch inference only returns fi |
|
Technical language, includes working cod |
2026-04-03 |
| PR |
0.05 |
fix(serving): resolve rust tokenizer from ProcessorMixin in |
|
Brief, technical, informal in tone and s |
2026-04-11 |
| PR |
0.05 |
Fix ty for transformers cli |
|
Short, minimal, and domain-specific phra |
2026-04-02 |
| PR |
0.05 |
avoid wrap 4bit-quantized model into DP |
|
Direct, informal language with abbreviat |
2026-04-13 |
| PR |
0.05 |
Update `trackio` integration to use Buckets and "freeze" Spa |
|
Concise, technical, domain-jargon heavy |
2026-04-08 |
| PR |
0.05 |
from_pretrained orchestration + distributed save/load |
|
Technical, concise, with unfinished sent |
2026-04-13 |
| PR |
0.05 |
MoE expert parallelism + sequence parallelism |
|
Terse, domain-jargon summary, no AI indi |
2026-04-13 |
| PR |
0.05 |
Adding hierarchical classification example |
|
Informal tone ('i added'), typos; very h |
2026-04-11 |
| PR |
0.05 |
Fix Double Application of Softmax for Router Logits in MoE m |
|
Title only, zero AI-indicative text. |
2026-04-09 |
| PR |
0.05 |
fix(clipseg): auto-fix failing tests |
|
Changelog style; no AI markers or phrasi |
2026-04-13 |
| PR |
0.05 |
fix(x_clip): auto-fix failing tests |
|
Bugfix changelog style, no evidence of A |
2026-04-13 |
| PR |
0.05 |
throw error when conversion required |
|
Very terse and informal, typical of huma |
2026-03-27 |
| PR |
0.05 |
typing: rule 15 - checks for tie_word_embeddings presence |
|
Brief, specific, domain-focused content; |
2026-03-25 |
| PR |
0.05 |
Add qwen3 tts |
|
Domain-specific and concise, lacks AI-st |
2026-03-07 |
| PR |
0.05 |
[Draft] Add Llasa TTS family of models |
|
Lists specific models, references concre |
2025-07-29 |
| PR |
0.05 |
Implement VibeVoice |
|
Direct reference to model repo and HF li |
2025-08-29 |
| PR |
0.05 |
Add VibeVoice Realtime |
|
References dependency PR and user handle |
2025-12-10 |
| PR |
0.05 |
Adding Omnilingual ASR models |
|
Checklist, repo links, domain specificit |
2026-01-13 |
| PR |
0.05 |
Proposal to add Qwen3-ASR support [WIP] |
|
Direct, domain-specific, WIP label; huma |
2026-02-08 |
| PR |
0.05 |
Add AudioFlamingoNext model |
|
Succinct technical description, domain d |
2026-03-18 |
| PR |
0.05 |
[docs] training on specific hardware |
|
Concise changelog, clear structure, doma |
2026-03-17 |
| PR |
0.05 |
[docs] model testing |
|
Informal, contributor-focused refactorin |
2026-03-31 |
| COMMIT |
0.00 |
Fix `apply_chat_template` crash on `tool_call` messages with |
|
Terse commit messages with domain terms; |
2026-04-13 |
| COMMIT |
0.00 |
Add SAM3-LiteText (#44320) |
|
Very terse, incremental commit messages; |
2026-04-13 |
| COMMIT |
0.00 |
Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is |
|
Detailed explanation with domain context |
2026-04-13 |
| COMMIT |
0.00 |
[AMD CI] Fix torch.compile/export failures on AMD CI due to |
|
Brief, informal commit messages; typical |
2026-04-13 |
| COMMIT |
0.00 |
[inference_fusion] convert conv3d patch embed to linear (#45 |
|
Informal commit messages with technical |
2026-04-13 |
| COMMIT |
0.00 |
Fix #45305 + add regression test GAS (#45349) |
|
Terse, informal tone with inline technic |
2026-04-13 |
| COMMIT |
0.00 |
Update `trackio` integration to use Buckets and "freeze" Spa |
|
Mostly placeholder commit messages, no A |
2026-04-13 |
| COMMIT |
0.00 |
Fix: NotebookProgressCallback crash when evaluating with the |
|
Technical bugfixes and test updates in b |
2026-04-13 |
| COMMIT |
0.00 |
Less unnecessary RoPE warnings (#45289) |
|
Terse, domain-specific commit; no AI sig |
2026-04-13 |
| COMMIT |
0.00 |
[`Tokenizers`] Move gpt sw3 tokenizer out (#45404) |
|
Brief, technical fix; no AI phrasing pre |
2026-04-13 |
| COMMIT |
0.00 |
Fix unintended Hub metadata calls from _patch_mistral_regex |
|
Detailed, technical with domain slang; h |
2026-04-13 |
| COMMIT |
0.00 |
Fix MoE routers returning probabilities instead of logits (# |
|
Technical fix, informal style; clearly h |
2026-04-13 |
| COMMIT |
0.00 |
Fix NaN weights on non-rank-0 FSDP processes (#45050) |
|
Short, domain-specific; lacks AI tone. |
2026-04-13 |
| COMMIT |
0.00 |
remove cache file from tree (#45392) |
|
Very terse commit about cache file; huma |
2026-04-13 |
| COMMIT |
0.00 |
[docs] training on specific hardware (#44799) |
|
Commit messages are terse and minimal, t |
2026-04-10 |
| COMMIT |
0.00 |
[docs] zero + sequence parallelism (#44605) |
|
Very brief, domain-specific shorthand; n |
2026-04-10 |
| COMMIT |
0.00 |
Fix vlm weight mappings (#45358) |
|
Informal tone and shorthand imply human |
2026-04-10 |
| COMMIT |
0.00 |
Copy the template resolution logic from the base apply_chat_ |
|
Direct, informal commit log—no AI phrasi |
2026-04-10 |
| COMMIT |
0.00 |
add kwargs to all methods in the CallbackHandler class (#453 |
|
No text beyond the conventional PR title |
2026-04-10 |
| COMMIT |
0.00 |
Close file handler (#45187) |
|
Terse, technical fix with human co-autho |
2026-04-10 |
| COMMIT |
0.00 |
fix: restore mypy type checking for PreTrainedConfig subclas |
|
Technical summary with explicit changelo |
2026-04-10 |
| COMMIT |
0.00 |
`cohere_asr`: fix device issue for `test_model_parallel_beam |
|
Domain-specific fixes, Signed-off-by and |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in Gemma3ForConditionalGeneration and Gem |
|
Standard patch title and human co-author |
2026-04-10 |
| COMMIT |
0.00 |
fix bug for videomt model device mismatch (#45204) |
|
Domain-specific, includes Signed-off-by; |
2026-04-10 |
| COMMIT |
0.00 |
fix gemma4 gradient accumulation loss and last token incorre |
|
Terse commit messages and domain abbrevi |
2026-04-10 |
| COMMIT |
0.00 |
Logger has `[transformers]` prefix in non-verbose mode (#453 |
|
Very short, casual commit messages sugge |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in AssistantToTargetTranslator.unmap_inpu |
|
Technical problem explanation, domain de |
2026-04-10 |
| COMMIT |
0.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
Technical and precise, contains domain-s |
2026-04-10 |
| COMMIT |
0.00 |
musicflamingo: add test support for Intel XPU device (#45212 |
|
Terse, domain-specific, and includes sig |
2026-04-10 |
| COMMIT |
0.00 |
nomic_bert: make the test suitable for general device. (#452 |
|
Minimal, with only template sign-off and |
2026-04-10 |
| COMMIT |
0.00 |
Skip invalid flash-attn tests for `pi0` model (#45011) |
|
Terse, informal commit messages and incl |
2026-04-10 |
| COMMIT |
0.00 |
Add cuda compatibility check for using `grouped_mm` (#45001) |
|
Terse commit subject; technical co-autho |
2026-04-10 |
| COMMIT |
0.00 |
Load adapter with TP (#45155) |
|
Terse, uses technical abbreviations, no |
2026-04-09 |
| COMMIT |
0.00 |
[docs] tp training (#44613) |
|
Very minimal, domain-specific shorthand, |
2026-04-09 |
| COMMIT |
0.00 |
[docs] training performance (#44342) |
|
Short, informal phrases typical of human |
2026-04-09 |
| COMMIT |
0.00 |
[docs] optimizers, hyperparam search, training features (#44 |
|
Informal list format, domain terms, not |
2026-04-09 |
| COMMIT |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Domain-specific wording, includes co-aut |
2026-04-09 |
| COMMIT |
0.00 |
Use torchvision `decode_image` to load images in the torchv |
|
Succinct, technical, includes co-author, |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Fix device map auto (#45347) |
|
Brief, uses domain terminology, not AI g |
2026-04-09 |
| COMMIT |
0.00 |
Refactor CLIP-like models (#44431) |
|
Terse, casual edits, clear human voice t |
2026-04-09 |
| COMMIT |
0.00 |
refactor: display test duration (#45344) |
|
Minimal, domain-focused, typical of huma |
2026-04-09 |
| COMMIT |
0.00 |
http retries on audio file downloads (#45126) |
|
Technical, brief, informal, no AI-like l |
2026-04-09 |
| COMMIT |
0.00 |
Fix `Wav2Vec2Config.vocab_size` type to allow `None` (#45108 |
|
Succinct commit, human style, includes d |
2026-04-09 |
| COMMIT |
0.00 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Terse, technical, informal and direct co |
2026-04-09 |
| COMMIT |
0.00 |
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate |
|
Technical jargon and fix history, not AI |
2026-04-09 |
| COMMIT |
0.00 |
Add THD support in ESM (#44145) |
|
Sequence of technical steps and signed-o |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Remove all shared weights, and silently skip them d |
|
Short, informal phrases and abbreviation |
2026-04-09 |
| COMMIT |
0.00 |
Fix conversion mappings for vlms (#45340) |
|
Terse and informal with typos, not AI-ge |
2026-04-09 |
| COMMIT |
0.00 |
Fix resize failure caused by zero-sized masks in PP-DocLayou |
|
Casual tone, short phrasing, domain lang |
2026-04-09 |
| COMMIT |
0.00 |
chore: added circleci python script to ruff and ty checkers |
|
Single-sentence changes, abbreviations, |
2026-04-09 |
| COMMIT |
0.00 |
tweak checkers output on errors (#45163) |
|
Succinct, with human-like test fix expla |
2026-04-09 |
| COMMIT |
0.00 |
fix: leak in tokenizer registry for `test_processors` (#4531 |
|
Short, typo ('reigstry'), human casual s |
2026-04-09 |
| COMMIT |
0.00 |
chore: remove test_hub for now (#45337) |
|
Concise, informal commit message typical |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Dissociate kv states sharing from the Cache (#45312 |
|
Terse commit history, domain-specific an |
2026-04-09 |
| COMMIT |
0.00 |
Fix `text-to-speech` pipeline crash when generation config c |
|
Direct, domain-specific commit message w |
2026-04-08 |
| COMMIT |
0.00 |
[docs] pipeline cleanup (#44954) |
|
Extremely terse; typical human doc updat |
2026-04-08 |
| COMMIT |
0.00 |
Add MoE to Gemma4 TP plan (#45219) |
|
Concise, domain-language, signed off by |
2026-04-08 |
| COMMIT |
0.00 |
Fix export for gemma4 and add Integration tests (#45285) |
|
Choppy informal commit breakdowns; clear |
2026-04-08 |
| COMMIT |
0.00 |
[docs] static model rules (#45232) |
|
Minimal, iterative commit style; clearly |
2026-04-08 |
| COMMIT |
0.00 |
fix(security): prevent untrusted users from triggering TRL C |
|
Technical detail, informal, and concise |
2026-04-07 |
| COMMIT |
0.00 |
Fix missing image processors backends (#45165) |
|
Very terse commit summary and message, c |
2026-04-07 |
| COMMIT |
0.00 |
[AMD CI] Fix Qwen2 expectations (#45284) |
|
Terse and informal phrasing; human style |
2026-04-07 |
| COMMIT |
0.00 |
Add `hasattr(torch.backends.cudnn, "conv")` to `conftest.py` |
|
Concise commit message; domain-specific |
2026-04-06 |
| COMMIT |
0.00 |
Fix `Qwen2IntegrationTest` (#45268) |
|
Single word summary; strongly human. |
2026-04-06 |
| COMMIT |
0.00 |
doc: fix TokenizersBackend.convert_to_native_format docstrin |
|
Minimal, template-style docstring fix; n |
2026-04-06 |
| COMMIT |
0.00 |
Fix unexpected TF32 being enabled in testing (#45252) |
|
Minimal, terse commit message; human typ |
2026-04-05 |
| COMMIT |
0.00 |
Fix tf32 issue: set `torch.backends.cudnn.conv.fp32_precisio |
|
Commit message is terse, with typical de |
2026-04-05 |
| COMMIT |
0.00 |
Nvidia CI with `torch 2.11` (#45243) |
|
Short, domain-specific, and informal com |
2026-04-04 |
| COMMIT |
0.00 |
Update `get_test_info.py` (related to tiny model creation) ( |
|
Brief, technical, and uses domain abbrev |
2026-04-04 |
| COMMIT |
0.00 |
More fix for tiny model creation (#45228) |
|
Commit message and bullet points are ter |
2026-04-03 |
| COMMIT |
0.00 |
remove unnecessary entries in some auto model mappings (#452 |
|
Extremely terse commit note; clear human |
2026-04-03 |
| COMMIT |
0.00 |
fix: hf-doc-builder insallation was failing (#45225) |
|
Brief and informal phrasing; no AI trait |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Add per-request logits processors (#45026) |
|
Terse, minimal commit message; typical h |
2026-04-03 |
| COMMIT |
0.00 |
[docs] formatting (#45196) |
|
Sparse, technical note with nonstandard |
2026-04-03 |
| COMMIT |
0.00 |
fix `test_register_result_handler` (#45188) |
|
Very brief commit message, lacks AI hall |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Tweaks to update and minor fixes (#45179) |
|
Bullet style, typos, and brevity point t |
2026-04-03 |
| COMMIT |
0.00 |
Fix pypi release (#45210) |
|
Minimal, direct style, some typos, clear |
2026-04-03 |
| COMMIT |
0.00 |
update to dev version 5.6.0-dev0 |
|
Simple version update note, extremely br |
2026-04-03 |
| COMMIT |
0.00 |
fix(docs): correct gemma4 docs and examples (#45197) |
|
Standard commit message with co-author t |
2026-04-02 |
| COMMIT |
0.00 |
Add Turkish (tr) translation for Get Started section (#45158 |
|
Human style, domain-specific, and inform |
2026-04-02 |
| COMMIT |
0.00 |
[docs] transformers serve (#45174) |
|
Very brief and informal commit messages, |
2026-04-02 |
| COMMIT |
0.00 |
casually dropping the most capable open weights on the plane |
|
Casual, informal tone; no AI signatures; |
2026-04-02 |
| COMMIT |
0.00 |
Internalise the NomicBERT model (#43067) |
|
Structured sequential work, domain-speci |
2026-04-02 |
| COMMIT |
0.00 |
Fix resized LM head weights being overwritten by post_init ( |
|
Technically detailed, with typos and dir |
2026-04-02 |
| COMMIT |
0.00 |
[Qwen3.5 MoE] Add _tp_plan to ForConditionalGeneration (#451 |
|
Technical, domain-specific, and succinct |
2026-04-02 |
| COMMIT |
0.00 |
Fix TypeError: 'NoneType' object is not iterable in Generati |
|
Very brief and specific to bug; human co |
2026-04-02 |
| COMMIT |
0.00 |
fix(models): Fix dtype mismatch in SwitchTransformers and Ti |
|
Uses human-like shorthand and changelog |
2026-04-02 |
| COMMIT |
0.00 |
Generalize gemma vision mask to videos (#45185) |
|
Short, informal, with direct responses t |
2026-04-02 |
| COMMIT |
0.00 |
[misc] fix qwen35 tests: correct the text model type and ski |
|
Concise commit message, domain-specific, |
2026-04-02 |
| COMMIT |
0.00 |
🔒 Pin GitHub Actions to commit SHAs (#45180) |
|
Explicit, terse commit titles; no AI hal |
2026-04-02 |
| COMMIT |
0.00 |
Use doc-builder runnable example for GLM-ASR (#44277) |
|
Informal language and typographical erro |
2026-04-02 |
| COMMIT |
0.00 |
CI] Small T5 expectations updated (#45138) |
|
Extremely terse commit message; clearly |
2026-04-02 |
| PR |
0.00 |
[GGUF] Reduce peak RAM usage by casting dequantized tensors |
|
PR content is technical and concise; no |
2026-04-12 |
| PR |
0.00 |
[WIP] Add DINO DETR Model to HuggingFace Transformers |
|
Domain-specific jargon and terse style; |
2025-03-14 |
| PR |
0.00 |
[serve] Forward `tool_calls`/`tool_call_id` in processor inp |
|
Technical, concise with domain reference |
2026-04-13 |
| PR |
0.00 |
n-to-1 kernel fusion via `KernelConfig` |
|
Informal, concise technical writing; lac |
2026-04-10 |
| PR |
0.00 |
fix(altclip): auto-fix failing tests |
|
Commit is extremely terse with abbreviat |
2026-04-13 |
| PR |
0.00 |
Add xcodec2 model |
|
Contains TODO checkboxes, informal, and |
2026-02-20 |
| PR |
0.00 |
Add dtype config options for Four Over Six |
|
Minimal, domain-specific, informal thank |
2026-04-11 |
| PR |
0.00 |
fix(mistral): guard ReasoningEffort import for older mistral |
|
Uses domain-specific references and a te |
2026-04-11 |