| COMMIT |
1.00 |
Fix vllm cis (#45139) |
|
Commit message contains explicit AI assi |
2026-04-08 |
| COMMIT |
1.00 |
Update tiny model creation script (#45241) |
|
Commit message contains explicit AI assi |
2026-04-04 |
| COMMIT |
1.00 |
fix: prefer registered config over remote code in AutoConfig |
|
Commit message contains explicit AI assi |
2026-03-31 |
| COMMIT |
1.00 |
Fix stupid test fetcher (#45140) |
|
Commit message contains explicit AI assi |
2026-03-31 |
| PR |
1.00 |
feat: make timesfm2_5 onnx export compatible |
|
PR body explicitly mentions AI collabora |
2026-04-04 |
| PR |
1.00 |
resize_token_embeddings does not effect to output_embeddings |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
Fix apply_chat_template crash on tool_call messages without |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
Fix #45305 + add regression test GAS |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
add expert parallelism for gemma-4-26B-A4B-it |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
Pxeus |
|
PR body explicitly mentions AI collabora |
2026-04-10 |
| PR |
1.00 |
WIP: Add support for Granite4VisionForConditionalGeneration |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
fix gemma4 gradient accumulation loss and last token incorre |
|
PR body explicitly mentions AI collabora |
2026-04-10 |
| PR |
1.00 |
add Qianfan-OCR model definition |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
Add cuda compatibility check for using `grouped_mm` |
|
PR body explicitly mentions AI collabora |
2026-03-25 |
| PR |
1.00 |
Drop unused Gemma4TextAttention weights when sharing KV Cach |
|
PR body explicitly mentions AI collabora |
2026-04-08 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
Fix MoE routers returning probabilities instead of logits |
|
PR body explicitly mentions AI collabora |
2026-03-30 |
| PR |
1.00 |
fix: liger unnecessarily materializes logits in VRAM during |
|
PR body explicitly mentions AI collabora |
2026-04-06 |
| PR |
0.70 |
🚨 Refactor ViT to updated standards |
|
Phrases like 'This PR aims at' and forma |
2025-10-17 |
| PR |
0.20 |
Refactor CLIP-like models |
|
Casual style, issue references, and proj |
2026-03-04 |
| PR |
0.20 |
add HyperClovaX Vision |
|
Polite greeting but otherwise specific a |
2026-02-27 |
| PR |
0.20 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Technical, concise, slightly formal but |
2026-03-30 |
| PR |
0.20 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Enumerates tests, specifics, and parenth |
2026-03-14 |
| PR |
0.15 |
FSDP2 native support in transformers |
|
Terse informal notes and abbreviations; |
2026-02-17 |
| PR |
0.15 |
Add SAM3-LiteText |
|
Short, to-the-point, mentions fix, human |
2026-02-27 |
| COMMIT |
0.10 |
Fix `SmolVLM` video processor `resize` using wrong interpola |
|
Technical, detailed explanation with dom |
2026-04-06 |
| COMMIT |
0.10 |
empty (#45261) |
|
Casual tone, technical context, some com |
2026-04-06 |
| PR |
0.10 |
Refactor GPT-J output tracing to use standardized decorators |
|
Technical references with informal, trun |
2026-04-10 |
| PR |
0.10 |
[docs] training on specific hardware |
|
Structured changelog, domain terms, not |
2026-03-17 |
| PR |
0.10 |
[Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel |
|
Brief, context-aware, and domain-specifi |
2026-04-03 |
| PR |
0.10 |
Copy the template resolution logic from the base apply_chat_ |
|
Uses shorthand and informal expressions, |
2026-03-30 |
| PR |
0.10 |
[Gemma4] Fix chat template and stop tokens for OpenAI tool c |
|
Jargon, bullet points, technical referen |
2026-04-05 |
| PR |
0.10 |
fix(qwen3_moe): correct return type annotation on Qwen3MoeSp |
|
Brief, technical, and domain-specific la |
2026-04-09 |
| PR |
0.10 |
Fix softmaxing router logits |
|
Clear, specific about errors; tone natur |
2026-04-08 |
| PR |
0.10 |
Fix ByteLevel-BPE tokenizers silently breaking in `LlamaToke |
|
Strong technical jargon and brevity; no |
2026-04-09 |
| PR |
0.10 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Focused, domain-specific fix; language i |
2026-04-09 |
| PR |
0.10 |
Replace deprecated `huggingface-cli` references with `hf` |
|
Direct domain-specific statement without |
2026-04-10 |
| PR |
0.10 |
Configuration insoncistencies |
|
Concise technical content with minor typ |
2026-04-02 |
| PR |
0.10 |
Fix AttributeError in Gemma3ForConditionalGeneration and Gem |
|
Technical, terse PR summary, no AI signa |
2026-04-07 |
| PR |
0.10 |
Generic Sequence Classifier works for multimodal models |
|
Uses domain-specific terms; issue refere |
2026-03-13 |
| PR |
0.10 |
[inference_fusion] convert conv3d patch embed to linear |
|
Technical explanation, domain links; no |
2026-03-27 |
| PR |
0.10 |
Less unnecessary RoPE warnings |
|
Domain references, specific and informal |
2026-04-07 |
| PR |
0.10 |
Fix conversion mappings for vlms |
|
Uses terse phrases, issues, and informal |
2026-04-09 |
| PR |
0.10 |
Add support for H2O cache eviction with LLaMA |
|
Technical jargon, concise phrasing, huma |
2024-12-21 |
| PR |
0.10 |
Conversion for LLM class loading with VLM ckpt |
|
Brief, direct issue references; standard |
2026-04-08 |
| PR |
0.10 |
Fix NaN weights on non-rank-0 FSDP processes |
|
Concise, uses technical language, refere |
2026-03-27 |
| PR |
0.10 |
Load adapter with TP |
|
Brief, domain-specific answer with jargo |
2026-03-31 |
| PR |
0.10 |
[docs] training performance |
|
Bullet points, domain-specific; concise, |
2026-02-27 |
| PR |
0.10 |
Use torchvision `decode_image` to load images in the torchv |
|
Concise, domain-specific; headings templ |
2026-04-02 |
| PR |
0.10 |
fix(ernie4_5_vl_moe): resolve three config loading failures |
|
Terse, error-focused, uses abbreviations |
2026-04-07 |
| PR |
0.10 |
Fix ty for transformers cli |
|
Domain abbreviations, direct writing; no |
2026-04-02 |
| PR |
0.10 |
Gemma4 resizing per layer inputs |
|
Terse, includes shorthand and references |
2026-04-08 |
| PR |
0.10 |
Fix broken HQQ support |
|
Terse bullet points, informal language; |
2026-03-31 |
| PR |
0.10 |
Modular playground |
|
Uses technical shorthand and bullet poin |
2026-02-04 |
| PR |
0.10 |
Add RF-DETR |
|
Terse, direct, domain-specific; no AI ge |
2025-03-21 |
| PR |
0.10 |
fix: dont download artifacts from the test hub |
|
Direct fix, minimal language, friendly r |
2026-04-08 |
| PR |
0.10 |
Refactor OwlViT to modular Transformers |
|
Highly technical, uses jargon and shorth |
2026-03-27 |
| PR |
0.10 |
Trainer: set skip_logits for loss-only eval when liger enabl |
|
Technical context, jargon, precise langu |
2026-03-25 |
| PR |
0.05 |
Fix vlm weight mappings |
|
Terse, technical, specific to ongoing is |
2026-04-10 |
| PR |
0.05 |
Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex |
|
Concise changelog-like, uses domain refe |
2026-04-10 |
| PR |
0.05 |
add kwargs to all methods in the CallbackHandler class |
|
Direct, explains precisely, minimal form |
2026-04-09 |
| PR |
0.05 |
Fix: ObjectDetectionPipeline batch inference only returns fi |
|
Correct, technical explanation, lacks AI |
2026-04-03 |
| PR |
0.05 |
test: add batched inference test for ObjectDetectionPipeline |
|
Technical test description; very focused |
2025-12-27 |
| PR |
0.05 |
Fix batch object detection 31356 |
|
In-line mention of reviewers, direct, te |
2025-07-09 |
| PR |
0.05 |
[PoC] HF exporters |
|
Clearly a technical WIP PoC summary, wit |
2025-11-03 |
| PR |
0.05 |
Fix Zamba2MambaMixer ignoring use_mamba_kernels=False |
|
Direct technical explanation of a bug an |
2026-03-19 |
| PR |
0.05 |
cohere_asr: fix bug for model_parallel_beam_search test case |
|
Technical, direct; bugfix description ty |
2026-04-03 |
| PR |
0.05 |
musicflamingo: add test support for Intel XPU device |
|
Casual review ask, abbreviations; no AI |
2026-04-03 |
| PR |
0.05 |
skip 2 invalid test cases for pi0 model |
|
Informal tone and chat; domain-specific |
2026-03-26 |
| PR |
0.05 |
refactor: display test duration |
|
Brief, technical content with example; i |
2026-04-09 |
| PR |
0.05 |
http retries on audio file downloads |
|
Uses domain terms, concise phrasing, not |
2026-03-30 |
| PR |
0.01 |
TP refactor for FSDP + TP integration |
|
Domain-specific abbreviations and terse |
2026-03-26 |
| PR |
0.01 |
Allow to bypass remote code if we want to try and convert it |
|
Very brief, informal, and minimal explan |
2026-02-09 |
| COMMIT |
0.00 |
[docs] training on specific hardware (#44799) |
|
Commit messages are terse and minimal, t |
2026-04-10 |
| COMMIT |
0.00 |
[docs] zero + sequence parallelism (#44605) |
|
Very brief, domain-specific shorthand; n |
2026-04-10 |
| COMMIT |
0.00 |
Fix vlm weight mappings (#45358) |
|
Informal tone and shorthand imply human |
2026-04-10 |
| COMMIT |
0.00 |
Copy the template resolution logic from the base apply_chat_ |
|
Direct, informal commit log—no AI phrasi |
2026-04-10 |
| COMMIT |
0.00 |
add kwargs to all methods in the CallbackHandler class (#453 |
|
No text beyond the conventional PR title |
2026-04-10 |
| COMMIT |
0.00 |
Close file handler (#45187) |
|
Terse, technical fix with human co-autho |
2026-04-10 |
| COMMIT |
0.00 |
fix: restore mypy type checking for PreTrainedConfig subclas |
|
Technical summary with explicit changelo |
2026-04-10 |
| COMMIT |
0.00 |
`cohere_asr`: fix device issue for `test_model_parallel_beam |
|
Domain-specific fixes, Signed-off-by and |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in Gemma3ForConditionalGeneration and Gem |
|
Standard patch title and human co-author |
2026-04-10 |
| COMMIT |
0.00 |
fix bug for videomt model device mismatch (#45204) |
|
Domain-specific, includes Signed-off-by; |
2026-04-10 |
| COMMIT |
0.00 |
fix gemma4 gradient accumulation loss and last token incorre |
|
Terse commit messages and domain abbrevi |
2026-04-10 |
| COMMIT |
0.00 |
Logger has `[transformers]` prefix in non-verbose mode (#453 |
|
Very short, casual commit messages sugge |
2026-04-10 |
| COMMIT |
0.00 |
Fix AttributeError in AssistantToTargetTranslator.unmap_inpu |
|
Technical problem explanation, domain de |
2026-04-10 |
| COMMIT |
0.00 |
Fix Qwen2.5-VL temporal RoPE scaling applied to still images |
|
Technical and precise, contains domain-s |
2026-04-10 |
| COMMIT |
0.00 |
musicflamingo: add test support for Intel XPU device (#45212 |
|
Terse, domain-specific, and includes sig |
2026-04-10 |
| COMMIT |
0.00 |
nomic_bert: make the test suitable for general device. (#452 |
|
Minimal, with only template sign-off and |
2026-04-10 |
| COMMIT |
0.00 |
Skip invalid flash-attn tests for `pi0` model (#45011) |
|
Terse, informal commit messages and incl |
2026-04-10 |
| COMMIT |
0.00 |
Add cuda compatibility check for using `grouped_mm` (#45001) |
|
Terse commit subject; technical co-autho |
2026-04-10 |
| COMMIT |
0.00 |
Load adapter with TP (#45155) |
|
Terse, uses technical abbreviations, no |
2026-04-09 |
| COMMIT |
0.00 |
[docs] tp training (#44613) |
|
Very minimal, domain-specific shorthand, |
2026-04-09 |
| COMMIT |
0.00 |
[docs] training performance (#44342) |
|
Short, informal phrases typical of human |
2026-04-09 |
| COMMIT |
0.00 |
[docs] optimizers, hyperparam search, training features (#44 |
|
Informal list format, domain terms, not |
2026-04-09 |
| COMMIT |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Domain-specific wording, includes co-aut |
2026-04-09 |
| COMMIT |
0.00 |
Use torchvision `decode_image` to load images in the torchv |
|
Succinct, technical, includes co-author, |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Fix device map auto (#45347) |
|
Brief, uses domain terminology, not AI g |
2026-04-09 |
| COMMIT |
0.00 |
Refactor CLIP-like models (#44431) |
|
Terse, casual edits, clear human voice t |
2026-04-09 |
| COMMIT |
0.00 |
refactor: display test duration (#45344) |
|
Minimal, domain-focused, typical of huma |
2026-04-09 |
| COMMIT |
0.00 |
http retries on audio file downloads (#45126) |
|
Technical, brief, informal, no AI-like l |
2026-04-09 |
| COMMIT |
0.00 |
Fix `Wav2Vec2Config.vocab_size` type to allow `None` (#45108 |
|
Succinct commit, human style, includes d |
2026-04-09 |
| COMMIT |
0.00 |
fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash tes |
|
Terse, technical, informal and direct co |
2026-04-09 |
| COMMIT |
0.00 |
[Qwen3_5]Remove unnecessary masked_fill_ in torch_chunk_gate |
|
Technical jargon and fix history, not AI |
2026-04-09 |
| COMMIT |
0.00 |
Add THD support in ESM (#44145) |
|
Sequence of technical steps and signed-o |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Remove all shared weights, and silently skip them d |
|
Short, informal phrases and abbreviation |
2026-04-09 |
| COMMIT |
0.00 |
Fix conversion mappings for vlms (#45340) |
|
Terse and informal with typos, not AI-ge |
2026-04-09 |
| COMMIT |
0.00 |
Fix resize failure caused by zero-sized masks in PP-DocLayou |
|
Casual tone, short phrasing, domain lang |
2026-04-09 |
| COMMIT |
0.00 |
chore: added circleci python script to ruff and ty checkers |
|
Single-sentence changes, abbreviations, |
2026-04-09 |
| COMMIT |
0.00 |
tweak checkers output on errors (#45163) |
|
Succinct, with human-like test fix expla |
2026-04-09 |
| COMMIT |
0.00 |
fix: leak in tokenizer registry for `test_processors` (#4531 |
|
Short, typo ('reigstry'), human casual s |
2026-04-09 |
| COMMIT |
0.00 |
chore: remove test_hub for now (#45337) |
|
Concise, informal commit message typical |
2026-04-09 |
| COMMIT |
0.00 |
[gemma4] Dissociate kv states sharing from the Cache (#45312 |
|
Terse commit history, domain-specific an |
2026-04-09 |
| COMMIT |
0.00 |
Fix `text-to-speech` pipeline crash when generation config c |
|
Direct, domain-specific commit message w |
2026-04-08 |
| COMMIT |
0.00 |
[docs] pipeline cleanup (#44954) |
|
Extremely terse; typical human doc updat |
2026-04-08 |
| COMMIT |
0.00 |
Add MoE to Gemma4 TP plan (#45219) |
|
Concise, domain-language, signed off by |
2026-04-08 |
| COMMIT |
0.00 |
Fix export for gemma4 and add Integration tests (#45285) |
|
Choppy informal commit breakdowns; clear |
2026-04-08 |
| COMMIT |
0.00 |
[docs] static model rules (#45232) |
|
Minimal, iterative commit style; clearly |
2026-04-08 |
| COMMIT |
0.00 |
fix(security): prevent untrusted users from triggering TRL C |
|
Technical detail, informal, and concise |
2026-04-07 |
| COMMIT |
0.00 |
Fix missing image processors backends (#45165) |
|
Very terse commit summary and message, c |
2026-04-07 |
| COMMIT |
0.00 |
[AMD CI] Fix Qwen2 expectations (#45284) |
|
Terse and informal phrasing; human style |
2026-04-07 |
| COMMIT |
0.00 |
Add `hasattr(torch.backends.cudnn, "conv")` to `conftest.py` |
|
Concise commit message; domain-specific |
2026-04-06 |
| COMMIT |
0.00 |
Fix `Qwen2IntegrationTest` (#45268) |
|
Single word summary; strongly human. |
2026-04-06 |
| COMMIT |
0.00 |
doc: fix TokenizersBackend.convert_to_native_format docstrin |
|
Minimal, template-style docstring fix; n |
2026-04-06 |
| COMMIT |
0.00 |
Fix unexpected TF32 being enabled in testing (#45252) |
|
Minimal, terse commit message; human typ |
2026-04-05 |
| COMMIT |
0.00 |
Fix tf32 issue: set `torch.backends.cudnn.conv.fp32_precisio |
|
Commit message is terse, with typical de |
2026-04-05 |
| COMMIT |
0.00 |
Nvidia CI with `torch 2.11` (#45243) |
|
Short, domain-specific, and informal com |
2026-04-04 |
| COMMIT |
0.00 |
Update `get_test_info.py` (related to tiny model creation) ( |
|
Brief, technical, and uses domain abbrev |
2026-04-04 |
| COMMIT |
0.00 |
More fix for tiny model creation (#45228) |
|
Commit message and bullet points are ter |
2026-04-03 |
| COMMIT |
0.00 |
remove unnecessary entries in some auto model mappings (#452 |
|
Extremely terse commit note; clear human |
2026-04-03 |
| COMMIT |
0.00 |
fix: hf-doc-builder insallation was failing (#45225) |
|
Brief and informal phrasing; no AI trait |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Add per-request logits processors (#45026) |
|
Terse, minimal commit message; typical h |
2026-04-03 |
| COMMIT |
0.00 |
[docs] formatting (#45196) |
|
Sparse, technical note with nonstandard |
2026-04-03 |
| COMMIT |
0.00 |
fix `test_register_result_handler` (#45188) |
|
Very brief commit message, lacks AI hall |
2026-04-03 |
| COMMIT |
0.00 |
[CB] Tweaks to update and minor fixes (#45179) |
|
Bullet style, typos, and brevity point t |
2026-04-03 |
| COMMIT |
0.00 |
Fix pypi release (#45210) |
|
Minimal, direct style, some typos, clear |
2026-04-03 |
| COMMIT |
0.00 |
update to dev version 5.6.0-dev0 |
|
Simple version update note, extremely br |
2026-04-03 |
| COMMIT |
0.00 |
fix(docs): correct gemma4 docs and examples (#45197) |
|
Standard commit message with co-author t |
2026-04-02 |
| COMMIT |
0.00 |
Add Turkish (tr) translation for Get Started section (#45158 |
|
Human style, domain-specific, and inform |
2026-04-02 |
| COMMIT |
0.00 |
[docs] transformers serve (#45174) |
|
Very brief and informal commit messages, |
2026-04-02 |
| COMMIT |
0.00 |
casually dropping the most capable open weights on the plane |
|
Casual, informal tone; no AI signatures; |
2026-04-02 |
| COMMIT |
0.00 |
Internalise the NomicBERT model (#43067) |
|
Structured sequential work, domain-speci |
2026-04-02 |
| COMMIT |
0.00 |
Fix resized LM head weights being overwritten by post_init ( |
|
Technically detailed, with typos and dir |
2026-04-02 |
| COMMIT |
0.00 |
[Qwen3.5 MoE] Add _tp_plan to ForConditionalGeneration (#451 |
|
Technical, domain-specific, and succinct |
2026-04-02 |
| COMMIT |
0.00 |
Fix TypeError: 'NoneType' object is not iterable in Generati |
|
Very brief and specific to bug; human co |
2026-04-02 |
| COMMIT |
0.00 |
fix(models): Fix dtype mismatch in SwitchTransformers and Ti |
|
Uses human-like shorthand and changelog |
2026-04-02 |
| COMMIT |
0.00 |
Generalize gemma vision mask to videos (#45185) |
|
Short, informal, with direct responses t |
2026-04-02 |
| COMMIT |
0.00 |
[misc] fix qwen35 tests: correct the text model type and ski |
|
Concise commit message, domain-specific, |
2026-04-02 |
| COMMIT |
0.00 |
🔒 Pin GitHub Actions to commit SHAs (#45180) |
|
Explicit, terse commit titles; no AI hal |
2026-04-02 |
| COMMIT |
0.00 |
Use doc-builder runnable example for GLM-ASR (#44277) |
|
Informal language and typographical erro |
2026-04-02 |
| COMMIT |
0.00 |
CI] Small T5 expectations updated (#45138) |
|
Extremely terse commit message; clearly |
2026-04-02 |
| COMMIT |
0.00 |
fix: correct type annotations across config classes for @str |
|
Contains domain-specific detail, terse s |
2026-04-01 |
| COMMIT |
0.00 |
Fix explicit local code resolution for tokenizers and image |
|
Technical, uses signing trailer but no A |
2026-04-01 |
| COMMIT |
0.00 |
Fix T5Attention shape mismatch under Tensor Parallelism (#45 |
|
Technical explanation, review references |
2026-04-01 |
| COMMIT |
0.00 |
[refactor] Serving into proper modules (#44796) |
|
Commit history is terse, informal, and t |
2026-04-01 |
| COMMIT |
0.00 |
Re-add regex substitutions to the response parsing spec (#45 |
|
Informal changelog, abbreviations, and t |
2026-04-01 |
| COMMIT |
0.00 |
fix bug for janus model image generation (#45044) |
|
Informal, domain-specific, signed by hum |
2026-04-01 |
| COMMIT |
0.00 |
Fix incorrect TrainingArguments example in training.md (#451 |
|
Brief commit messages, domain-specific, |
2026-03-31 |
| COMMIT |
0.00 |
Add parse_response to Processor, make it a bit more official |
|
Direct, minimal phrasing; no AI indicato |
2026-03-31 |
| COMMIT |
0.00 |
DeepGEMM (#44832) |
|
Informal, technical jargon, no sign of A |
2026-03-31 |
| COMMIT |
0.00 |
🚨 [Cache] Native mamba & hybrid cache (#44950) |
|
Casual commit style, lots of terse fixes |
2026-03-31 |
| COMMIT |
0.00 |
[serving] Fix continuous batching JSON response serializatio |
|
Detailed but technical, regression test |
2026-03-31 |
| COMMIT |
0.00 |
refactoring: speedup static checks with disk cache (#44992) |
|
Terse, to-the-point, refactoring context |
2026-03-31 |
| COMMIT |
0.00 |
:rotating_light: [`LightGlue`] Remove remote code execution |
|
Minimal commit messages, no AI hallmarks |
2026-03-31 |
| COMMIT |
0.00 |
[CB] Add warmup feature (#45112) |
|
Brief, iterative commits, domain terms, |
2026-03-31 |
| COMMIT |
0.00 |
feature: added import complexity checker (#45013) |
|
Simple feature summary, short updates, h |
2026-03-31 |
| COMMIT |
0.00 |
Fix tests for `janus` model (#44739) |
|
Issue/PR style with signed-off trailer, |
2026-03-31 |
| PR |
0.00 |
Fix tf32 issue: set `torch.backends.cudnn.conv.fp32_precisio |
|
Template plus human-written technical co |
2026-04-05 |
| PR |
0.00 |
Fix OLMoE routing and Mistral4 RoPE dimensions |
|
Template structure; technical explanatio |
2026-04-10 |
| PR |
0.00 |
Fused kernels support |
|
Very brief; uses typical terse human PR |
2026-04-10 |
| PR |
0.00 |
Add Videoprism |
|
Contains casual tone, model and repo ref |
2025-08-04 |
| PR |
0.00 |
Add PolarQuant backend to QuantizedCache (Hadamard-rotated L |
|
Domain-specific terms and human summary |
2026-04-10 |
| PR |
0.00 |
Add CLIP-like models in conversion to VLMs |
|
Casual language (e.g., 'lemme'), informa |
2026-04-10 |
| PR |
0.00 |
Add Deepseek-OCR-2 model |
|
Direct, technical, and references specif |
2026-03-27 |
| PR |
0.00 |
Update `trackio` integration to use Buckets and "freeze" Spa |
|
Technical, standard PR phraseology witho |
2026-04-08 |
| PR |
0.00 |
Fix AttributeError in AssistantToTargetTranslator.unmap_inpu |
|
Technical exception; casual use of templ |
2026-04-08 |
| PR |
0.00 |
Logger has `[transformers]` prefix in non-verbose mode |
|
Casual phrasing, brief and references co |
2026-04-08 |
| PR |
0.00 |
typing: rule 15 - checks for tie_word_embeddings presence |
|
Domain-specific rule, technical language |
2026-03-25 |
| PR |
0.00 |
Dynamic auto mapping (PoC) |
|
Reflective, personal explanation typical |
2026-03-26 |
| PR |
0.00 |
[docs] zero + sequence parallelism |
|
Technical summarization, clear manual st |
2026-03-11 |
| PR |
0.00 |
Fix: NotebookProgressCallback crash when evaluating with the |
|
Bug fix with issue reference, human-like |
2026-03-23 |
| PR |
0.00 |
Close file handler |
|
Very brief, domain-specific; lacks AI-li |
2026-04-02 |
| PR |
0.00 |
fix: restore mypy type checking for PreTrainedConfig subclas |
|
Explains technical bug fix with domain-s |
2026-04-04 |
| PR |
0.00 |
fix(videomt): auto-fix failing tests |
|
No free-text content provided to judge. |
2026-04-07 |
| PR |
0.00 |
fix(nomic_bert): auto-fix failing tests |
|
No free-text content provided. |
2026-04-07 |
| PR |
0.00 |
fix(cohere_asr): auto-fix failing tests |
|
No free-text content provided. |
2026-04-07 |
| PR |
0.00 |
fix bug for videomt model device mismatch |
|
Informal and terse, uses shorthand and d |
2026-04-03 |
| PR |
0.00 |
Fix gemma4 has flash-attention incompatbile head-dim=512 |
|
Contains informal tone and direct admiss |
2026-04-02 |
| PR |
0.00 |
Add HyperCLOVAX SEED Think 14B |
|
Rich technical details and formatting; h |
2026-03-23 |
| PR |
0.00 |
nomic_bert: make the test suitable for general device. |
|
Very brief, direct, and informal with '@ |
2026-04-03 |
| PR |
0.00 |
[WIP][Fix] GLM 5 set `apply_rotary_pos_emb` to `is_neox_styl |
|
Uses technical jargon, WIP marker, and i |
2026-03-26 |
| PR |
0.00 |
[docs] tp training |
|
Very brief, to-the-point; lacking AI hal |
2026-03-11 |
| PR |
0.00 |
[docs] optimizers, hyperparam search, training features |
|
Brief, domain-specific, non-formal chang |
2026-02-26 |
| PR |
0.00 |
Add AudioFlamingoNext model |
|
Terse changelog, references issue and sp |
2026-03-18 |
| PR |
0.00 |
Remove unused parameters and improve add_tensor_parallel_hoo |
|
Lists concise technical changes in non-f |
2026-03-16 |
| PR |
0.00 |
[docs] model testing |
|
Informal tone, minimal capitalisation; c |
2026-03-31 |
| PR |
0.00 |
Fix Double Application of Softmax for Router Logits in MoE m |
|
Contains only a technical title; no free |
2026-04-09 |
| PR |
0.00 |
[gemma4] Fix device map auto |
|
Technical phrasing, domain details, info |
2026-04-09 |
| PR |
0.00 |
Add GGUF support to Gemma4 (31B & 26B-A4B) text |
|
Most text is template, no AI-style conte |
2026-04-07 |