| COMMIT |
1.00 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Explicit mention of AI collaboration: 'd |
2026-05-11 |
| COMMIT |
1.00 |
Add Qwen3.5 support for token classification (#45833) |
|
Commit message contains explicit AI assi |
2026-05-08 |
| COMMIT |
1.00 |
Fix WeightConverter regex incorrectly matching shared_expert |
|
Commit message contains explicit AI assi |
2026-05-06 |
| COMMIT |
1.00 |
Add Granite 4.1 Vision (granite4_vision) (#45597) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Unwrap `text_config` in `AutoModelFor*.from_config` (#45770) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Add EXAONE 4.5 implementations (#45471) |
|
Commit message contains explicit AI assi |
2026-05-04 |
| COMMIT |
1.00 |
Add DeepSeek V4 (#45643) |
|
Commit message contains explicit AI assi |
2026-05-02 |
| COMMIT |
1.00 |
Support for a new Granite-Speech-Plus model (#45695) |
|
Commit message contains explicit AI assi |
2026-04-29 |
| COMMIT |
1.00 |
fixing more typos (#45689) |
|
Commit message contains explicit AI assi |
2026-04-28 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
Add Conformer model |
|
PR body explicitly mentions AI collabora |
2026-05-03 |
| PR |
1.00 |
DeepGEMM BF16 + mixed FP8/FP4 + MegaMoE + refactor |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
fix: ModuleNotFoundError caused by distributed race conditio |
|
PR body explicitly mentions AI collabora |
2026-05-08 |
| PR |
1.00 |
[Cache] Add `Cache.snapshot()` / `Cache.restore(snapshot)` f |
|
PR body explicitly mentions AI collabora |
2026-05-08 |
| PR |
1.00 |
torch.backends.fp32_precision cascade conv/rnn so removing t |
|
PR body explicitly mentions AI collabora |
2026-05-04 |
| PR |
1.00 |
Fix deepseek v4 |
|
PR body explicitly mentions AI collabora |
2026-05-11 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
security: enforce weights_only=True across multiple conversi |
|
PR body explicitly mentions AI collabora |
2026-05-10 |
| PR |
1.00 |
feat: Add GGUF loading support for Llama 4 (text) |
|
PR body explicitly mentions AI collabora |
2026-04-21 |
| PR |
1.00 |
fix: route Granite models to TokenizersBackend to preserve t |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
fix: add HF_USE_MLX opt-out for MLX detection |
|
PR body explicitly mentions AI collabora |
2026-05-09 |
| PR |
0.70 |
🚨 Refactor ViT to updated standards |
|
Phrase 'This PR aims at refactoring...', |
2025-10-17 |
| COMMIT |
0.20 |
Warn about forgetting attention mask functions (#45811) |
|
Somewhat more formal and explanatory, bu |
2026-05-11 |
| PR |
0.20 |
Enhance apply_chat_template to support custom field prefilli |
|
Somewhat more structured, but technical |
2026-05-08 |
| PR |
0.20 |
Add V-JEPA 2.1 inference support |
|
Slightly more formal/structured but stil |
2026-04-17 |
| PR |
0.20 |
add HyperClovaX Vision |
|
Brief domain-specific introduction; slig |
2026-02-27 |
| PR |
0.20 |
TST Run fast PEFT tests in normal CI |
|
Slightly more explanation, but context-s |
2026-04-28 |
| PR |
0.20 |
Fix Gemma4 inputs_embeds OOM during per-layer lookup |
|
Contains technical summary; structure an |
2026-05-11 |
| PR |
0.20 |
Remove deprecation cycle for `cache_position` in masking pri |
|
Technical phrasing, mentions specific is |
2026-05-11 |
| PR |
0.20 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Mildly formal section but technical and |
2026-04-09 |
| PR |
0.20 |
Fix "AttributeError: NewTokenizer has no attribute special_a |
|
Somewhat formal format, but technical an |
2026-04-07 |
| PR |
0.20 |
[Trackio] support trackio gpu logging |
|
Clear technical writing common in PRs; n |
2026-01-09 |
| PR |
0.20 |
fix: AutoConfig reloads wrong class after save_pretrained + |
|
Technical issue explanation, human style |
2026-04-30 |
| PR |
0.20 |
Add FP8 kernel acceleration for compressed-tensors quantized |
|
Technical changelog, domain-specific voc |
2026-04-29 |
| PR |
0.20 |
Add Qwen3.5 support for token classification |
|
Somewhat formal, but includes domain ter |
2026-05-07 |
| COMMIT |
0.10 |
[nemotron_h] respect _no_reinit flag on dt_bias and out_proj |
|
Commit contains domain context and code |
2026-05-01 |
| COMMIT |
0.10 |
Doc translate to Persian(farsi) (#45664) |
|
Slightly formal in part, but domain-spec |
2026-04-30 |
| COMMIT |
0.10 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Somewhat formal wording but still within |
2026-04-28 |
| PR |
0.10 |
Add new model: Kimi2-6 |
|
Casual comments, human uncertainty, not |
2026-04-24 |
| PR |
0.10 |
Fix/pe audio video bugs |
|
Technical bullet points, specific refere |
2026-05-11 |
| PR |
0.10 |
Add Videoprism |
|
Concise, domain-specific, casual tone, m |
2025-08-04 |
| PR |
0.10 |
fix(rope): read original_max_position_embeddings from yarn v |
|
Technical focus, domain terms, no AI-sty |
2026-05-11 |
| PR |
0.10 |
fix(text-generation): use token-level slicing for return_ful |
|
Bug description is technical and concise |
2026-05-09 |
| PR |
0.10 |
End-to-end test of Gemma 3 + FA2 construction |
|
Minimal, technical, with slight informal |
2026-05-03 |
| PR |
0.10 |
[Weight converter] Revert unnecessary changes to `rename_sou |
|
Natural justification, context-specific |
2026-05-07 |
| PR |
0.10 |
Do not keep refs to submodules internally |
|
Direct, technical explanation; lacks AI- |
2026-05-11 |
| PR |
0.10 |
:rotating_light: [`Attn`] Remove all old mask APIs from mode |
|
Direct commit style with technical conte |
2026-02-11 |
| PR |
0.10 |
fix(minicpmv4_6): skip invalid failing tests |
|
Terse, domain-specific, informal with mi |
2026-05-08 |
| PR |
0.10 |
audio tester class |
|
Domain-specific, informal, and uses abbr |
2026-04-13 |
| PR |
0.10 |
fix model parallel issues for deimv2 |
|
Uses code references, informal, clear do |
2026-05-08 |
| PR |
0.10 |
fix: raise ValueError when num_beams × vocab_size exceeds to |
|
Technical explanation, domain-specific w |
2026-05-07 |
| PR |
0.10 |
[CB] [Major] Add tensor paralellism |
|
Uses technical abbreviations, bullet poi |
2026-05-07 |
| PR |
0.10 |
FIX Restore LoRA hotswapping functionality |
|
References PR numbers, test coverage; do |
2026-04-28 |
| PR |
0.10 |
fix(laguna): fix failing tests |
|
Domain-specific explanation and referenc |
2026-05-08 |
| PR |
0.10 |
fix(granite4_vision): auto-fix failing tests |
|
Concise, technical reference, and domain |
2026-05-08 |
| PR |
0.10 |
Fix `tie_word_embeddings` not lifted from `text_config` for |
|
Technical, context-rich, with config ref |
2026-05-09 |
| PR |
0.10 |
WIP: fix(deepseek_v4) correct CSA per-query block mask and c |
|
Concise technical explanation, domain-sp |
2026-05-08 |
| PR |
0.10 |
[Gemma4] Fix SharedKVCache identity loss under FSDP2 cast_fo |
|
Precise, technical language; typical of |
2026-04-30 |
| PR |
0.10 |
Warn about forgetting attention mask functions |
|
Explains implementation detail; phrasing |
2026-05-06 |
| PR |
0.10 |
Add deepseek 3.2 exp |
|
Domain code snippet, terse style, no AI |
2025-10-01 |
| PR |
0.10 |
Fix OOM regression for FSDP2 + cpu_ram_efficient_loading on |
|
Technical detail, natural human changelo |
2026-04-25 |
| PR |
0.10 |
Add heterogeneous model support (per-layer config and modeli |
|
Specific technical explanation, natural |
2026-04-09 |
| PR |
0.10 |
add xpu expectation for lw_detr model |
|
Technical test references, casual addres |
2026-01-19 |
| PR |
0.10 |
handle 1D position_ids for modeling_flash_attention_utils as |
|
Bugfix description, test reference, conc |
2026-01-22 |
| PR |
0.10 |
refactor: replace wildcard imports with explicit imports in |
|
Technical jargon and incomplete sentence |
2026-04-15 |
| PR |
0.10 |
Modularize `ProcessorMixin` into smaller components |
|
Technical tone, domain-specific, no AI i |
2026-04-17 |
| PR |
0.10 |
Streamable chat parsing |
|
Casual, original naming, clear domain en |
2026-05-08 |
| PR |
0.10 |
Add Maximal Update Parametrization (μP) |
|
Contains domain terms (μP), concise, hum |
2026-05-08 |
| PR |
0.10 |
Add xcodec2 model |
|
Uses checkbox TODOs, repo links, informa |
2026-02-20 |
| PR |
0.10 |
:rotating_light: Generic Sequence Classifier works for multi |
|
Uses informal tone, repo-specific phrase |
2026-03-13 |
| PR |
0.10 |
🚨 [Fuyu] Remove FuyuBatchFeature subclass, use BatchFeature |
|
Concise domain-specific phrasing, refere |
2026-05-06 |
| PR |
0.10 |
Fix import error in moe.py by providing explicit schema to c |
|
Technical explanation with specific cont |
2026-05-06 |
| PR |
0.10 |
Parakeet tdt |
|
Brief, technical language; shows domain |
2026-02-20 |
| PR |
0.06 |
[CI] Replace PAT with GitHub App token in repo-consistency-b |
|
Technical details, domain abbreviations, |
2026-05-07 |
| COMMIT |
0.05 |
🚨 Refactor ViT to updated standards (#41693) |
|
Commit messages are terse, use domain te |
2026-05-08 |
| COMMIT |
0.05 |
Add image processors refactor to v5 migration guide (#45556) |
|
Brief, changelog-focused; no AI hallmark |
2026-04-28 |
| COMMIT |
0.05 |
[docs] modular transformers (#45327) |
|
Standard PR commit log, minimal free tex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] dtype (#45659) |
|
Short, lacks AI-generated phrasing or ex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cb memory management (#45587) |
|
Extremely brief chunked log, no AI phras |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cpu offloading (#45660) |
|
Minimal, no formal language or AI stylis |
2026-04-28 |
| COMMIT |
0.05 |
Fix `x_clip`: 8 failed test cases (#45394) |
|
Test fix, highly specific, fully normal |
2026-04-28 |
| PR |
0.05 |
Migrate TF32 API calls to new fp32_precision API |
|
Technical, concise, uses domain terms; n |
2026-05-09 |
| PR |
0.05 |
Require input_ids for repetition penalty |
|
Direct, technical, filled with domain de |
2026-04-13 |
| PR |
0.05 |
Fix undefined 'input' variable |
|
Concise bug fix explanation, domain-jarg |
2026-05-11 |
| PR |
0.05 |
Fix M-RoPE device mismatch in Qwen3VL family under FSDP2 CPU |
|
Includes explicit reference, technical c |
2026-05-09 |
| PR |
0.05 |
Fix slow Trainer path with 4D attention mask |
|
Technical explanation, domain-specific, |
2026-05-08 |
| PR |
0.05 |
Qwen3 ASR and Forced Aligner |
|
Brief, task-focused, and has a checklist |
2026-02-08 |
| PR |
0.05 |
Enhance apply_chat_template to support custom field prefilli |
|
Technical enhancements, detailed, domain |
2026-05-11 |
| PR |
0.05 |
🚨 [ALM] Add base model without head |
|
Terse, motivation noted, uses domain jar |
2026-04-20 |
| PR |
0.05 |
hy_v3: add XPU expectations |
|
Very terse, informal request; clearly hu |
2026-05-09 |
| PR |
0.05 |
Update latest revision for Phi-4-multimodal test |
|
Brief, domain-specific content with a mi |
2026-04-28 |
| PR |
0.05 |
fix(bitsandbytes): implement reverse_op for Bnb4bitDeseriali |
|
Technical subject, domain abbreviations, |
2026-05-02 |
| PR |
0.05 |
feat: add crop() to StaticCache layers for assisted generati |
|
Technical language, domain-specific term |
2026-05-02 |
| PR |
0.03 |
fix: correct typo 'seperate' -> 'separate' in comments acros |
|
Fix description, includes typo correctio |
2026-05-07 |
| COMMIT |
0.00 |
[docs] paper cuts (#45798) |
|
Minimal, informal phrasing with no AI ma |
2026-05-11 |
| COMMIT |
0.00 |
[Weight converter] Revert unnecessary changes to `rename_sou |
|
Informal, pragmatic, and terse—lacks AI |
2026-05-11 |
| COMMIT |
0.00 |
fix(minicpmv4_6): skip invalid failing tests (#45836) |
|
Terse, technical, specific—no AI style d |
2026-05-11 |
| COMMIT |
0.00 |
audio tester class (#45391) |
|
Domain-specific shorthand and brief note |
2026-05-11 |
| COMMIT |
0.00 |
Remove deprecation cycle for inputs embeds (#45885) |
|
Extremely brief, laconic style; no AI si |
2026-05-11 |
| COMMIT |
0.00 |
Remove deprecation cycle for `cache_position` in masking pri |
|
Minimal, informal commit message. |
2026-05-11 |
| COMMIT |
0.00 |
Revert "[deepseek_v4] fix CSA per-query masking in eager pat |
|
Standard revert message; no AI hallmarks |
2026-05-11 |
| COMMIT |
0.00 |
Revert "Merge branch 'main' of github.com:huggingface/transf |
|
Auto-generated revert, template-based fo |
2026-05-11 |
| COMMIT |
0.00 |
[deepseek_v4] fix CSA per-query masking in eager path |
|
In-depth technical detail, human tone, a |
2026-05-11 |
| COMMIT |
0.00 |
fix(rf_detr): fix failing tests (#45845) |
|
Direct, minimal language, issue context, |
2026-05-08 |
| COMMIT |
0.00 |
fix(granite4_vision): auto-fix failing tests (#45844) |
|
Direct technical description, signatures |
2026-05-08 |
| COMMIT |
0.00 |
fix(laguna): fix failing tests (#45842) |
|
Minimal edit, direct cause, no AI hallma |
2026-05-08 |
| COMMIT |
0.00 |
:rotating_light: Generic Sequence Classifier works for multi |
|
Informal, nonstandard grammar, domain-sp |
2026-05-08 |
| COMMIT |
0.00 |
Fix import error in moe.py by providing explicit schema to c |
|
Technical context, precise, realistic er |
2026-05-08 |
| COMMIT |
0.00 |
fix: correct typo 'seperate' -> 'separate' in comments acros |
|
Direct typo fix description, minimal det |
2026-05-08 |
| COMMIT |
0.00 |
🚨 [Fuyu] Remove FuyuBatchFeature subclass, use BatchFeature |
|
Terse style, clear purpose, co-author hu |
2026-05-08 |
| COMMIT |
0.00 |
Keep deleting (#45802) |
|
Extremely terse, informal commit message |
2026-05-08 |
| COMMIT |
0.00 |
Fix gemma4 with multi-gpu setup (#45826) |
|
Brief and informal; no AI hallmarks pres |
2026-05-07 |
| COMMIT |
0.00 |
Add HyperCLOVAX SEED Think 14B (#44956) |
|
Commit trailers but no AI signals; conci |
2026-05-07 |
| COMMIT |
0.00 |
Fix `kernelize()` crash for gpt_oss: missing `@use_kernel_fu |
|
Technical fixes, domain jargon, no AI ph |
2026-05-07 |
| COMMIT |
0.00 |
Cache `merged_typed_dict` to not break `validate_typed_dict` |
|
Direct changelog with domain terms; lack |
2026-05-07 |
| COMMIT |
0.00 |
Get rid of deprecated use_return_dict call. (#45815) |
|
Minimal, terse, human-written style, no |
2026-05-07 |
| COMMIT |
0.00 |
fix(qianfan_ocr): add XPU expectations (#45615) |
|
Technical listing; informal, lacks AI te |
2026-05-07 |
| COMMIT |
0.00 |
Fix shared config mutation issue in flash_attn_from_config ( |
|
Domain-specific, concise, signed-off; no |
2026-05-07 |
| COMMIT |
0.00 |
Add RF-DETR (#36895) |
|
Changelog is technical and terse, lacks |
2026-05-07 |
| COMMIT |
0.00 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Highly technical, multiple fixes, domain |
2026-05-07 |
| COMMIT |
0.00 |
refector: renamed file glob to cache to make it clearer (#45 |
|
Brief, informal commit message indicates |
2026-05-06 |
| COMMIT |
0.00 |
Fix decorator order (#45806) |
|
Very terse, sometimes typoed commit mess |
2026-05-06 |
| COMMIT |
0.00 |
[`Granite 4.1 Vision`] Fixup integration tests (#45805) |
|
Terse commit messages, human abbreviatio |
2026-05-06 |
| COMMIT |
0.00 |
fix: validate special token ids against attribute values (#4 |
|
Technical, concise; shows informal engin |
2026-05-06 |
| COMMIT |
0.00 |
Blockwise mask fn as opt arg in all masking functions (#4547 |
|
Highly informal, fragmented notes; no si |
2026-05-06 |
| COMMIT |
0.00 |
[CB] Refactor any model-related code in a separate class (#4 |
|
Uses domain jargon, informal corrections |
2026-05-06 |
| COMMIT |
0.00 |
fix: forward use_cache kwarg to attention mixer in nemotron_ |
|
Brief technical commit messages with dom |
2026-05-05 |
| COMMIT |
0.00 |
fix: correct spelling in continuous_api docstring (#45749) |
|
Minimal, terse technical correction; typ |
2026-05-05 |
| COMMIT |
0.00 |
Fix link to modular transformers documentation (#45746) |
|
Direct update explanation; clear, concis |
2026-05-05 |
| COMMIT |
0.00 |
Gemma4: fix failed test cases (#45568) |
|
List of technical fixes and updates, sig |
2026-05-05 |
| COMMIT |
0.00 |
First model (#45788) |
|
Casual tone and specific jargon; reflect |
2026-05-05 |
| COMMIT |
0.00 |
Fix CI: Allow more artifacts to be download in CI (#45785) |
|
Debug note and domain-specific context, |
2026-05-05 |
| COMMIT |
0.00 |
Add `concurrency` to `PR CI` workflow file (`pr-ci-caller.ym |
|
Single word commit, terse; standard huma |
2026-05-05 |
| COMMIT |
0.00 |
Reorder decorators for autodoc and dataclass (#45702) |
|
Short, technical log phrases typical of |
2026-05-05 |
| COMMIT |
0.00 |
deepseek r1 distilled tokenizer fix for qwen2 mapping (#4574 |
|
Typo present, informal tone; clear human |
2026-05-05 |
| COMMIT |
0.00 |
fix: Added Mps support in float fallback backends list (#45 |
|
Structured code fixes, informal language |
2026-05-05 |
| COMMIT |
0.00 |
Github Actions PR CI (caller) (#45476) |
|
Commit message is terse and includes dom |
2026-05-04 |
| COMMIT |
0.00 |
make sure we call check_auto in CI (#45775) |
|
Informal tone; domain-specific and conci |
2026-05-04 |
| COMMIT |
0.00 |
Better Grouped GEMM + EP (#45621) |
|
Commit log is informal, terse, and techn |
2026-05-04 |
| COMMIT |
0.00 |
DeepSeek OCR specifies an incorrect tokenizer class on the H |
|
Human-written; issue title is concise an |
2026-05-04 |
| COMMIT |
0.00 |
Fix auto mapping script (#45774) |
|
Extremely terse and informal commit mess |
2026-05-04 |
| COMMIT |
0.00 |
PythonBackend slow tokenizer convert_ids_to_tokens fix (#457 |
|
Commit message is terse and lacks AI-sty |
2026-05-04 |
| COMMIT |
0.00 |
[MINISTRAL3] Fix conversion script yarn's apply_scale suppor |
|
Commit message is terse and technical; n |
2026-05-03 |
| COMMIT |
0.00 |
🚨 Get rid of most Apex references (#45723) |
|
Concise and informal message, domain-typ |
2026-05-01 |
| COMMIT |
0.00 |
fix(utils): Resolve backbone utils test regressions (#45594) |
|
Short, domain-specific commit message; t |
2026-05-01 |
| COMMIT |
0.00 |
[CB] Better overall script and decode bucketting (#45653) |
|
Informal, terse checklist style is human |
2026-05-01 |
| COMMIT |
0.00 |
[docs] model testing (#45152) |
|
Terse commit messages; human style; dom |
2026-04-30 |
| COMMIT |
0.00 |
update dev (#45726) |
|
Brief update message; lacks AI hallmarks |
2026-04-30 |
| COMMIT |
0.00 |
[`OAI Privacy Filter`] Add integration test (#45725) |
|
Short, action-based messages; human comm |
2026-04-30 |
| COMMIT |
0.00 |
Speedup Qwen2VLImageProcessor (#45719) |
|
Technical, terse commit messages; human |
2026-04-30 |
| COMMIT |
0.00 |
Remove dead beam-search dummies from dummy_pt_objects.py (#4 |
|
Single, precise technical commit message |
2026-04-30 |
| COMMIT |
0.00 |
[Model] Add PP-FormulaNet Model Support (#45626) |
|
Terse, domain-specific commits; includes |
2026-04-30 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 10 utility files (#4 |
|
Clear domain context and project-specifi |
2026-04-30 |
| COMMIT |
0.00 |
[serve] cb error (#45691) |
|
Brief commit log format, no AI signals. |
2026-04-29 |
| COMMIT |
0.00 |
Fix trust_remote_code local cache collisions for local model |
|
Technical commit log, concise, no AI hal |
2026-04-29 |
| COMMIT |
0.00 |
Llama3 video fix (#45040) |
|
Standard iterative commit log, domain te |
2026-04-29 |
| COMMIT |
0.00 |
[Fix Phi4 test] Fall back to model config for image processo |
|
Technical changelog, brief summary, no A |
2026-04-29 |
| COMMIT |
0.00 |
Fix custom-module copies inheriting read-only permissions (# |
|
Detailed technical context, not overly f |
2026-04-29 |
| COMMIT |
0.00 |
Python code in model docs (#45608) |
|
Informal, terse commit log, domain-speci |
2026-04-29 |
| COMMIT |
0.00 |
fix failed test cases for blt model (#45596) |
|
Technical commit log, signed off by huma |
2026-04-29 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 3 pipeline files (#4 |
|
Standard technical changelog and co-auth |
2026-04-29 |
| COMMIT |
0.00 |
change got reverted (#45680) |
|
Extremely terse, typical human revert me |
2026-04-28 |
| COMMIT |
0.00 |
fix padding side issue for fast_vlm tests (#45592) |
|
Terse, domain-specific, includes real na |
2026-04-28 |
| COMMIT |
0.00 |
zero_shot_object_detection ValueError fix for python 3.13 (# |
|
Very brief, precise, with clear domain r |
2026-04-28 |
| COMMIT |
0.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback ( |
|
Concise, technical commit message with d |
2026-04-28 |
| COMMIT |
0.00 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Short, informal message; human co-author |
2026-04-28 |
| COMMIT |
0.00 |
No serving in quality docker image (#45677) |
|
Brief commit message, human co-author, n |
2026-04-28 |
| COMMIT |
0.00 |
Laguna XS.2 implementation (#45673) |
|
Minimal title-only commit, no sign of AI |
2026-04-28 |
| COMMIT |
0.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
Structured changelog, domain-specific co |
2026-04-28 |
| COMMIT |
0.00 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Concise, code-centric, domain-specific c |
2026-04-27 |
| COMMIT |
0.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
Terse, technical, and human-typical with |
2026-04-27 |
| COMMIT |
0.00 |
chore(typing): added modeling_utils to ty (#45425) |
|
Informal, uses jargon and review summari |
2026-04-27 |
| COMMIT |
0.00 |
model: Add DEIMv2 to Transformers (#44339) |
|
Uses changelog format, dense with domain |
2026-04-27 |
| COMMIT |
0.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
Detailed description with human-like bug |
2026-04-27 |
| PR |
0.00 |
[docs] ALMModelTest |
|
Single brief phrase filled into template |
2026-05-11 |
| PR |
0.00 |
[new model] Add Zyphra/ZAYA1-8B |
|
Concise, specific model addition; no AI |
2026-05-09 |
| PR |
0.00 |
[docs] decode fast path |
|
Minimal, domain-specific title; clearly |
2026-05-11 |
| PR |
0.00 |
[fix] Add `fatal_error` to `ContinuousBatchingManager` so th |
|
Uses domain-specific references and ters |
2026-05-11 |
| PR |
0.00 |
[docs] paper cuts |
|
Terse description; template with brief h |
2026-05-06 |
| PR |
0.00 |
[docs] contributing |
|
Natural, detailed, human-refactor langua |
2026-04-15 |
| PR |
0.00 |
fix: kosmos2.5: properly expand embeddings table |
|
Issue-specific, domain-detail, natural p |
2026-05-08 |
| PR |
0.00 |
feat(t5gemma2): add Flash Attention 2 support |
|
Extensive domain detail; natural, techni |
2026-05-10 |
| PR |
0.00 |
utils: handle flash_attn missing from importlib packages_dis |
|
Terse, technical fix summary, no AI trai |
2026-04-20 |
| PR |
0.00 |
[Model] Add PP-OCRv6 Text Recognition Models Support |
|
Just a model name addition, no free-text |
2026-05-08 |
| PR |
0.00 |
fix(rf_detr): fix failing tests |
|
No free-text evidence suggesting AI or h |
2026-05-08 |
| PR |
0.00 |
exaone4_5: add XPU expectations |
|
Informal and brief with domain-specific |
2026-05-11 |
| PR |
0.00 |
add DeepSeek-V4-Flash-Base support, also add the testcase(de |
|
Highly terse, uses domain/team tagging, |
2026-05-11 |
| PR |
0.00 |
Remove deprecation cycle for inputs embeds |
|
Very brief, domain-style commit; human t |
2026-05-11 |
| PR |
0.00 |
Add DeepSeek V4 |
|
Very terse summary; domain-specific and |
2026-04-25 |
| PR |
0.00 |
[deepseek_v4] fix CSA per-query masking in eager path |
|
Minimal summary, references an issue, no |
2026-05-11 |
| PR |
0.00 |
qa: speed up dtype regex weight load + reduce dtype tests to |
|
Informal bullet points, domain details, |
2026-04-24 |
| PR |
0.00 |
[deepseek_v4] fix CSA per-query masking in eager path |
|
Duplicate of #3; minimal, informal, no A |
2026-05-11 |
| PR |
0.00 |
TP refactor for FSDP + TP integration |
|
Contains terse technical TODOs and domai |
2026-03-26 |
| PR |
0.00 |
fix |
|
Extremely brief and informal; clearly hu |
2026-05-10 |
| PR |
0.00 |
More robust processor from pretrained |
|
Technical description with domain refere |
2025-12-05 |
| PR |
0.00 |
trainer: clear MPS graph cache after each optimizer step (py |
|
Uses domain slang (MPSGraph) and brief t |
2026-05-07 |
| PR |
0.00 |
Implement VibeVoice |
|
Concise, uses domain-specific references |
2025-08-29 |
| PR |
0.00 |
- |
|
No free-text content provided; template |
2026-05-09 |
| PR |
0.00 |
[docs] adding audio/video processors |
|
Terse, references issue comment, clearly |
2026-05-05 |
| PR |
0.00 |
[docs] chat template |
|
Brief, direct, minimal doc update detail |
2026-05-08 |