| COMMIT |
1.00 |
Add Qwen3.5 support for token classification (#45833) |
|
Commit message contains explicit AI assi |
2026-05-08 |
| COMMIT |
1.00 |
Fix WeightConverter regex incorrectly matching shared_expert |
|
Commit message contains explicit AI assi |
2026-05-06 |
| COMMIT |
1.00 |
Add Granite 4.1 Vision (granite4_vision) (#45597) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Unwrap `text_config` in `AutoModelFor*.from_config` (#45770) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Add EXAONE 4.5 implementations (#45471) |
|
Commit message contains explicit AI assi |
2026-05-04 |
| COMMIT |
1.00 |
Add DeepSeek V4 (#45643) |
|
Commit message contains explicit AI assi |
2026-05-02 |
| COMMIT |
1.00 |
Support for a new Granite-Speech-Plus model (#45695) |
|
Commit message contains explicit AI assi |
2026-04-29 |
| COMMIT |
1.00 |
fixing more typos (#45689) |
|
Commit message contains explicit AI assi |
2026-04-28 |
| PR |
1.00 |
security: enforce weights_only=True across multiple conversi |
|
PR body explicitly mentions AI collabora |
2026-05-10 |
| PR |
1.00 |
feat: Add GGUF loading support for Llama 4 (text) |
|
PR body explicitly mentions AI collabora |
2026-04-21 |
| PR |
1.00 |
fix: add HF_USE_MLX opt-out for MLX detection |
|
PR body explicitly mentions AI collabora |
2026-05-09 |
| PR |
1.00 |
[Cache] Add `Cache.snapshot()` / `Cache.restore(snapshot)` f |
|
PR body explicitly mentions AI collabora |
2026-05-08 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
fix: ModuleNotFoundError caused by distributed race conditio |
|
PR body explicitly mentions AI collabora |
2026-05-08 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
fix: ModuleNotFoundError caused by distributed race conditio |
|
PR body explicitly mentions AI collabora |
2026-05-05 |
| PR |
1.00 |
torch.backends.fp32_precision cascade conv/rnn so removing t |
|
PR body explicitly mentions AI collabora |
2026-05-04 |
| PR |
1.00 |
Cache `merged_typed_dict` to not break `validate_typed_dict` |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
WIP: Add support for Granite4VisionForConditionalGeneration |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
fix: use per-instance lru_cache in compile_compatible_method |
|
PR body explicitly mentions AI collabora |
2026-05-07 |
| PR |
1.00 |
fix: route Granite models to TokenizersBackend to preserve t |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
Add pretrained model-family distributed smoke matrix |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
0.70 |
🚨 Refactor ViT to updated standards |
|
Phrase 'This PR aims at refactoring...', |
2025-10-17 |
| PR |
0.60 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Uses AI-typical phrasing like 'This PR a |
2026-04-27 |
| PR |
0.20 |
[Trackio] support trackio gpu logging |
|
Clear technical writing common in PRs; n |
2026-01-09 |
| PR |
0.20 |
fix: AutoConfig reloads wrong class after save_pretrained + |
|
Technical issue explanation, human style |
2026-04-30 |
| PR |
0.20 |
Add FP8 kernel acceleration for compressed-tensors quantized |
|
Technical changelog, domain-specific voc |
2026-04-29 |
| PR |
0.20 |
fix(text-generation): use token-level slicing for return_ful |
|
Bug explanation with jargon, conversatio |
2026-05-09 |
| PR |
0.20 |
Enhance apply_chat_template to support custom field prefilli |
|
Neutral technical writing, some formalit |
2026-05-08 |
| PR |
0.20 |
Add Qwen3.5 support for token classification |
|
Somewhat formal, but includes domain ter |
2026-05-07 |
| PR |
0.15 |
Improve explanation of pipelines for beginners |
|
Direct, minimal, clearly aimed at beginn |
2026-05-06 |
| PR |
0.15 |
Add RF-DETR |
|
List of tasks, domain-specific name, mod |
2025-03-21 |
| PR |
0.13 |
add HyperClovaX Vision |
|
Opening greeting, but technical context |
2026-02-27 |
| COMMIT |
0.10 |
[nemotron_h] respect _no_reinit flag on dt_bias and out_proj |
|
Commit contains domain context and code |
2026-05-01 |
| COMMIT |
0.10 |
Doc translate to Persian(farsi) (#45664) |
|
Slightly formal in part, but domain-spec |
2026-04-30 |
| COMMIT |
0.10 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Somewhat formal wording but still within |
2026-04-28 |
| PR |
0.10 |
fix model parallel issues for deimv2 |
|
Terse, technical content; lacks AI hallm |
2026-05-08 |
| PR |
0.10 |
[new model] Add Zyphra/ZAYA1-8B |
|
Domain-specific, concise explanation; no |
2026-05-09 |
| PR |
0.10 |
[Model] Add PP-OCRv6 Text Recognition Models Support |
|
Terse content, domain-specific, human re |
2026-05-08 |
| PR |
0.10 |
add xpu expectation for lw_detr model |
|
Technical test references, casual addres |
2026-01-19 |
| PR |
0.10 |
handle 1D position_ids for modeling_flash_attention_utils as |
|
Bugfix description, test reference, conc |
2026-01-22 |
| PR |
0.10 |
refactor: replace wildcard imports with explicit imports in |
|
Technical jargon and incomplete sentence |
2026-04-15 |
| PR |
0.10 |
fix(minicpmv4_6): skip invalid failing tests |
|
Direct bug-fix language, informal, human |
2026-05-08 |
| PR |
0.10 |
Qwen3 ASR and Forced Aligner |
|
Brief technical content, direct, no AI s |
2026-02-08 |
| PR |
0.10 |
Fix slow Trainer path with 4D attention mask |
|
Concise, domain jargon, human technical |
2026-05-08 |
| PR |
0.10 |
Modularize `ProcessorMixin` into smaller components |
|
Technical tone, domain-specific, no AI i |
2026-04-17 |
| PR |
0.10 |
Streamable chat parsing |
|
Casual, original naming, clear domain en |
2026-05-08 |
| PR |
0.10 |
FIX Restore LoRA hotswapping functionality |
|
Direct technical explanation, clear issu |
2026-04-28 |
| PR |
0.10 |
Add Maximal Update Parametrization (μP) |
|
Contains domain terms (μP), concise, hum |
2026-05-08 |
| PR |
0.10 |
fix(granite4_vision): auto-fix failing tests |
|
Technical details, abbreviations, natura |
2026-05-08 |
| PR |
0.10 |
fix(laguna): fix failing tests |
|
Explains bug fix with repo jargon, infor |
2026-05-08 |
| PR |
0.10 |
Add xcodec2 model |
|
Uses checkbox TODOs, repo links, informa |
2026-02-20 |
| PR |
0.10 |
:rotating_light: Generic Sequence Classifier works for multi |
|
Uses informal tone, repo-specific phrase |
2026-03-13 |
| PR |
0.10 |
🚨 [Fuyu] Remove FuyuBatchFeature subclass, use BatchFeature |
|
Concise domain-specific phrasing, refere |
2026-05-06 |
| PR |
0.10 |
Fix import error in moe.py by providing explicit schema to c |
|
Technical explanation with specific cont |
2026-05-06 |
| PR |
0.10 |
Parakeet tdt |
|
Brief, technical language; shows domain |
2026-02-20 |
| PR |
0.10 |
fix: kosmos2.5: properly expand embeddings table |
|
Contains specific error message, informa |
2026-05-08 |
| PR |
0.10 |
WIP: fix(deepseek_v4) correct CSA per-query block mask and c |
|
Domain-specific descriptions and abbrevi |
2026-05-08 |
| PR |
0.10 |
Add DeepSeek V4 |
|
Short direct draft note; reviews have in |
2026-04-25 |
| PR |
0.10 |
Fix WeightConverter regex incorrectly matching shared_expert |
|
Issue is described precisely using techn |
2026-05-05 |
| PR |
0.10 |
Gemma4: fix failed test cases |
|
Bullet points, informal typo, direct ref |
2026-04-22 |
| PR |
0.10 |
Fix shared config mutation issue in flash_attn_from_config |
|
Technical bug description and domain jar |
2026-04-28 |
| PR |
0.10 |
feat: add bf16_loss training argument for VRAM-efficient QLo |
|
Domain-specific jargon and concise techn |
2026-05-03 |
| PR |
0.10 |
[CB] [Major] Add tensor paralellism |
|
Uses TP jargon and terse descriptions, n |
2026-05-07 |
| PR |
0.10 |
Get rid of deprecated use_return_dict call. |
|
Casual tone and specific error context, |
2026-05-07 |
| PR |
0.10 |
Modular playground |
|
Domain details, concise listing, lacks A |
2026-02-04 |
| PR |
0.10 |
Fix "AttributeError: NewTokenizer has no attribute special_a |
|
Technical problem explanation, domain ab |
2026-04-07 |
| PR |
0.10 |
Restore TokenizersBackend override for DeepSeek V3/R1 tokeni |
|
Direct technical problem, jargon, no exc |
2026-04-28 |
| PR |
0.10 |
Fix `torch.compile` graph breaks in Qwen2.5-Omni vision enco |
|
Technical references, concise error expl |
2026-05-07 |
| PR |
0.08 |
Add new model: Kimi2-6 |
|
Casual tone and domain-specific context |
2026-04-24 |
| PR |
0.07 |
Extended n-to-1 kernel fusion via `KernelConfig` |
|
Detailed technical explanation, informal |
2026-04-27 |
| PR |
0.06 |
[CI] Replace PAT with GitHub App token in repo-consistency-b |
|
Technical details, domain abbreviations, |
2026-05-07 |
| PR |
0.06 |
Fix EP + DeepSpeed ZeRO-3 loading via accelerate launch |
|
Domain abbreviations and concise summary |
2026-04-21 |
| COMMIT |
0.05 |
🚨 Refactor ViT to updated standards (#41693) |
|
Commit messages are terse, use domain te |
2026-05-08 |
| COMMIT |
0.05 |
Add image processors refactor to v5 migration guide (#45556) |
|
Brief, changelog-focused; no AI hallmark |
2026-04-28 |
| COMMIT |
0.05 |
[docs] modular transformers (#45327) |
|
Standard PR commit log, minimal free tex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] dtype (#45659) |
|
Short, lacks AI-generated phrasing or ex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cb memory management (#45587) |
|
Extremely brief chunked log, no AI phras |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cpu offloading (#45660) |
|
Minimal, no formal language or AI stylis |
2026-04-28 |
| COMMIT |
0.05 |
Fix `x_clip`: 8 failed test cases (#45394) |
|
Test fix, highly specific, fully normal |
2026-04-28 |
| PR |
0.05 |
hy_v3: add XPU expectations |
|
Very terse, informal request; clearly hu |
2026-05-09 |
| PR |
0.05 |
Update latest revision for Phi-4-multimodal test |
|
Brief, domain-specific content with a mi |
2026-04-28 |
| PR |
0.05 |
Fix `tie_word_embeddings` not lifted from `text_config` for |
|
Uses specific config names and informal |
2026-05-09 |
| PR |
0.05 |
Migrate TF32 API calls to new fp32_precision API |
|
Domain-specific references and concise t |
2026-05-09 |
| PR |
0.05 |
Add V-JEPA 2.1 inference support |
|
Template for PR, but actual content is t |
2026-04-17 |
| PR |
0.05 |
fix(bitsandbytes): implement reverse_op for Bnb4bitDeseriali |
|
Technical subject, domain abbreviations, |
2026-05-02 |
| PR |
0.05 |
feat: add crop() to StaticCache layers for assisted generati |
|
Technical language, domain-specific term |
2026-05-02 |
| PR |
0.05 |
Keep deleting |
|
Very terse, informal style with direct r |
2026-05-06 |
| PR |
0.05 |
fix padding side issue for fast_vlm tests |
|
Short, technical fix with direct context |
2026-04-23 |
| PR |
0.05 |
fix(qianfan_ocr): add XPU expectations |
|
Brief, informal, review request; clear h |
2026-04-24 |
| PR |
0.05 |
[Weight converter] Revert unnecessary changes to `rename_sou |
|
Uses domain vernacular, concise explanat |
2026-05-07 |
| PR |
0.05 |
Drop `content=None` from messages in `apply_chat_template` |
|
Uses domain jargon and concise phrasing |
2026-04-14 |
| PR |
0.05 |
Fix `kernelize()` crash for gpt_oss: missing `@use_kernel_fu |
|
Technical, concise; uses PR numbers and |
2026-05-06 |
| PR |
0.05 |
gpt_oss multi-GPU AMD support |
|
Technical, uses domain vocab (multi-GPU, |
2026-04-30 |
| PR |
0.04 |
Fix model parallel bugs for Gemma4 |
|
Terce, technical, includes error message |
2026-05-07 |
| PR |
0.04 |
[docs] contributing |
|
Brief, technical, and uses domain terms |
2026-04-15 |
| PR |
0.04 |
[CB] Fixes for SDPA and CPU offloading |
|
Includes domain jargon, terse explanatio |
2026-05-01 |
| PR |
0.03 |
fix: correct typo 'seperate' -> 'separate' in comments acros |
|
Fix description, includes typo correctio |
2026-05-07 |
| PR |
0.03 |
fix: raise ValueError when num_beams × vocab_size exceeds to |
|
Uses code error details and direct expla |
2026-05-07 |
| PR |
0.03 |
Fix gemma4 with multi-gpu setup |
|
Terse, technical style: clear human sign |
2026-05-07 |
| PR |
0.03 |
fix(rf_detr): correct paper URL and stale checkpoint referen |
|
Brief, specific, uses standard commit ph |
2026-05-07 |
| PR |
0.03 |
Fix added-token prefix space in SentencePiece fast tokenizer |
|
Clear technical focus, issue referencing |
2026-05-07 |
| PR |
0.02 |
Delete dead code in qwen-vl series |
|
Informal tone ('Dont review yet!'), doma |
2026-05-07 |
| COMMIT |
0.00 |
fix(rf_detr): fix failing tests (#45845) |
|
Direct, minimal language, issue context, |
2026-05-08 |
| COMMIT |
0.00 |
fix(granite4_vision): auto-fix failing tests (#45844) |
|
Direct technical description, signatures |
2026-05-08 |
| COMMIT |
0.00 |
fix(laguna): fix failing tests (#45842) |
|
Minimal edit, direct cause, no AI hallma |
2026-05-08 |
| COMMIT |
0.00 |
:rotating_light: Generic Sequence Classifier works for multi |
|
Informal, nonstandard grammar, domain-sp |
2026-05-08 |
| COMMIT |
0.00 |
Fix import error in moe.py by providing explicit schema to c |
|
Technical context, precise, realistic er |
2026-05-08 |
| COMMIT |
0.00 |
fix: correct typo 'seperate' -> 'separate' in comments acros |
|
Direct typo fix description, minimal det |
2026-05-08 |
| COMMIT |
0.00 |
🚨 [Fuyu] Remove FuyuBatchFeature subclass, use BatchFeature |
|
Terse style, clear purpose, co-author hu |
2026-05-08 |
| COMMIT |
0.00 |
Keep deleting (#45802) |
|
Extremely terse, informal commit message |
2026-05-08 |
| COMMIT |
0.00 |
Fix gemma4 with multi-gpu setup (#45826) |
|
Brief and informal; no AI hallmarks pres |
2026-05-07 |
| COMMIT |
0.00 |
Add HyperCLOVAX SEED Think 14B (#44956) |
|
Commit trailers but no AI signals; conci |
2026-05-07 |
| COMMIT |
0.00 |
Fix `kernelize()` crash for gpt_oss: missing `@use_kernel_fu |
|
Technical fixes, domain jargon, no AI ph |
2026-05-07 |
| COMMIT |
0.00 |
Cache `merged_typed_dict` to not break `validate_typed_dict` |
|
Direct changelog with domain terms; lack |
2026-05-07 |
| COMMIT |
0.00 |
Get rid of deprecated use_return_dict call. (#45815) |
|
Minimal, terse, human-written style, no |
2026-05-07 |
| COMMIT |
0.00 |
fix(qianfan_ocr): add XPU expectations (#45615) |
|
Technical listing; informal, lacks AI te |
2026-05-07 |
| COMMIT |
0.00 |
Fix shared config mutation issue in flash_attn_from_config ( |
|
Domain-specific, concise, signed-off; no |
2026-05-07 |
| COMMIT |
0.00 |
Add RF-DETR (#36895) |
|
Changelog is technical and terse, lacks |
2026-05-07 |
| COMMIT |
0.00 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Highly technical, multiple fixes, domain |
2026-05-07 |
| COMMIT |
0.00 |
refector: renamed file glob to cache to make it clearer (#45 |
|
Brief, informal commit message indicates |
2026-05-06 |
| COMMIT |
0.00 |
Fix decorator order (#45806) |
|
Very terse, sometimes typoed commit mess |
2026-05-06 |
| COMMIT |
0.00 |
[`Granite 4.1 Vision`] Fixup integration tests (#45805) |
|
Terse commit messages, human abbreviatio |
2026-05-06 |
| COMMIT |
0.00 |
fix: validate special token ids against attribute values (#4 |
|
Technical, concise; shows informal engin |
2026-05-06 |
| COMMIT |
0.00 |
Blockwise mask fn as opt arg in all masking functions (#4547 |
|
Highly informal, fragmented notes; no si |
2026-05-06 |
| COMMIT |
0.00 |
[CB] Refactor any model-related code in a separate class (#4 |
|
Uses domain jargon, informal corrections |
2026-05-06 |
| COMMIT |
0.00 |
fix: forward use_cache kwarg to attention mixer in nemotron_ |
|
Brief technical commit messages with dom |
2026-05-05 |
| COMMIT |
0.00 |
fix: correct spelling in continuous_api docstring (#45749) |
|
Minimal, terse technical correction; typ |
2026-05-05 |
| COMMIT |
0.00 |
Fix link to modular transformers documentation (#45746) |
|
Direct update explanation; clear, concis |
2026-05-05 |
| COMMIT |
0.00 |
Gemma4: fix failed test cases (#45568) |
|
List of technical fixes and updates, sig |
2026-05-05 |
| COMMIT |
0.00 |
First model (#45788) |
|
Casual tone and specific jargon; reflect |
2026-05-05 |
| COMMIT |
0.00 |
Fix CI: Allow more artifacts to be download in CI (#45785) |
|
Debug note and domain-specific context, |
2026-05-05 |
| COMMIT |
0.00 |
Add `concurrency` to `PR CI` workflow file (`pr-ci-caller.ym |
|
Single word commit, terse; standard huma |
2026-05-05 |
| COMMIT |
0.00 |
Reorder decorators for autodoc and dataclass (#45702) |
|
Short, technical log phrases typical of |
2026-05-05 |
| COMMIT |
0.00 |
deepseek r1 distilled tokenizer fix for qwen2 mapping (#4574 |
|
Typo present, informal tone; clear human |
2026-05-05 |
| COMMIT |
0.00 |
fix: Added Mps support in float fallback backends list (#45 |
|
Structured code fixes, informal language |
2026-05-05 |
| COMMIT |
0.00 |
Github Actions PR CI (caller) (#45476) |
|
Commit message is terse and includes dom |
2026-05-04 |
| COMMIT |
0.00 |
make sure we call check_auto in CI (#45775) |
|
Informal tone; domain-specific and conci |
2026-05-04 |
| COMMIT |
0.00 |
Better Grouped GEMM + EP (#45621) |
|
Commit log is informal, terse, and techn |
2026-05-04 |
| COMMIT |
0.00 |
DeepSeek OCR specifies an incorrect tokenizer class on the H |
|
Human-written; issue title is concise an |
2026-05-04 |
| COMMIT |
0.00 |
Fix auto mapping script (#45774) |
|
Extremely terse and informal commit mess |
2026-05-04 |
| COMMIT |
0.00 |
PythonBackend slow tokenizer convert_ids_to_tokens fix (#457 |
|
Commit message is terse and lacks AI-sty |
2026-05-04 |
| COMMIT |
0.00 |
[MINISTRAL3] Fix conversion script yarn's apply_scale suppor |
|
Commit message is terse and technical; n |
2026-05-03 |
| COMMIT |
0.00 |
🚨 Get rid of most Apex references (#45723) |
|
Concise and informal message, domain-typ |
2026-05-01 |
| COMMIT |
0.00 |
fix(utils): Resolve backbone utils test regressions (#45594) |
|
Short, domain-specific commit message; t |
2026-05-01 |
| COMMIT |
0.00 |
[CB] Better overall script and decode bucketting (#45653) |
|
Informal, terse checklist style is human |
2026-05-01 |
| COMMIT |
0.00 |
[docs] model testing (#45152) |
|
Terese commit messages; human style; dom |
2026-04-30 |
| COMMIT |
0.00 |
update dev (#45726) |
|
Brief update message; lacks AI hallmarks |
2026-04-30 |
| COMMIT |
0.00 |
[`OAI Privacy Filter`] Add integration test (#45725) |
|
Short, action-based messages; human comm |
2026-04-30 |
| COMMIT |
0.00 |
Speedup Qwen2VLImageProcessor (#45719) |
|
Technical, terse commit messages; human |
2026-04-30 |
| COMMIT |
0.00 |
Remove dead beam-search dummies from dummy_pt_objects.py (#4 |
|
Single, precise technical commit message |
2026-04-30 |
| COMMIT |
0.00 |
[Model] Add PP-FormulaNet Model Support (#45626) |
|
Terse, domain-specific commits; includes |
2026-04-30 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 10 utility files (#4 |
|
Clear domain context and project-specifi |
2026-04-30 |
| COMMIT |
0.00 |
[serve] cb error (#45691) |
|
Brief commit log format, no AI signals. |
2026-04-29 |
| COMMIT |
0.00 |
Fix trust_remote_code local cache collisions for local model |
|
Technical commit log, concise, no AI hal |
2026-04-29 |
| COMMIT |
0.00 |
Llama3 video fix (#45040) |
|
Standard iterative commit log, domain te |
2026-04-29 |
| COMMIT |
0.00 |
[Fix Phi4 test] Fall back to model config for image processo |
|
Technical changelog, brief summary, no A |
2026-04-29 |
| COMMIT |
0.00 |
Fix custom-module copies inheriting read-only permissions (# |
|
Detailed technical context, not overly f |
2026-04-29 |
| COMMIT |
0.00 |
Python code in model docs (#45608) |
|
Informal, terse commit log, domain-speci |
2026-04-29 |
| COMMIT |
0.00 |
fix failed test cases for blt model (#45596) |
|
Technical commit log, signed off by huma |
2026-04-29 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 3 pipeline files (#4 |
|
Standard technical changelog and co-auth |
2026-04-29 |
| COMMIT |
0.00 |
change got reverted (#45680) |
|
Extremely terse, typical human revert me |
2026-04-28 |
| COMMIT |
0.00 |
fix padding side issue for fast_vlm tests (#45592) |
|
Terse, domain-specific, includes real na |
2026-04-28 |
| COMMIT |
0.00 |
zero_shot_object_detection ValueError fix for python 3.13 (# |
|
Very brief, precise, with clear domain r |
2026-04-28 |
| COMMIT |
0.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback ( |
|
Concise, technical commit message with d |
2026-04-28 |
| COMMIT |
0.00 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Short, informal message; human co-author |
2026-04-28 |
| COMMIT |
0.00 |
No serving in quality docker image (#45677) |
|
Brief commit message, human co-author, n |
2026-04-28 |
| COMMIT |
0.00 |
Laguna XS.2 implementation (#45673) |
|
Minimal title-only commit, no sign of AI |
2026-04-28 |
| COMMIT |
0.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
Structured changelog, domain-specific co |
2026-04-28 |
| COMMIT |
0.00 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Concise, code-centric, domain-specific c |
2026-04-27 |
| COMMIT |
0.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
Terse, technical, and human-typical with |
2026-04-27 |
| COMMIT |
0.00 |
chore(typing): added modeling_utils to ty (#45425) |
|
Informal, uses jargon and review summari |
2026-04-27 |
| COMMIT |
0.00 |
model: Add DEIMv2 to Transformers (#44339) |
|
Uses changelog format, dense with domain |
2026-04-27 |
| COMMIT |
0.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
Detailed description with human-like bug |
2026-04-27 |
| COMMIT |
0.00 |
[gemma4] infer from config instead of hardcoding (#45606) |
|
Informal, concise updates and normal cod |
2026-04-27 |
| COMMIT |
0.00 |
Update quants tests (#45480) |
|
Minimalist, lacks AI style, uses terse c |
2026-04-27 |
| COMMIT |
0.00 |
Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o |
|
Technical summary with rationale, inform |
2026-04-27 |
| COMMIT |
0.00 |
🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in |
|
Patch details, domain-specific explanati |
2026-04-27 |
| COMMIT |
0.00 |
Fix colmodernvbert tests (#45652) |
|
Very terse, informal, intentionally mini |
2026-04-27 |
| COMMIT |
0.00 |
[CB] [Major] Add CPU request offloading (#45184) |
|
Commit messages are terse, informal, and |
2026-04-27 |
| COMMIT |
0.00 |
Fix peft constructors (#45622) |
|
Very terse and non-formal commit, human |
2026-04-27 |
| COMMIT |
0.00 |
chore: speedup modular converter (~30%) (#45046) |
|
Highly technical, terse, and informal; n |
2026-04-27 |
| COMMIT |
0.00 |
Fix whisper return language (#42227) |
|
Technical commit log with co-author trai |
2026-04-27 |
| COMMIT |
0.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
Short, direct commit style typical of hu |
2026-04-27 |
| COMMIT |
0.00 |
Raise clear error for `problem_type="single_label_classifica |
|
Explanation is technical with domain det |
2026-04-24 |
| PR |
0.00 |
TP refactor for FSDP + TP integration |
|
Contains terse technical TODOs and domai |
2026-03-26 |
| PR |
0.00 |
fix |
|
Extremely brief and informal; clearly hu |
2026-05-10 |
| PR |
0.00 |
feat(t5gemma2): add Flash Attention 2 support |
|
Technical jargon and informal listing in |
2026-05-10 |
| PR |
0.00 |
More robust processor from pretrained |
|
Technical description with domain refere |
2025-12-05 |
| PR |
0.00 |
trainer: clear MPS graph cache after each optimizer step (py |
|
Uses domain slang (MPSGraph) and brief t |
2026-05-07 |
| PR |
0.00 |
Fix M-RoPE device mismatch in Qwen3VL family under FSDP2 CPU |
|
Direct fix description with domain conte |
2026-05-09 |
| PR |
0.00 |
Implement VibeVoice |
|
Concise, uses domain-specific references |
2025-08-29 |
| PR |
0.00 |
fix(testing_utils): guard get_device_capability with torch.c |
|
Technical bug fix, uses domain jargon an |
2026-04-09 |
| PR |
0.00 |
- |
|
No free-text content provided; template |
2026-05-09 |
| PR |
0.00 |
[docs] adding audio/video processors |
|
Terse, references issue comment, clearly |
2026-05-05 |
| PR |
0.00 |
[docs] chat template |
|
Brief, direct, minimal doc update detail |
2026-05-08 |
| PR |
0.00 |
:rotating_light: [`Attn`] Remove all old mask APIs from mode |
|
Concise, uses emojis, informal technical |
2026-02-11 |
| PR |
0.00 |
fix(rf_detr): fix failing tests |
|
Very terse, uses repo jargon, human styl |
2026-05-08 |
| PR |
0.00 |
fix failed test cases for blt model |
|
Short, informal, and contains domain abb |
2026-04-23 |
| PR |
0.00 |
Add HyperCLOVAX SEED Think 14B |
|
PR template structure; actual content is |
2026-03-23 |