| COMMIT |
1.00 |
Add DeepSeek V4 (#45643) |
|
Commit message contains explicit AI assi |
2026-05-02 |
| COMMIT |
1.00 |
Support for a new Granite-Speech-Plus model (#45695) |
|
Commit message contains explicit AI assi |
2026-04-29 |
| COMMIT |
1.00 |
fixing more typos (#45689) |
|
Commit message contains explicit AI assi |
2026-04-28 |
| COMMIT |
1.00 |
Fix configuration reading and error handling for kernels (#4 |
|
Commit message contains explicit AI assi |
2026-04-23 |
| COMMIT |
1.00 |
Fix `AttributeError` on `s_aux=None` in `flash_attention_for |
|
Commit message contains explicit AI assi |
2026-04-23 |
| COMMIT |
1.00 |
Add expert parallelism (EP) config support for Qwen3 MoE (# |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
[`Privacy Filter`] Add model (#45580) |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
Add ForSequenceClassification heads for the OLMo family (#45 |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
Add /v1/completions endpoint (OpenAI legacy completions API) |
|
Commit message contains explicit AI assi |
2026-04-22 |
| PR |
1.00 |
Better Grouped GEMM + EP |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
fix: Made histc_input robust for broader hardware |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
1.00 |
Fix split batch size |
|
PR body explicitly mentions AI collabora |
2026-05-02 |
| PR |
1.00 |
Fix link to modular transformers documentation |
|
PR body explicitly mentions AI collabora |
2026-05-02 |
| PR |
1.00 |
[MINISTRAL3] Fix conversion script yarn's apply_scale suppor |
|
PR body explicitly mentions AI collabora |
2026-05-02 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
Add Xiaomi MiMo-V2 |
|
PR body explicitly mentions AI collabora |
2026-03-31 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
Add EXAONE 4.5 implementations |
|
PR body explicitly mentions AI collabora |
2026-04-16 |
| PR |
1.00 |
Add ctsm model |
|
PR body explicitly mentions AI collabora |
2026-04-17 |
| PR |
1.00 |
Fix memory leak in T5 by adding opt-out for apex FusedRMSNor |
|
PR body explicitly mentions AI collabora |
2026-04-30 |
| PR |
1.00 |
Speedup Qwen2VLImageProcessor |
|
PR body explicitly mentions AI collabora |
2026-04-30 |
| PR |
1.00 |
chore(typing): add ty type checking for 10 utility files |
|
PR body explicitly mentions AI collabora |
2026-04-29 |
| PR |
1.00 |
DeepGEMM BF16 + mixed FP8/FP4 + MegaMoE + refactor |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
Exclude audio modules from conversion process |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
0.70 |
Doc translate to Persian(farsi) |
|
"This PR adds..." and boilerplate tone s |
2026-04-27 |
| PR |
0.65 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Opening 'This PR aims to...' is a typica |
2026-04-27 |
| PR |
0.25 |
Adding support for Nandi Models |
|
Mostly human, but last sentence slightly |
2026-03-29 |
| PR |
0.20 |
Fix gemma gguf tokenizer |
|
Technical root cause analysis; clear, no |
2025-10-15 |
| PR |
0.20 |
Docs add custom loss example |
|
Standard changelog language, focused and |
2025-10-15 |
| PR |
0.20 |
fix(bitsandbytes): implement reverse_op for Bnb4bitDeseriali |
|
Structured, concise, with technical deta |
2026-05-02 |
| PR |
0.20 |
Modularize `ProcessorMixin` into smaller components |
|
PR is clear, but modularization explanat |
2026-04-17 |
| PR |
0.20 |
Add regression test for MusicgenMelody audio conditioning (G |
|
Domain-specific, clear regression test s |
2026-05-01 |
| PR |
0.20 |
fix: the serving api endpoints in server in server.py |
|
Slightly formal but technical and contai |
2026-04-30 |
| PR |
0.20 |
Fix check_auto.py wiping auto_mappings.py due to missing flu |
|
Technical and specific; wordy but not AI |
2026-04-30 |
| PR |
0.20 |
Update latest revision for Phi-4-multimodal test |
|
Informal tone, specific references, and |
2026-04-28 |
| PR |
0.20 |
Add docstrings to type validator functions |
|
Concise, technical update; no AI-like ph |
2026-04-29 |
| PR |
0.15 |
[SAM3] Enable single-scale input support in Mask Decoder |
|
Technical domain detail and concise summ |
2025-12-26 |
| PR |
0.15 |
Optimize Qwen3 RoPE: precompute cos/sin cache for static rop |
|
Uses domain-specific abbreviations and c |
2026-05-02 |
| PR |
0.15 |
fix(processing): Filter kwargs in ProcessorMixin call to pre |
|
Technical explanation with clear context |
2025-10-15 |
| PR |
0.15 |
Proposal: Agent-first CLI |
|
Slightly more formal, but still mostly t |
2026-04-03 |
| PR |
0.15 |
Fix trust_remote_code local cache collisions for local model |
|
Structured, technical changelog; human s |
2026-04-24 |
| PR |
0.15 |
Feature/add axk1 |
|
Brief, domain-specific, lacking AI-style |
2026-04-14 |
| PR |
0.15 |
Fix train_batch_size and eval_batch_size to respect split_ba |
|
Domain-specific context, direct, not ove |
2026-04-29 |
| PR |
0.15 |
Python code in model docs |
|
Informal, batched update, technical shor |
2026-04-23 |
| COMMIT |
0.10 |
[nemotron_h] respect _no_reinit flag on dt_bias and out_proj |
|
Commit contains domain context and code |
2026-05-01 |
| COMMIT |
0.10 |
Doc translate to Persian(farsi) (#45664) |
|
Slightly formal in part, but domain-spec |
2026-04-30 |
| COMMIT |
0.10 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Somewhat formal wording but still within |
2026-04-28 |
| PR |
0.10 |
feat: add crop() to StaticCache layers for assisted generati |
|
Informal, technical jargon and abbreviat |
2026-05-02 |
| PR |
0.10 |
feat(llama): add has_weight parameter to LlamaRMSNorm for Fl |
|
Terse technical summary, uses domain abb |
2026-05-02 |
| PR |
0.10 |
Gemma4: fix failed test cases |
|
Brief, specific description with domain |
2026-04-22 |
| PR |
0.10 |
Add DeepSeek V4 |
|
Short, informal, includes human reviewer |
2026-04-25 |
| PR |
0.10 |
Add V-JEPA 2.1 inference support |
|
Technical, context-specific detail—human |
2026-04-17 |
| PR |
0.10 |
[nemotron_h] respect _no_reinit flag on dt_bias and out_proj |
|
PR content uses technical terms and cont |
2026-04-23 |
| PR |
0.10 |
DeepSeek OCR specifies an incorrect tokenizer class on the H |
|
PR content uses technical specifics, not |
2026-05-01 |
| PR |
0.10 |
Reorder decorators for autodoc and dataclass |
|
Uses informal tone and domain-specific a |
2026-04-29 |
| PR |
0.10 |
Add RF-DETR |
|
Direct, minimal explanation; no AI-style |
2025-03-21 |
| PR |
0.10 |
fix: AutoConfig reloads wrong class after save_pretrained + |
|
Domain-specific, explanatory, informal t |
2026-04-30 |
| PR |
0.10 |
🚨 Get rid of most Apex references |
|
Domain-specific context and informal, tr |
2026-04-30 |
| PR |
0.10 |
Qwen3 ASR and Forced Aligner |
|
Brief, project-specific, no AI signal; u |
2026-02-08 |
| PR |
0.10 |
Add Granite 4.1 Vision (granite4_vision) |
|
Technical changelog, clear context, proj |
2026-04-23 |
| PR |
0.10 |
Blockwise mask fn as opt arg in all masking functions |
|
Informal, domain jargon and terse summar |
2026-04-16 |
| PR |
0.10 |
chore(mlinter): implement part of rule TRF003 |
|
Terse, formatted with domain abbreviatio |
2026-04-30 |
| PR |
0.10 |
Fix xdist collisions for captured_info artifacts and preserv |
|
Detailed technical explanation, domain a |
2026-04-25 |
| PR |
0.10 |
[CB] Better overall script and decode bucketting |
|
Technical summary, realistic usage, not |
2026-04-27 |
| PR |
0.10 |
PythonBackend slow tokenizer convert_ids_to_tokens fix |
|
Technical, concise; typographical errors |
2026-04-30 |
| PR |
0.10 |
[docs] model testing |
|
Brief, informal, domain-specific phrases |
2026-03-31 |
| PR |
0.10 |
[skills] model doc |
|
Short, informal language with abbreviati |
2026-04-29 |
| PR |
0.10 |
gpt_oss multi-GPU AMD support |
|
Contains domain-specific detail and sour |
2026-04-30 |
| PR |
0.10 |
[docs] update model cards |
|
Brief, domain terms, informal and minima |
2026-04-23 |
| PR |
0.10 |
[Don't merge] Call CI workflow |
|
Very brief, terse, lacks AI signals; typ |
2026-04-16 |
| PR |
0.10 |
fix(pytorch_utils): per-instance lru_cache to fix RT-DETR me |
|
Includes code snippet and bug context; n |
2026-04-30 |
| PR |
0.10 |
Fix torch.export compatibility for Mixtral MoE models |
|
Concise technical notes, domain abbrevia |
2025-08-12 |
| PR |
0.10 |
Add Deepseek-OCR-2 model |
|
Direct, technical style; minimal filler |
2026-03-27 |
| PR |
0.10 |
Remove dead beam-search dummies from dummy_pt_objects.py |
|
Uses specific references and issue numbe |
2026-04-30 |
| PR |
0.10 |
Add heterogeneous model support (per-layer config and modeli |
|
Domain-specific phrasing and abbreviatio |
2026-04-09 |
| PR |
0.10 |
[Gemma4] Fix SharedKVCache identity loss under FSDP2 cast_fo |
|
Precise, technical, context-heavy langua |
2026-04-30 |
| PR |
0.10 |
Add FP8 kernel acceleration for compressed-tensors quantized |
|
Technical, descriptive content with no A |
2026-04-29 |
| PR |
0.10 |
Add MiniCPM3 |
|
Concise, domain-specific; no AI phrasing |
2025-09-24 |
| PR |
0.10 |
Fix padding calculation in new_conv_state assignment |
|
Detailed technical explanation, no AI ma |
2026-04-30 |
| PR |
0.10 |
Support for a new Granite-Speech-Plus model |
|
Brief, somewhat informal, references PR |
2026-04-29 |
| PR |
0.10 |
[serve] cb error |
|
Direct, specific, no AI phrasing. |
2026-04-28 |
| PR |
0.10 |
Fix void segmentation map label reduction |
|
Informal checklist style and clear techn |
2026-04-14 |
| PR |
0.10 |
Fix custom-module copies inheriting read-only permissions |
|
Human explanation with technical details |
2026-04-28 |
| PR |
0.10 |
fix(testing): check torch.cuda.is_available() before get_dev |
|
Technical and succinct, human-typical bu |
2026-04-29 |
| COMMIT |
0.05 |
Add image processors refactor to v5 migration guide (#45556) |
|
Brief, changelog-focused; no AI hallmark |
2026-04-28 |
| COMMIT |
0.05 |
[docs] modular transformers (#45327) |
|
Standard PR commit log, minimal free tex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] dtype (#45659) |
|
Short, lacks AI-generated phrasing or ex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cb memory management (#45587) |
|
Extremely brief chunked log, no AI phras |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cpu offloading (#45660) |
|
Minimal, no formal language or AI stylis |
2026-04-28 |
| COMMIT |
0.05 |
Fix `x_clip`: 8 failed test cases (#45394) |
|
Test fix, highly specific, fully normal |
2026-04-28 |
| COMMIT |
0.05 |
perf: avoid recomputing rotary_emb for each layer in some Go |
|
Commit messages are terse, technical, an |
2026-04-22 |
| COMMIT |
0.05 |
Gemma4 training with text-only samples (#45454) |
|
Brief, informal, and technical commit me |
2026-04-22 |
| COMMIT |
0.05 |
[nemotron_h] Add support for MLP mixers (#44763) |
|
Short, informal commit messages with dom |
2026-04-22 |
| COMMIT |
0.05 |
add expert parallelism for gemma-4-26B-A4B-it (#45279) |
|
Technical commit messages and Signed-off |
2026-04-22 |
| COMMIT |
0.05 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Technical, detail-oriented commit messag |
2026-04-22 |
| COMMIT |
0.05 |
Update Gemma4 weight conversion script (#45328) |
|
Technical commit messages, informal lang |
2026-04-22 |
| COMMIT |
0.05 |
fix table update versions (#45544) |
|
Very brief, technical, and informal comm |
2026-04-22 |
| COMMIT |
0.05 |
Add disable_mmap kwarg to from_pretrained with hf-mount auto |
|
Technical, template-driven changes, huma |
2026-04-22 |
| COMMIT |
0.05 |
fix(DSV3): parity between native `DeepseekV3MoE` and remote |
|
Technical description, terse phrasing, d |
2026-04-22 |
| PR |
0.05 |
Add Molmo2 |
|
No free-text; only copied template conte |
2026-01-23 |
| PR |
0.05 |
fix(musicgen_melody): use DynamicCache instead of EncoderDec |
|
Domain-specific language, concise, clear |
2026-05-01 |
| PR |
0.05 |
fix(quantizers): make user-supplied skip_modules additive wi |
|
Concise, technical changes; domain langu |
2026-05-01 |
| PR |
0.05 |
fix(utils): Resolve backbone utils test regressions |
|
Uses changelog format, domain references |
2026-04-23 |
| PR |
0.05 |
[CB] Fixes for SDPA and CPU offloading |
|
Succinct, bug-fix oriented, includes tec |
2026-05-01 |
| PR |
0.05 |
refector: renamed file glob to cache to make it clearer |
|
Direct, informal tone with specific tech |
2026-04-30 |
| PR |
0.05 |
Fix loading logic issue |
|
Informal, domain-specific explanation; c |
2026-02-17 |
| PR |
0.05 |
qa: speed up dtype regex weight load + reduce dtype tests to |
|
Technical, terse checklist; no AI-like s |
2026-04-24 |
| PR |
0.05 |
Added SWA and Linear Attention to Llama3.2-1b |
|
Very brief, experimental note; lacks any |
2026-04-30 |
| PR |
0.05 |
[nit] glmasr should be in AutoModelForMultimodalLM |
|
Minimal content, domain reference, typic |
2026-04-28 |
| COMMIT |
0.00 |
🚨 Get rid of most Apex references (#45723) |
|
Concise and informal message, domain-typ |
2026-05-01 |
| COMMIT |
0.00 |
fix(utils): Resolve backbone utils test regressions (#45594) |
|
Short, domain-specific commit message; t |
2026-05-01 |
| COMMIT |
0.00 |
[CB] Better overall script and decode bucketting (#45653) |
|
Informal, terse checklist style is human |
2026-05-01 |
| COMMIT |
0.00 |
[docs] model testing (#45152) |
|
Terese commit messages; human style; dom |
2026-04-30 |
| COMMIT |
0.00 |
update dev (#45726) |
|
Brief update message; lacks AI hallmarks |
2026-04-30 |
| COMMIT |
0.00 |
[`OAI Privacy Filter`] Add integration test (#45725) |
|
Short, action-based messages; human comm |
2026-04-30 |
| COMMIT |
0.00 |
Speedup Qwen2VLImageProcessor (#45719) |
|
Technical, terse commit messages; human |
2026-04-30 |
| COMMIT |
0.00 |
Remove dead beam-search dummies from dummy_pt_objects.py (#4 |
|
Single, precise technical commit message |
2026-04-30 |
| COMMIT |
0.00 |
[Model] Add PP-FormulaNet Model Support (#45626) |
|
Terse, domain-specific commits; includes |
2026-04-30 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 10 utility files (#4 |
|
Clear domain context and project-specifi |
2026-04-30 |
| COMMIT |
0.00 |
[serve] cb error (#45691) |
|
Brief commit log format, no AI signals. |
2026-04-29 |
| COMMIT |
0.00 |
Fix trust_remote_code local cache collisions for local model |
|
Technical commit log, concise, no AI hal |
2026-04-29 |
| COMMIT |
0.00 |
Llama3 video fix (#45040) |
|
Standard iterative commit log, domain te |
2026-04-29 |
| COMMIT |
0.00 |
[Fix Phi4 test] Fall back to model config for image processo |
|
Technical changelog, brief summary, no A |
2026-04-29 |
| COMMIT |
0.00 |
Fix custom-module copies inheriting read-only permissions (# |
|
Detailed technical context, not overly f |
2026-04-29 |
| COMMIT |
0.00 |
Python code in model docs (#45608) |
|
Informal, terse commit log, domain-speci |
2026-04-29 |
| COMMIT |
0.00 |
fix failed test cases for blt model (#45596) |
|
Technical commit log, signed off by huma |
2026-04-29 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 3 pipeline files (#4 |
|
Standard technical changelog and co-auth |
2026-04-29 |
| COMMIT |
0.00 |
change got reverted (#45680) |
|
Extremely terse, typical human revert me |
2026-04-28 |
| COMMIT |
0.00 |
fix padding side issue for fast_vlm tests (#45592) |
|
Terse, domain-specific, includes real na |
2026-04-28 |
| COMMIT |
0.00 |
zero_shot_object_detection ValueError fix for python 3.13 (# |
|
Very brief, precise, with clear domain r |
2026-04-28 |
| COMMIT |
0.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback ( |
|
Concise, technical commit message with d |
2026-04-28 |
| COMMIT |
0.00 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Short, informal message; human co-author |
2026-04-28 |
| COMMIT |
0.00 |
No serving in quality docker image (#45677) |
|
Brief commit message, human co-author, n |
2026-04-28 |
| COMMIT |
0.00 |
Laguna XS.2 implementation (#45673) |
|
Minimal title-only commit, no sign of AI |
2026-04-28 |
| COMMIT |
0.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
Structured changelog, domain-specific co |
2026-04-28 |
| COMMIT |
0.00 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Concise, code-centric, domain-specific c |
2026-04-27 |
| COMMIT |
0.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
Terse, technical, and human-typical with |
2026-04-27 |
| COMMIT |
0.00 |
chore(typing): added modeling_utils to ty (#45425) |
|
Informal, uses jargon and review summari |
2026-04-27 |
| COMMIT |
0.00 |
model: Add DEIMv2 to Transformers (#44339) |
|
Uses changelog format, dense with domain |
2026-04-27 |
| COMMIT |
0.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
Detailed description with human-like bug |
2026-04-27 |
| COMMIT |
0.00 |
[gemma4] infer from config instead of hardcoding (#45606) |
|
Informal, concise updates and normal cod |
2026-04-27 |
| COMMIT |
0.00 |
Update quants tests (#45480) |
|
Minimalist, lacks AI style, uses terse c |
2026-04-27 |
| COMMIT |
0.00 |
Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o |
|
Technical summary with rationale, inform |
2026-04-27 |
| COMMIT |
0.00 |
🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in |
|
Patch details, domain-specific explanati |
2026-04-27 |
| COMMIT |
0.00 |
Fix colmodernvbert tests (#45652) |
|
Very terse, informal, intentionally mini |
2026-04-27 |
| COMMIT |
0.00 |
[CB] [Major] Add CPU request offloading (#45184) |
|
Commit messages are terse, informal, and |
2026-04-27 |
| COMMIT |
0.00 |
Fix peft constructors (#45622) |
|
Very terse and non-formal commit, human |
2026-04-27 |
| COMMIT |
0.00 |
chore: speedup modular converter (~30%) (#45046) |
|
Highly technical, terse, and informal; n |
2026-04-27 |
| COMMIT |
0.00 |
Fix whisper return language (#42227) |
|
Technical commit log with co-author trai |
2026-04-27 |
| COMMIT |
0.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
Short, direct commit style typical of hu |
2026-04-27 |
| COMMIT |
0.00 |
Raise clear error for `problem_type="single_label_classifica |
|
Explanation is technical with domain det |
2026-04-24 |
| COMMIT |
0.00 |
CircleCI with torch 2.11 (#45633) |
|
Repetitive commit summary, highly typica |
2026-04-24 |
| COMMIT |
0.00 |
chore: bump doc-builder SHA for main doc build workflow (#45 |
|
Standard terse chore commit, no AI style |
2026-04-24 |
| COMMIT |
0.00 |
Allow more artifacts to be download in CI (#45629) |
|
Sparse, informal; lacks signature AI phr |
2026-04-24 |
| COMMIT |
0.00 |
chore(qa): split pipeline and add type checking (#45432) |
|
Contains abbreviations and minimal phras |
2026-04-24 |
| COMMIT |
0.00 |
Skip failing offloading tests (#45624) |
|
Commit messages are brief and telegraphi |
2026-04-24 |
| COMMIT |
0.00 |
generate: drop stale num_return_sequences warning on continu |
|
Technical justification, abbreviations, |
2026-04-24 |
| COMMIT |
0.00 |
Remove unnecessary generate warnings (#45619) |
|
Brief, imperative commit style, no AI si |
2026-04-24 |
| COMMIT |
0.00 |
fix: compute auxiliary losses when denoising is disabled in |
|
Commit uses terse, technical language an |
2026-04-23 |
| COMMIT |
0.00 |
qa: bumped mlinter and allow local override (#45585) |
|
Informal commit lines and explicit human |
2026-04-23 |
| COMMIT |
0.00 |
Processing Utils: continue when content is a string (#45605) |
|
Terse commit message; direct technical f |
2026-04-23 |
| COMMIT |
0.00 |
SonicMoe (#45433) |
|
Informal, iterative commit messages with |
2026-04-23 |
| COMMIT |
0.00 |
fix transformers + torchao nvfp4 serialization (#45573) |
|
Casual tone, human-specific comments, an |
2026-04-23 |
| COMMIT |
0.00 |
[AMD CI] Fix expectations for Gemma3n (#45602) |
|
Short, lower-case, informal change descr |
2026-04-23 |
| COMMIT |
0.00 |
[docs] multi-turn tool calling (#45554) |
|
Very terse, typical human commit without |
2026-04-23 |
| COMMIT |
0.00 |
Allow for registered experts from kernels hub (#45577) |
|
Iterative, collaborative commit history |
2026-04-23 |
| COMMIT |
0.00 |
[CB] Changes for long generation (#45530) |
|
Highly informal, domain-heavy and collab |
2026-04-23 |
| COMMIT |
0.00 |
Align latest model attention function dispatch (#45598) |
|
Extremely terse, lacks AI-writing hallma |
2026-04-23 |
| COMMIT |
0.00 |
Gemma3n and Gemma4 cannot use rotary kernel (#45564) |
|
Short, direct human style; contains tech |
2026-04-23 |
| COMMIT |
0.00 |
do not index past decoded chars with special tokens (#45435) |
|
Informal tone, domain-specific phrasing, |
2026-04-22 |
| COMMIT |
0.00 |
Update dev version (#45583) |
|
Brief, informal; lacks AI stylistic cues |
2026-04-22 |
| COMMIT |
0.00 |
Update torchao usage for XPU and CPU (#45560) |
|
Very terse, technical, humanlike style. |
2026-04-22 |
| COMMIT |
0.00 |
[docs] per-request sampling params (#45553) |
|
Minimal, informal; clear domain-specific |
2026-04-22 |
| COMMIT |
0.00 |
Add IndexCache support for GLM5 DSA (#45424) |
|
Casual tone, domain jargon, human abbrev |
2026-04-22 |
| COMMIT |
0.00 |
Fix redundant logic in video processing SmolVLM (#45272) |
|
Brief, casual style, domain-specific wor |
2026-04-22 |
| COMMIT |
0.00 |
Fix typos (#45574) |
|
Domain jargon, typos, terse tone; human |
2026-04-22 |
| COMMIT |
0.00 |
Updated the image cache for Paddle models according to the l |
|
Terse style, domain language, minimal ex |
2026-04-22 |
| COMMIT |
0.00 |
[Model] Add SLANet Model Support (#45532) |
|
Brief changelog, domain signals, informa |
2026-04-22 |
| COMMIT |
0.00 |
refactor(Dots1): drop Dots1MoE override to `pass` (inherits |
|
Domain jargon, concise, no AI signals de |
2026-04-22 |
| COMMIT |
0.00 |
Move some conversion mappings to PrefixChange (#45567) |
|
Extremely terse commit; no AI signals, h |
2026-04-22 |
| COMMIT |
0.00 |
Align gemma3n cache sharing to gemma4 (#45489) |
|
Terse, informal commit messages; lacks A |
2026-04-22 |
| COMMIT |
0.00 |
[modular] Fix modular logic broken in #45045 (#45539) |
|
Very brief, informal with typos; clearly |
2026-04-22 |
| PR |
0.00 |
deepseek r1 distilled tokenizer fix for qwen2 mapping |
|
Minimal, terse phrasing; domain-specific |
2026-05-02 |
| PR |
0.00 |
fix: restore vocabulary loading in CamembertTokenizer |
|
Concise, domain-specific; clear human st |
2026-04-30 |
| PR |
0.00 |
Fix IndexError in sdpa_mask and flex_attention_mask for 0D t |
|
Technical, detailed explanation; entirel |
2026-05-02 |
| PR |
0.00 |
Extended n-to-1 kernel fusion via `KernelConfig` |
|
No free-text; only PR template and title |
2026-04-27 |
| PR |
0.00 |
Added library_name and library_version to HfApi/Hub calls fo |
|
REPO PR TEMPLATE, but free-text is conci |
2026-04-30 |
| PR |
0.00 |
TP refactor for FSDP + TP integration |
|
Brief, fragmentary TODO list indicative |
2026-03-26 |
| PR |
0.00 |
[skills] fine-tuning |
|
Casual tone, code snippet, clearly human |
2026-04-30 |
| PR |
0.00 |
[New Model] Add MiniCPM3 support |
|
REPO PR TEMPLATE; actual text is brief a |
2026-04-23 |
| PR |
0.00 |
update dev |
|
Brief, informal, mentions contributors, |
2026-04-30 |
| PR |
0.00 |
Add xcodec2 model |
|
Informal, uses checklist and direct styl |
2026-02-20 |
| PR |
0.00 |
[`OAI Privacy Filter`] Add integration test |
|
Very brief, informal phrasing, no AI hal |
2026-04-30 |
| PR |
0.00 |
Add new model: Kimi2-6 |
|
Content is minimal and non-fluent, likel |
2026-04-24 |
| PR |
0.00 |
[Model] Add PP-FormulaNet Model Support |
|
Only a title with informal human review, |
2026-04-24 |
| PR |
0.00 |
fix: restore vocabulary loading in CamembertTokenizer |
|
Terse, domain-relevant; standard human c |
2026-04-30 |
| PR |
0.00 |
Fix model parallel issue for altclip model and ChineseClip m |
|
Technical, brief description and direct |
2026-04-17 |
| PR |
0.00 |
Llama3 video fix |
|
Informal language, references to people, |
2026-03-27 |
| PR |
0.00 |
[Fix Phi4 test] Fall back to model config for image processo |
|
Template; free-text is terse and technic |
2026-04-28 |