| COMMIT |
1.00 |
fixing more typos (#45689) |
|
Commit message contains explicit AI assi |
2026-04-28 |
| COMMIT |
1.00 |
Fix configuration reading and error handling for kernels (#4 |
|
Commit message contains explicit AI assi |
2026-04-23 |
| COMMIT |
1.00 |
Fix `AttributeError` on `s_aux=None` in `flash_attention_for |
|
Commit message contains explicit AI assi |
2026-04-23 |
| COMMIT |
1.00 |
Add expert parallelism (EP) config support for Qwen3 MoE (# |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
[`Privacy Filter`] Add model (#45580) |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
Add ForSequenceClassification heads for the OLMo family (#45 |
|
Commit message contains explicit AI assi |
2026-04-22 |
| COMMIT |
1.00 |
Add /v1/completions endpoint (OpenAI legacy completions API) |
|
Commit message contains explicit AI assi |
2026-04-22 |
| PR |
1.00 |
Add EXAONE 4.5 implementations |
|
PR body explicitly mentions AI collabora |
2026-04-16 |
| PR |
1.00 |
Add Xiaomi MiMo-V2 |
|
PR body explicitly mentions AI collabora |
2026-03-31 |
| PR |
1.00 |
fix: Made histc_input robust for broader hardware |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
1.00 |
chore(typing): add ty type checking for 3 pipeline files |
|
PR body explicitly mentions AI collabora |
2026-04-27 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
Exclude audio modules from conversion process |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
1.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback |
|
PR body explicitly mentions AI collabora |
2026-04-27 |
| PR |
1.00 |
Better Grouped GEMM + EP |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
Add Qwen3.5 GGUF loading support |
|
PR body explicitly mentions AI collabora |
2026-04-07 |
| PR |
1.00 |
Laguna XS.2 implementation |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
1.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
PR body explicitly mentions AI collabora |
2026-04-21 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
PR body explicitly mentions AI collabora |
2026-04-19 |
| PR |
1.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
0.60 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
ChatGPT-like phrase 'This PR aims to' an |
2026-04-27 |
| PR |
0.50 |
Doc translate to Persian(farsi) |
|
Very formal and explanatory, no clear AI |
2026-04-27 |
| PR |
0.20 |
[serve] Support for reasoning |
|
Concise, domain-specific but slightly fo |
2026-04-28 |
| PR |
0.20 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Brief, domain-specific explanation; no A |
2026-04-28 |
| PR |
0.20 |
Modularize `ProcessorMixin` into smaller components |
|
Concise, technical phrasing without AI-l |
2026-04-17 |
| PR |
0.20 |
Fix shared config mutation issue in flash_attn_from_config |
|
Short, technical bugfix summary, domain- |
2026-04-28 |
| PR |
0.20 |
Add Promptable Concept Segmentation pipeline |
|
Short custom summary without generic AI- |
2026-01-30 |
| PR |
0.20 |
Update latest revision for Phi-4-multimodal test |
|
Test failure context; wording is typical |
2026-04-28 |
| PR |
0.20 |
qa: more lazy loading |
|
Direct reference to an issue and technic |
2026-04-23 |
| PR |
0.20 |
FIX Restore LoRA hotswapping functionality |
|
Straightforward technical summary; no AI |
2026-04-28 |
| PR |
0.20 |
Fix KeyError for flash_attn in import_utils.py on Python 3.1 |
|
Small formatting issue, but includes a t |
2026-04-25 |
| PR |
0.15 |
Fix trust_remote_code local cache collisions for local model |
|
Structured, technical changelog; human s |
2026-04-24 |
| PR |
0.15 |
Add Deepseek-OCR-2 model |
|
References paper and hub, informal headi |
2026-03-27 |
| PR |
0.13 |
Python code in model docs |
|
Uses casual batch update phrasing and te |
2026-04-23 |
| PR |
0.12 |
fix(janus): Handle None values in image generation mode |
|
Direct technical description, specific b |
2026-03-17 |
| COMMIT |
0.10 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Somewhat formal wording but still within |
2026-04-28 |
| PR |
0.10 |
Require input_ids for repetition penalty |
|
Technical, concise human-written content |
2026-04-13 |
| PR |
0.10 |
[serve] cb error |
|
Direct, specific, no AI phrasing. |
2026-04-28 |
| PR |
0.10 |
Fix custom-module copies inheriting read-only permissions |
|
Human explanation with technical details |
2026-04-28 |
| PR |
0.10 |
fix padding side issue for fast_vlm tests |
|
Terse, uses informal tone and domain lan |
2026-04-23 |
| PR |
0.10 |
fix(x_clip): fix 8 failed test cases |
|
Very brief and to the point; non-AI-like |
2026-04-13 |
| PR |
0.10 |
qa: speed up dtype regex weight load + reduce dtype tests to |
|
Uses technical shorthand and short sente |
2026-04-24 |
| PR |
0.10 |
zero_shot_object_detection ValueError fix for python 3.13 |
|
Concise, domain-specific explanation wit |
2026-04-28 |
| PR |
0.10 |
TST Run fast PEFT tests in normal CI |
|
Slightly more formal, but uses short sen |
2026-04-28 |
| PR |
0.10 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Direct, technical language with missing |
2026-04-28 |
| PR |
0.10 |
FSDP2 native support in transformers |
|
Mix of English and French, shorthand, no |
2026-02-17 |
| PR |
0.10 |
Proposal: Agent-first CLI |
|
Slightly formal but focused, with incomp |
2026-04-03 |
| PR |
0.10 |
Fix OOM regression for FSDP2 + cpu_ram_efficient_loading on |
|
Detailed, technical, context-specific, n |
2026-04-25 |
| PR |
0.10 |
Fix peft constructors |
|
Concise, references discussion, typical |
2026-04-24 |
| PR |
0.10 |
Add Multi-Token Prediction (MTP) support for Qwen3.5 |
|
Technical repeat of title, structure is |
2026-04-24 |
| PR |
0.10 |
[GGUF] Add support for Qwen3.5 MoE (qwen35moe arch) |
|
Clear, technical, template usage, no AI |
2026-04-27 |
| PR |
0.10 |
[gemma4] infer from config instead of hardcoding |
|
Terse and specific, minimal free text, i |
2026-04-23 |
| PR |
0.10 |
Update quants tests |
|
Mentions domain issues, informal contrac |
2026-04-16 |
| PR |
0.10 |
Fix the order of `cls.config` resolution |
|
Directly references an issue, concise hu |
2026-04-27 |
| PR |
0.10 |
Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o |
|
Domain-specific, concise summary, normal |
2026-04-19 |
| PR |
0.10 |
Add regression test for Gemma4 audio relative positional ran |
|
Brief, context-specific; uses normal eng |
2026-04-23 |
| PR |
0.10 |
Add heterogeneous model support (per-layer config and modeli |
|
Technical, focused, somewhat terse—stand |
2026-04-09 |
| PR |
0.10 |
Fix SDPA inference tolerances for MPS backend |
|
Targeted, domain-heavy wording, no AI ha |
2026-04-25 |
| PR |
0.10 |
Adding support for Nandi Models |
|
Concise, only the closing thanks/emoji i |
2026-03-29 |
| PR |
0.10 |
chore: speedup modular converter (~30%) |
|
Short, technical, uses common collaborat |
2026-03-27 |
| PR |
0.10 |
Gemma4: fix failed test cases |
|
Brief, domain-specific, and informal; la |
2026-04-22 |
| PR |
0.09 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Uses domain references, casual phrasing, |
2026-04-27 |
| PR |
0.08 |
n-to-1 kernel fusion via `KernelConfig` |
|
Uses domain-specific terms and terse bul |
2026-04-10 |
| PR |
0.07 |
chore(typing): added modeling_utils to ty |
|
Technical changelog, informal tone, and |
2026-04-14 |
| COMMIT |
0.05 |
Add image processors refactor to v5 migration guide (#45556) |
|
Brief, changelog-focused; no AI hallmark |
2026-04-28 |
| COMMIT |
0.05 |
[docs] modular transformers (#45327) |
|
Standard PR commit log, minimal free tex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] dtype (#45659) |
|
Short, lacks AI-generated phrasing or ex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cb memory management (#45587) |
|
Extremely brief chunked log, no AI phras |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cpu offloading (#45660) |
|
Minimal, no formal language or AI stylis |
2026-04-28 |
| COMMIT |
0.05 |
Fix `x_clip`: 8 failed test cases (#45394) |
|
Test fix, highly specific, fully normal |
2026-04-28 |
| COMMIT |
0.05 |
perf: avoid recomputing rotary_emb for each layer in some Go |
|
Commit messages are terse, technical, an |
2026-04-22 |
| COMMIT |
0.05 |
Gemma4 training with text-only samples (#45454) |
|
Brief, informal, and technical commit me |
2026-04-22 |
| COMMIT |
0.05 |
[nemotron_h] Add support for MLP mixers (#44763) |
|
Short, informal commit messages with dom |
2026-04-22 |
| COMMIT |
0.05 |
add expert parallelism for gemma-4-26B-A4B-it (#45279) |
|
Technical commit messages and Signed-off |
2026-04-22 |
| COMMIT |
0.05 |
Add full GGUF loading support for GPT‑OSS (fixes #43366, sup |
|
Technical, detail-oriented commit messag |
2026-04-22 |
| COMMIT |
0.05 |
Update Gemma4 weight conversion script (#45328) |
|
Technical commit messages, informal lang |
2026-04-22 |
| COMMIT |
0.05 |
fix table update versions (#45544) |
|
Very brief, technical, and informal comm |
2026-04-22 |
| COMMIT |
0.05 |
Add disable_mmap kwarg to from_pretrained with hf-mount auto |
|
Technical, template-driven changes, huma |
2026-04-22 |
| COMMIT |
0.05 |
fix(DSV3): parity between native `DeepseekV3MoE` and remote |
|
Technical description, terse phrasing, d |
2026-04-22 |
| PR |
0.05 |
[docs] modular transformers |
|
Casual tone and technical shorthand, con |
2026-04-08 |
| PR |
0.05 |
[CB] [Major] Add CPU request offloading |
|
Technical, concise, uses domain terms an |
2026-04-02 |
| PR |
0.05 |
Fix whisper return language |
|
Incomplete, typical accidental template |
2025-11-16 |
| PR |
0.05 |
Fix CB Accuracy Regression under FA2 |
|
Technical, informal, uses shorthand, not |
2026-04-07 |
| PR |
0.05 |
Fix Seq2SeqTrainer generation path for decoder-only models |
|
Domain-specific breakdown, list, and con |
2026-03-13 |
| PR |
0.05 |
[Trainer] Optimize LengthGroupedSampler computation with sel |
|
Concise technical content and domain ter |
2026-04-26 |
| PR |
0.03 |
[docs] dtype |
|
Brief, technical, and clear, typical of |
2026-04-27 |
| PR |
0.01 |
[docs] cpu offloading |
|
Extremely terse and specific, lacks AI h |
2026-04-27 |
| COMMIT |
0.00 |
change got reverted (#45680) |
|
Extremely terse, typical human revert me |
2026-04-28 |
| COMMIT |
0.00 |
fix padding side issue for fast_vlm tests (#45592) |
|
Terse, domain-specific, includes real na |
2026-04-28 |
| COMMIT |
0.00 |
zero_shot_object_detection ValueError fix for python 3.13 (# |
|
Very brief, precise, with clear domain r |
2026-04-28 |
| COMMIT |
0.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback ( |
|
Concise, technical commit message with d |
2026-04-28 |
| COMMIT |
0.00 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Short, informal message; human co-author |
2026-04-28 |
| COMMIT |
0.00 |
No serving in quality docker image (#45677) |
|
Brief commit message, human co-author, n |
2026-04-28 |
| COMMIT |
0.00 |
Laguna XS.2 implementation (#45673) |
|
Minimal title-only commit, no sign of AI |
2026-04-28 |
| COMMIT |
0.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
Structured changelog, domain-specific co |
2026-04-28 |
| COMMIT |
0.00 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Concise, code-centric, domain-specific c |
2026-04-27 |
| COMMIT |
0.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
Terse, technical, and human-typical with |
2026-04-27 |
| COMMIT |
0.00 |
chore(typing): added modeling_utils to ty (#45425) |
|
Informal, uses jargon and review summari |
2026-04-27 |
| COMMIT |
0.00 |
model: Add DEIMv2 to Transformers (#44339) |
|
Uses changelog format, dense with domain |
2026-04-27 |
| COMMIT |
0.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
Detailed description with human-like bug |
2026-04-27 |
| COMMIT |
0.00 |
[gemma4] infer from config instead of hardcoding (#45606) |
|
Informal, concise updates and normal cod |
2026-04-27 |
| COMMIT |
0.00 |
Update quants tests (#45480) |
|
Minimalist, lacks AI style, uses terse c |
2026-04-27 |
| COMMIT |
0.00 |
Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o |
|
Technical summary with rationale, inform |
2026-04-27 |
| COMMIT |
0.00 |
🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in |
|
Patch details, domain-specific explanati |
2026-04-27 |
| COMMIT |
0.00 |
Fix colmodernvbert tests (#45652) |
|
Very terse, informal, intentionally mini |
2026-04-27 |
| COMMIT |
0.00 |
[CB] [Major] Add CPU request offloading (#45184) |
|
Commit messages are terse, informal, and |
2026-04-27 |
| COMMIT |
0.00 |
Fix peft constructors (#45622) |
|
Very terse and non-formal commit, human |
2026-04-27 |
| COMMIT |
0.00 |
chore: speedup modular converter (~30%) (#45046) |
|
Highly technical, terse, and informal; n |
2026-04-27 |
| COMMIT |
0.00 |
Fix whisper return language (#42227) |
|
Technical commit log with co-author trai |
2026-04-27 |
| COMMIT |
0.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
Short, direct commit style typical of hu |
2026-04-27 |
| COMMIT |
0.00 |
Raise clear error for `problem_type="single_label_classifica |
|
Explanation is technical with domain det |
2026-04-24 |
| COMMIT |
0.00 |
CircleCI with torch 2.11 (#45633) |
|
Repetitive commit summary, highly typica |
2026-04-24 |
| COMMIT |
0.00 |
chore: bump doc-builder SHA for main doc build workflow (#45 |
|
Standard terse chore commit, no AI style |
2026-04-24 |
| COMMIT |
0.00 |
Allow more artifacts to be download in CI (#45629) |
|
Sparse, informal; lacks signature AI phr |
2026-04-24 |
| COMMIT |
0.00 |
chore(qa): split pipeline and add type checking (#45432) |
|
Contains abbreviations and minimal phras |
2026-04-24 |
| COMMIT |
0.00 |
Skip failing offloading tests (#45624) |
|
Commit messages are brief and telegraphi |
2026-04-24 |
| COMMIT |
0.00 |
generate: drop stale num_return_sequences warning on continu |
|
Technical justification, abbreviations, |
2026-04-24 |
| COMMIT |
0.00 |
Remove unnecessary generate warnings (#45619) |
|
Brief, imperative commit style, no AI si |
2026-04-24 |
| COMMIT |
0.00 |
fix: compute auxiliary losses when denoising is disabled in |
|
Commit uses terse, technical language an |
2026-04-23 |
| COMMIT |
0.00 |
qa: bumped mlinter and allow local override (#45585) |
|
Informal commit lines and explicit human |
2026-04-23 |
| COMMIT |
0.00 |
Processing Utils: continue when content is a string (#45605) |
|
Terse commit message; direct technical f |
2026-04-23 |
| COMMIT |
0.00 |
SonicMoe (#45433) |
|
Informal, iterative commit messages with |
2026-04-23 |
| COMMIT |
0.00 |
fix transformers + torchao nvfp4 serialization (#45573) |
|
Casual tone, human-specific comments, an |
2026-04-23 |
| COMMIT |
0.00 |
[AMD CI] Fix expectations for Gemma3n (#45602) |
|
Short, lower-case, informal change descr |
2026-04-23 |
| COMMIT |
0.00 |
[docs] multi-turn tool calling (#45554) |
|
Very terse, typical human commit without |
2026-04-23 |
| COMMIT |
0.00 |
Allow for registered experts from kernels hub (#45577) |
|
Iterative, collaborative commit history |
2026-04-23 |
| COMMIT |
0.00 |
[CB] Changes for long generation (#45530) |
|
Highly informal, domain-heavy and collab |
2026-04-23 |
| COMMIT |
0.00 |
Align latest model attention function dispatch (#45598) |
|
Extremely terse, lacks AI-writing hallma |
2026-04-23 |
| COMMIT |
0.00 |
Gemma3n and Gemma4 cannot use rotary kernel (#45564) |
|
Short, direct human style; contains tech |
2026-04-23 |
| COMMIT |
0.00 |
do not index past decoded chars with special tokens (#45435) |
|
Informal tone, domain-specific phrasing, |
2026-04-22 |
| COMMIT |
0.00 |
Update dev version (#45583) |
|
Brief, informal; lacks AI stylistic cues |
2026-04-22 |
| COMMIT |
0.00 |
Update torchao usage for XPU and CPU (#45560) |
|
Very terse, technical, humanlike style. |
2026-04-22 |
| COMMIT |
0.00 |
[docs] per-request sampling params (#45553) |
|
Minimal, informal; clear domain-specific |
2026-04-22 |
| COMMIT |
0.00 |
Add IndexCache support for GLM5 DSA (#45424) |
|
Casual tone, domain jargon, human abbrev |
2026-04-22 |
| COMMIT |
0.00 |
Fix redundant logic in video processing SmolVLM (#45272) |
|
Brief, casual style, domain-specific wor |
2026-04-22 |
| COMMIT |
0.00 |
Fix typos (#45574) |
|
Domain jargon, typos, terse tone; human |
2026-04-22 |
| COMMIT |
0.00 |
Updated the image cache for Paddle models according to the l |
|
Terse style, domain language, minimal ex |
2026-04-22 |
| COMMIT |
0.00 |
[Model] Add SLANet Model Support (#45532) |
|
Brief changelog, domain signals, informa |
2026-04-22 |
| COMMIT |
0.00 |
refactor(Dots1): drop Dots1MoE override to `pass` (inherits |
|
Domain jargon, concise, no AI signals de |
2026-04-22 |
| COMMIT |
0.00 |
Move some conversion mappings to PrefixChange (#45567) |
|
Extremely terse commit; no AI signals, h |
2026-04-22 |
| COMMIT |
0.00 |
Align gemma3n cache sharing to gemma4 (#45489) |
|
Terse, informal commit messages; lacks A |
2026-04-22 |
| COMMIT |
0.00 |
[modular] Fix modular logic broken in #45045 (#45539) |
|
Very brief, informal with typos; clearly |
2026-04-22 |
| COMMIT |
0.00 |
Fix: propagate quantization_config to text sub-config for co |
|
Commit message uses terse technical lang |
2026-04-21 |
| COMMIT |
0.00 |
T5Gemma2: fix `prepare_decoder_input_ids_from_labels` (#4551 |
|
Technical and succinct commit summary, n |
2026-04-21 |
| COMMIT |
0.00 |
[Trainer] Add ddp_static_graph option (#45519) |
|
Detailed, technical explanation; no AI m |
2026-04-21 |
| COMMIT |
0.00 |
Add dtype config options for Four Over Six (#45367) |
|
Technical message with co-author trailer |
2026-04-21 |
| COMMIT |
0.00 |
[Sam3LiteText] Remove unnecessary modules/configs (#45535) |
|
Brief, domain-specific commit messages, |
2026-04-21 |
| COMMIT |
0.00 |
Fix conditional check for float formatting (#44425) |
|
Minimal, technical statement; no AI sign |
2026-04-21 |
| COMMIT |
0.00 |
Fix AMD CI: rebuild torchvision with libjpeg + refresh expec |
|
Terse, technical, with human-typical sty |
2026-04-21 |
| COMMIT |
0.00 |
Fix EP: RouterParallel shape, tp_plan property, grouped_mm s |
|
Highly technical, includes code/jargon; |
2026-04-21 |
| COMMIT |
0.00 |
feat[vLLM × v5]: Add vLLM compatibility for audio models (#4 |
|
Consistent with human commit conventions |
2026-04-21 |
| COMMIT |
0.00 |
Reapply modular to examples (#45527) |
|
Short, domain-specific phrase, likely hu |
2026-04-21 |
| COMMIT |
0.00 |
qa: re-run modular converter when the script itself is modif |
|
Concise, domain-specific commit; informa |
2026-04-20 |
| COMMIT |
0.00 |
[GGUF] Reduce peak RAM usage by casting dequantized tensors |
|
Standard PR/commit with domain details a |
2026-04-20 |
| COMMIT |
0.00 |
Fix CSM `TextToAudioPipeline` missing `` token (#45525) |
|
Terse, human shorthand, dev email signat |
2026-04-20 |
| COMMIT |
0.00 |
Raise 400 on model mismatch when `transformers serve` is pin |
|
Brief, domain-specific terms, minimal ex |
2026-04-20 |
| COMMIT |
0.00 |
[serve] Update tool call to switch to `parse_response` (#454 |
|
Direct, brief language with dev collabor |
2026-04-20 |
| COMMIT |
0.00 |
[`Conversion Mapping`] Small fixups (#45483) |
|
Single-word commit messages, clearly hum |
2026-04-20 |
| COMMIT |
0.00 |
:rotating_light: [`Kernels`] Fix kernel function registratio |
|
Informal, terse, and domain-focused chan |
2026-04-20 |
| COMMIT |
0.00 |
Revert "Fix: modular image processors (#45492)" (#45531) |
|
Standard revert message auto-generated b |
2026-04-20 |
| COMMIT |
0.00 |
fix: return empty tuple from import_protobuf_decode_error wh |
|
Detailed technical bug explanation, info |
2026-04-20 |
| COMMIT |
0.00 |
throw error when conversion required (#45078) |
|
Multiple small, informal commit messages |
2026-04-20 |
| COMMIT |
0.00 |
chore: bump doc-builder SHA for PR upload workflow (#45450) |
|
Standard commit message format, no AI si |
2026-04-20 |
| COMMIT |
0.00 |
xpu output align with cuda in test case (#45526) |
|
Domain-specific terms, signed by a human |
2026-04-20 |
| PR |
0.00 |
Fix xdist collisions for captured_info artifacts and preserv |
|
Template used; free-text is technical an |
2026-04-25 |
| PR |
0.00 |
Add DeepSeek V4 |
|
Template; free-text is informal, include |
2026-04-25 |
| PR |
0.00 |
[Fix Phi4 test] Add option to override image_processor_auto_ |
|
Template; free-text is terse and technic |
2026-04-28 |
| PR |
0.00 |
Extended n-to-1 kernel fusion via `KernelConfig` |
|
Minimal content, only template and headi |
2026-04-27 |
| PR |
0.00 |
Add xcodec2 model |
|
Template; free-text has checklist, techn |
2026-02-20 |
| PR |
0.00 |
Add image processors refactor to v5 migration guide |
|
Template; free-text concise, includes hu |
2026-04-21 |
| PR |
0.00 |
[Model] Add PP-FormulaNet Model Support |
|
Jargon and informal tone; human style. |
2026-04-24 |
| PR |
0.00 |
change got reverted |
|
Brief, issue-focused, human style. |
2026-04-28 |
| PR |
0.00 |
fix 2 failed test cases for blt model on XPU |
|
Informal and terse; definitely human. |
2026-04-23 |
| PR |
0.00 |
fixing more typos |
|
Extremely terse, typical of humans. |
2026-04-28 |
| PR |
0.00 |
[docs] cb memory management |
|
Domain-specific wording, informal tone. |
2026-04-22 |
| PR |
0.00 |
Blockwise mask fn as opt arg in all masking functions |
|
— |
2026-04-16 |
| PR |
0.00 |
Fix EP + FSDP2: experts silently overwritten by rank-0 broad |
|
Technical jargon and abrupt ending; foll |
2026-04-27 |
| PR |
0.00 |
TP refactor for FSDP + TP integration |
|
Minimal, bullet point format, heavy on j |
2026-03-26 |
| PR |
0.00 |
fix(utils): Resolve backbone utils test regressions |
|
— |
2026-04-23 |
| PR |
0.00 |
Restore TokenizersBackend override for DeepSeek V3/R1 tokeni |
|
Direct reference to issue and technical |
2026-04-28 |
| PR |
0.00 |
[nit] glmasr should be in AutoModelForMultimodalLM |
|
Extremely terse and domain shorthand; cl |
2026-04-28 |
| PR |
0.00 |
No serving in quality docker image |
|
Colloquial, direct phrasing unlikely fro |
2026-04-28 |
| PR |
0.00 |
Add Granite 4.1 Vision (granite4_vision) |
|
— |
2026-04-23 |
| PR |
0.00 |
Processing Utils: honor pre-built sub-processor kwargs in fr |
|
— |
2026-04-24 |
| PR |
0.00 |
[CB] Refactor any model-related code in a separate class |
|
— |
2026-04-27 |
| PR |
0.00 |
[CB] Better overall script and decode bucketting |
|
— |
2026-04-27 |
| PR |
0.00 |
[docs] model testing |
|
Informal, concise, contributor-focused; |
2026-03-31 |
| PR |
0.00 |
🚨🚨🚨 [Trainer] Default to FSDP2, simplify API around fsdp + f |
|
— |
2026-04-24 |
| PR |
0.00 |
model: Add DEIMv2 to Transformers |
|
— |
2026-02-27 |
| PR |
0.00 |
[Don't merge] Call CI workflow |
|
— |
2026-04-16 |
| PR |
0.00 |
Whisper fix audio out of range |
|
— |
2024-07-03 |
| PR |
0.00 |
Fix NameError in serving CLI due to conditional import asymm |
|
— |
2026-04-24 |
| PR |
0.00 |
audio tester class |
|
— |
2026-04-13 |
| PR |
0.00 |
Config can apply pyndatic validation without torch-dependenc |
|
— |
2026-04-02 |
| PR |
0.00 |
🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in |
|
— |
2026-03-21 |
| PR |
0.00 |
Modular playground |
|
— |
2026-02-04 |
| PR |
0.00 |
[`auto_docstring`] needs to be only run on __doc__ |
|
— |
2026-03-27 |
| PR |
0.00 |
Fix colmodernvbert tests |
|
— |
2026-04-27 |