| COMMIT | 1.00 | Fix configuration reading and error handling for kernels (#4 | | Commit message contains explicit AI assi | 2026-04-23 |
| COMMIT | 1.00 | Fix `AttributeError` on `s_aux=None` in `flash_attention_for | | Commit message contains explicit AI assi | 2026-04-23 |
| COMMIT | 1.00 | Add expert parallelism (EP) config support for Qwen3 MoE (# | | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | [`Privacy Filter`] Add model (#45580) | | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add ForSequenceClassification heads for the OLMo family (#45 | | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add /v1/completions endpoint (OpenAI legacy completions API) | | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add Neuron to auto-compile hardware list (#44757) | | Commit message contains explicit AI assi | 2026-04-16 |
| PR | 1.00 | chore(typing): add ty type checking for 3 pipeline files | | PR body explicitly mentions AI collabora | 2026-04-27 |
| PR | 1.00 | Extract dynamic vision/audio tensors into standalone pure fu | | PR body explicitly mentions AI collabora | 2026-04-13 |
| PR | 1.00 | Fix pageable H2D copies in Gated DeltaNet PyTorch fallback | | PR body explicitly mentions AI collabora | 2026-04-27 |
| PR | 1.00 | Add Xiaomi MiMo-V2 | | PR body explicitly mentions AI collabora | 2026-03-31 |
| PR | 1.00 | Fix cross-attention cache layer type for T5Gemma2 long input | | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | Pass packed boundary metadata to Qwen3.5 linear-attention fa | | PR body explicitly mentions AI collabora | 2026-03-26 |
| PR | 1.00 | Add EXAONE 4.5 implementations | | PR body explicitly mentions AI collabora | 2026-04-16 |
| PR | 1.00 | Better Grouped GEMM + EP | | PR body explicitly mentions AI collabora | 2026-04-24 |
| PR | 1.00 | [Qwen3.5] Fix GDN linear attention multi-token cached forwar | | PR body explicitly mentions AI collabora | 2026-04-19 |
| PR | 1.00 | [MistralCommonBackend] Soften validation mode and apply_chat | | PR body explicitly mentions AI collabora | 2026-04-24 |
| PR | 1.00 | Add `supports_gradient_checkpointing` to `NemotronHPreTraine | | PR body explicitly mentions AI collabora | 2026-04-24 |
| PR | 1.00 | Add unified Cache-layer management for GLM-5 DSA Indexer key | | PR body explicitly mentions AI collabora | 2026-04-23 |
| PR | 1.00 | resize_token_embeddings does not effect to output_embeddings | | PR body explicitly mentions AI collabora | 2026-04-08 |
| PR | 1.00 | feat: Add GGUF loading support for Llama 4 (text) | | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | Raise clear error for `problem_type="single_label_classifica | | PR body explicitly mentions AI collabora | 2026-04-23 |
| PR | 1.00 | DeepGEMM BF16, isolation, refactor | | PR body explicitly mentions AI collabora | 2026-04-24 |
| PR | 1.00 | feat: make timesfm2_5 onnx export compatible | | PR body explicitly mentions AI collabora | 2026-04-04 |
| PR | 0.60 | [Weight Converter] More fine-grained mappings on classes, sc | | ChatGPT-like phrase 'This PR aims to' an | 2026-04-27 |
| PR | 0.50 | Doc translate to Persian(farsi) | | Very formal and explanatory, no clear AI | 2026-04-27 |
| PR | 0.20 | Fix KeyError for flash_attn in import_utils.py on Python 3.1 | | Small formatting issue, but includes a t | 2026-04-25 |
| PR | 0.20 | [docs] distributed training | | Some template text, but free-text edits | 2026-03-03 |
| PR | 0.20 | Add Multi-Token Prediction (MTP) support for Qwen3.5 | | Somewhat formal but technical and concis | 2026-04-24 |
| PR | 0.15 | Add Deepseek-OCR-2 model | | References paper and hub, informal headi | 2026-03-27 |
| PR | 0.15 | chore(qa): split pipeline and add type checking | | Technical with domain linter references, | 2026-04-14 |
| PR | 0.13 | Python code in model docs | | Uses casual batch update phrasing and te | 2026-04-23 |
| PR | 0.12 | fix(janus): Handle None values in image generation mode | | Direct technical description, specific b | 2026-03-17 |
| COMMIT | 0.10 | [Doc] Fix 'tokenized' -> 'tokenizer' typo in streamer docstr | | Polished but references real authors; cl | 2026-04-20 |
| PR | 0.10 | [gemma4] infer from config instead of hardcoding | | Terse and specific, minimal free text, i | 2026-04-23 |
| PR | 0.10 | Update quants tests | | Mentions domain issues, informal contrac | 2026-04-16 |
| PR | 0.10 | Fix the order of `cls.config` resolution | | Directly references an issue, concise hu | 2026-04-27 |
| PR | 0.10 | Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o | | Domain-specific, concise summary, normal | 2026-04-19 |
| PR | 0.10 | Add regression test for Gemma4 audio relative positional ran | | Brief, context-specific; uses normal eng | 2026-04-23 |
| PR | 0.10 | Fix peft constructors | | Concise, references discussion, typical | 2026-04-24 |
| PR | 0.10 | Add heterogeneous model support (per-layer config and modeli | | Technical, focused, somewhat terse—stand | 2026-04-09 |
| PR | 0.10 | Fix OOM regression for FSDP2 + cpu_ram_efficient_loading on | | Detailed, technical, context-specific, n | 2026-04-25 |
| PR | 0.10 | Fix SDPA inference tolerances for MPS backend | | Targeted, domain-heavy wording, no AI ha | 2026-04-25 |
| PR | 0.10 | Adding support for Nandi Models | | Concise, only the closing thanks/emoji i | 2026-03-29 |
| PR | 0.10 | chore: speedup modular converter (~30%) | | Short, technical, uses common collaborat | 2026-03-27 |
| PR | 0.10 | Add Multi-Token Prediction (MTP) support for Qwen3.5 | | Technical repeat of title, structure is | 2026-04-24 |
| PR | 0.10 | Gemma4: fix failed test cases | | Brief, domain-specific, and informal; la | 2026-04-22 |
| PR | 0.10 | Drop `content=None` from messages in `apply_chat_template` | | Bug explanation and code-specific refere | 2026-04-14 |
| PR | 0.10 | Add Sapiens model | | Technical, direct explanation with domai | 2024-08-28 |
| PR | 0.10 | chore: bump doc-builder SHA for main doc build workflow | | Technical changelog, uses domain referen | 2026-04-24 |
| PR | 0.10 | Qwen3 ASR and Forced Aligner | | Brief, domain-specific; template explain | 2026-02-08 |
| PR | 0.10 | Allow more artifacts to be download in CI | | Casual, direct explanation of CI reality | 2026-04-24 |
| PR | 0.10 | [nemotron_h] respect _no_reinit flag on dt_bias and out_proj | | Technical, specific bug description with | 2026-04-23 |
| PR | 0.10 | Skip failing offloading tests | | Terse, domain chat, informal reference t | 2026-04-24 |
| PR | 0.10 | generate: drop stale num_return_sequences warning on continu | | Casual, domain-specific with proper refe | 2026-04-22 |
| PR | 0.10 | [loading] Clean way to add/remove full parts in checkpoint n | | Terse summary, domain-specific context, | 2026-04-15 |
| PR | 0.10 | Fix TypeError in video_processor_class_from_name when torchv | | Domain vocab, concise summary, casual bu | 2026-04-24 |
| PR | 0.10 | Drop noisy generate warnings when do_sample=False (or num_be | | Domain jargon, casual summary and techni | 2026-04-22 |
| PR | 0.09 | Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode | | Uses domain references, casual phrasing, | 2026-04-27 |
| PR | 0.08 | n-to-1 kernel fusion via `KernelConfig` | | Uses domain-specific terms and terse bul | 2026-04-10 |
| PR | 0.07 | chore(typing): added modeling_utils to ty | | Technical changelog, informal tone, and | 2026-04-14 |
| COMMIT | 0.05 | perf: avoid recomputing rotary_emb for each layer in some Go | | Commit messages are terse, technical, an | 2026-04-22 |
| COMMIT | 0.05 | Gemma4 training with text-only samples (#45454) | | Brief, informal, and technical commit me | 2026-04-22 |
| COMMIT | 0.05 | [nemotron_h] Add support for MLP mixers (#44763) | | Short, informal commit messages with dom | 2026-04-22 |
| COMMIT | 0.05 | add expert parallelism for gemma-4-26B-A4B-it (#45279) | | Technical commit messages and Signed-off | 2026-04-22 |
| COMMIT | 0.05 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | | Technical, detail-oriented commit messag | 2026-04-22 |
| COMMIT | 0.05 | Update Gemma4 weight conversion script (#45328) | | Technical commit messages, informal lang | 2026-04-22 |
| COMMIT | 0.05 | fix table update versions (#45544) | | Very brief, technical, and informal comm | 2026-04-22 |
| COMMIT | 0.05 | Add disable_mmap kwarg to from_pretrained with hf-mount auto | | Technical, template-driven changes, huma | 2026-04-22 |
| COMMIT | 0.05 | fix(DSV3): parity between native `DeepseekV3MoE` and remote | | Technical description, terse phrasing, d | 2026-04-22 |
| PR | 0.05 | [docs] modular transformers | | Casual tone and technical shorthand, con | 2026-04-08 |
| PR | 0.05 | [CB] [Major] Add CPU request offloading | | Technical, concise, uses domain terms an | 2026-04-02 |
| PR | 0.05 | qa: more lazy loading | | Brief, issue reference, informal and dom | 2026-04-23 |
| PR | 0.05 | qa: speed up dtype regex weight load + reduce dtype tests to | | Terse list, direct technical language, n | 2026-04-24 |
| PR | 0.05 | Require input_ids for repetition penalty | | Technical phrasing, domain-specific, lac | 2026-04-13 |
| PR | 0.05 | Fix xdist collisions for captured_info artifacts and preserv | | Terse summary, specific references, huma | 2026-04-25 |
| PR | 0.05 | Fix whisper return language | | Incomplete, typical accidental template | 2025-11-16 |
| PR | 0.05 | Fix CB Accuracy Regression under FA2 | | Technical, informal, uses shorthand, not | 2026-04-07 |
| PR | 0.05 | Fix Seq2SeqTrainer generation path for decoder-only models | | Domain-specific breakdown, list, and con | 2026-03-13 |
| PR | 0.05 | [Trainer] Optimize LengthGroupedSampler computation with sel | | Concise technical content and domain ter | 2026-04-26 |
| PR | 0.05 | Add missing requests dependency to transformers[serving] | | Very terse, code-specific, informal tone | 2026-04-24 |
| PR | 0.03 | [docs] dtype | | Brief, technical, and clear, typical of | 2026-04-27 |
| PR | 0.03 | [Model] Add PP-FormulaNet Model Support | | Minimal content, casual reviewer tone, c | 2026-04-24 |
| PR | 0.01 | [docs] cpu offloading | | Extremely terse and specific, lacks AI h | 2026-04-27 |
| COMMIT | 0.00 | Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode | | Concise, code-centric, domain-specific c | 2026-04-27 |
| COMMIT | 0.00 | Fix cross-attention cache layer type for T5Gemma2 long input | | Terse, technical, and human-typical with | 2026-04-27 |
| COMMIT | 0.00 | chore(typing): added modeling_utils to ty (#45425) | | Informal, uses jargon and review summari | 2026-04-27 |
| COMMIT | 0.00 | model: Add DEIMv2 to Transformers (#44339) | | Uses changelog format, dense with domain | 2026-04-27 |
| COMMIT | 0.00 | [Qwen3.5] Fix GDN linear attention multi-token cached forwar | | Detailed description with human-like bug | 2026-04-27 |
| COMMIT | 0.00 | [gemma4] infer from config instead of hardcoding (#45606) | | Informal, concise updates and normal cod | 2026-04-27 |
| COMMIT | 0.00 | Update quants tests (#45480) | | Minimalist, lacks AI style, uses terse c | 2026-04-27 |
| COMMIT | 0.00 | Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o | | Technical summary with rationale, inform | 2026-04-27 |
| COMMIT | 0.00 | 🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in | | Patch details, domain-specific explanati | 2026-04-27 |
| COMMIT | 0.00 | Fix colmodernvbert tests (#45652) | | Very terse, informal, intentionally mini | 2026-04-27 |
| COMMIT | 0.00 | [CB] [Major] Add CPU request offloading (#45184) | | Commit messages are terse, informal, and | 2026-04-27 |
| COMMIT | 0.00 | Fix peft constructors (#45622) | | Very terse and non-formal commit, human | 2026-04-27 |
| COMMIT | 0.00 | chore: speedup modular converter (~30%) (#45046) | | Highly technical, terse, and informal; n | 2026-04-27 |
| COMMIT | 0.00 | Fix whisper return language (#42227) | | Technical commit log with co-author trai | 2026-04-27 |
| COMMIT | 0.00 | Add `supports_gradient_checkpointing` to `NemotronHPreTraine | | Short, direct commit style typical of hu | 2026-04-27 |
| COMMIT | 0.00 | Raise clear error for `problem_type="single_label_classifica | | Explanation is technical with domain det | 2026-04-24 |
| COMMIT | 0.00 | CircleCI with torch 2.11 (#45633) | | Repetitive commit summary, highly typica | 2026-04-24 |
| COMMIT | 0.00 | chore: bump doc-builder SHA for main doc build workflow (#45 | | Standard terse chore commit, no AI style | 2026-04-24 |
| COMMIT | 0.00 | Allow more artifacts to be download in CI (#45629) | | Sparse, informal; lacks signature AI phr | 2026-04-24 |
| COMMIT | 0.00 | chore(qa): split pipeline and add type checking (#45432) | | Contains abbreviations and minimal phras | 2026-04-24 |
| COMMIT | 0.00 | Skip failing offloading tests (#45624) | | Commit messages are brief and telegraphi | 2026-04-24 |
| COMMIT | 0.00 | generate: drop stale num_return_sequences warning on continu | | Technical justification, abbreviations, | 2026-04-24 |
| COMMIT | 0.00 | Remove unnecessary generate warnings (#45619) | | Brief, imperative commit style, no AI si | 2026-04-24 |
| COMMIT | 0.00 | fix: compute auxiliary losses when denoising is disabled in | | Commit uses terse, technical language an | 2026-04-23 |
| COMMIT | 0.00 | qa: bumped mlinter and allow local override (#45585) | | Informal commit lines and explicit human | 2026-04-23 |
| COMMIT | 0.00 | Processing Utils: continue when content is a string (#45605) | | Terse commit message; direct technical f | 2026-04-23 |
| COMMIT | 0.00 | SonicMoe (#45433) | | Informal, iterative commit messages with | 2026-04-23 |
| COMMIT | 0.00 | fix transformers + torchao nvfp4 serialization (#45573) | | Casual tone, human-specific comments, an | 2026-04-23 |
| COMMIT | 0.00 | [AMD CI] Fix expectations for Gemma3n (#45602) | | Short, lower-case, informal change descr | 2026-04-23 |
| COMMIT | 0.00 | [docs] multi-turn tool calling (#45554) | | Very terse, typical human commit without | 2026-04-23 |
| COMMIT | 0.00 | Allow for registered experts from kernels hub (#45577) | | Iterative, collaborative commit history | 2026-04-23 |
| COMMIT | 0.00 | [CB] Changes for long generation (#45530) | | Highly informal, domain-heavy and collab | 2026-04-23 |
| COMMIT | 0.00 | Align latest model attention function dispatch (#45598) | | Extremely terse, lacks AI-writing hallma | 2026-04-23 |
| COMMIT | 0.00 | Gemma3n and Gemma4 cannot use rotary kernel (#45564) | | Short, direct human style; contains tech | 2026-04-23 |
| COMMIT | 0.00 | do not index past decoded chars with special tokens (#45435) | | Informal tone, domain-specific phrasing, | 2026-04-22 |
| COMMIT | 0.00 | Update dev version (#45583) | | Brief, informal; lacks AI stylistic cues | 2026-04-22 |
| COMMIT | 0.00 | Update torchao usage for XPU and CPU (#45560) | | Very terse, technical, humanlike style. | 2026-04-22 |
| COMMIT | 0.00 | [docs] per-request sampling params (#45553) | | Minimal, informal; clear domain-specific | 2026-04-22 |
| COMMIT | 0.00 | Add IndexCache support for GLM5 DSA (#45424) | | Casual tone, domain jargon, human abbrev | 2026-04-22 |
| COMMIT | 0.00 | Fix redundant logic in video processing SmolVLM (#45272) | | Brief, casual style, domain-specific wor | 2026-04-22 |
| COMMIT | 0.00 | Fix typos (#45574) | | Domain jargon, typos, terse tone; human | 2026-04-22 |
| COMMIT | 0.00 | Updated the image cache for Paddle models according to the l | | Terse style, domain language, minimal ex | 2026-04-22 |
| COMMIT | 0.00 | [Model] Add SLANet Model Support (#45532) | | Brief changelog, domain signals, informa | 2026-04-22 |
| COMMIT | 0.00 | refactor(Dots1): drop Dots1MoE override to `pass` (inherits | | Domain jargon, concise, no AI signals de | 2026-04-22 |
| COMMIT | 0.00 | Move some conversion mappings to PrefixChange (#45567) | | Extremely terse commit; no AI signals, h | 2026-04-22 |
| COMMIT | 0.00 | Align gemma3n cache sharing to gemma4 (#45489) | | Terse, informal commit messages; lacks A | 2026-04-22 |
| COMMIT | 0.00 | [modular] Fix modular logic broken in #45045 (#45539) | | Very brief, informal with typos; clearly | 2026-04-22 |
| COMMIT | 0.00 | Fix: propagate quantization_config to text sub-config for co | | Commit message uses terse technical lang | 2026-04-21 |
| COMMIT | 0.00 | T5Gemma2: fix `prepare_decoder_input_ids_from_labels` (#4551 | | Technical and succinct commit summary, n | 2026-04-21 |
| COMMIT | 0.00 | [Trainer] Add ddp_static_graph option (#45519) | | Detailed, technical explanation; no AI m | 2026-04-21 |
| COMMIT | 0.00 | Add dtype config options for Four Over Six (#45367) | | Technical message with co-author trailer | 2026-04-21 |
| COMMIT | 0.00 | [Sam3LiteText] Remove unnecessary modules/configs (#45535) | | Brief, domain-specific commit messages, | 2026-04-21 |
| COMMIT | 0.00 | Fix conditional check for float formatting (#44425) | | Minimal, technical statement; no AI sign | 2026-04-21 |
| COMMIT | 0.00 | Fix AMD CI: rebuild torchvision with libjpeg + refresh expec | | Terse, technical, with human-typical sty | 2026-04-21 |
| COMMIT | 0.00 | Fix EP: RouterParallel shape, tp_plan property, grouped_mm s | | Highly technical, includes code/jargon; | 2026-04-21 |
| COMMIT | 0.00 | feat[vLLM × v5]: Add vLLM compatibility for audio models (#4 | | Consistent with human commit conventions | 2026-04-21 |
| COMMIT | 0.00 | Reapply modular to examples (#45527) | | Short, domain-specific phrase, likely hu | 2026-04-21 |
| COMMIT | 0.00 | qa: re-run modular converter when the script itself is modif | | Concise, domain-specific commit; informa | 2026-04-20 |
| COMMIT | 0.00 | [GGUF] Reduce peak RAM usage by casting dequantized tensors | | Standard PR/commit with domain details a | 2026-04-20 |
| COMMIT | 0.00 | Fix CSM `TextToAudioPipeline` missing `` token (#45525) | | Terse, human shorthand, dev email signat | 2026-04-20 |
| COMMIT | 0.00 | Raise 400 on model mismatch when `transformers serve` is pin | | Brief, domain-specific terms, minimal ex | 2026-04-20 |
| COMMIT | 0.00 | [serve] Update tool call to switch to `parse_response` (#454 | | Direct, brief language with dev collabor | 2026-04-20 |
| COMMIT | 0.00 | [`Conversion Mapping`] Small fixups (#45483) | | Single-word commit messages, clearly hum | 2026-04-20 |
| COMMIT | 0.00 | :rotating_light: [`Kernels`] Fix kernel function registratio | | Informal, terse, and domain-focused chan | 2026-04-20 |
| COMMIT | 0.00 | Revert "Fix: modular image processors (#45492)" (#45531) | | Standard revert message auto-generated b | 2026-04-20 |
| COMMIT | 0.00 | fix: return empty tuple from import_protobuf_decode_error wh | | Detailed technical bug explanation, info | 2026-04-20 |
| COMMIT | 0.00 | throw error when conversion required (#45078) | | Multiple small, informal commit messages | 2026-04-20 |
| COMMIT | 0.00 | chore: bump doc-builder SHA for PR upload workflow (#45450) | | Standard commit message format, no AI si | 2026-04-20 |
| COMMIT | 0.00 | xpu output align with cuda in test case (#45526) | | Domain-specific terms, signed by a human | 2026-04-20 |
| COMMIT | 0.00 | chore(qa): split out mlinter (#45475) | | Terse, informative, and domain-specific | 2026-04-20 |
| COMMIT | 0.00 | Fix: modular image processors (#45492) | | Single word message, clearly human brevi | 2026-04-20 |
| COMMIT | 0.00 | [loading] Clean way to add/remove full parts in checkpoint n | | Terse, includes typos and informal comme | 2026-04-20 |
| COMMIT | 0.00 | Fix Zamba2MambaMixer ignoring use_mamba_kernels=False (#4485 | | Terse commits, edits, and references; no | 2026-04-20 |
| COMMIT | 0.00 | revert sha commit pointing to main for transformers_amd_ci_ | | Brief, technical, terse revert commit ty | 2026-04-17 |
| COMMIT | 0.00 | Fix ZeRO-3 from_pretrained: load registered buffers in _load | | Technical language, specific errors, dir | 2026-04-17 |
| COMMIT | 0.00 | Remove redundant condition checks in `get_image_size` method | | Terse bullet points, technical, informal | 2026-04-17 |
| COMMIT | 0.00 | add Qianfan-OCR model definition (#45280) | | Informal, iterative fix messages, domain | 2026-04-17 |
| COMMIT | 0.00 | Add check-auto in repo-consistency and fix sorting (#45481) | | Colloquial wording, speculative ('maybe? | 2026-04-17 |
| COMMIT | 0.00 | Fix typos in src/transformers/utils/output_capturing.py (#45 | | Concise typo fix, no free-text, purely a | 2026-04-17 |
| COMMIT | 0.00 | typing: rule 15 - checks for tie_word_embeddings presence (# | | Brief update notes, technical context, c | 2026-04-17 |
| COMMIT | 0.00 | [CB] Fix capture of max_seqlen (#45323) | | Informal commit titles, multiple granula | 2026-04-17 |
| COMMIT | 0.00 | Fix response api support (#45463) | | Commit messages are terse, informal, sho | 2026-04-16 |
| COMMIT | 0.00 | Minor update (#45484) | | Minimal human-written commit; contains C | 2026-04-16 |
| PR | 0.00 | Fix EP + FSDP2: experts silently overwritten by rank-0 broad | | Technical jargon and abrupt ending; foll | 2026-04-27 |
| PR | 0.00 | Extended n-to-1 kernel fusion via `KernelConfig` | | Minimal content, only template and headi | 2026-04-27 |
| PR | 0.00 | [docs] model testing | | Informal, concise, contributor-focused; | 2026-03-31 |
| PR | 0.00 | Add DeepSeek V4 | | — | 2026-04-25 |
| PR | 0.00 | 🚨🚨🚨 [Trainer] Default to FSDP2, simplify API around fsdp + f | | — | 2026-04-24 |
| PR | 0.00 | Modularize `ProcessorMixin` into smaller components | | — | 2026-04-17 |
| PR | 0.00 | model: Add DEIMv2 to Transformers | | — | 2026-02-27 |
| PR | 0.00 | [Don't merge] Call CI workflow | | — | 2026-04-16 |
| PR | 0.00 | Blockwise mask fn as opt arg in all masking functions | | — | 2026-04-16 |
| PR | 0.00 | Fix trust_remote_code local cache collisions for local model | | — | 2026-04-24 |
| PR | 0.00 | Whisper fix audio out of range | | — | 2024-07-03 |
| PR | 0.00 | Processing Utils: honor pre-built sub-processor kwargs in fr | | — | 2026-04-24 |
| PR | 0.00 | fix(utils): Resolve backbone utils test regressions | | — | 2026-04-23 |
| PR | 0.00 | Fix NameError in serving CLI due to conditional import asymm | | — | 2026-04-24 |
| PR | 0.00 | Add Granite 4.1 Vision (granite4_vision) | | — | 2026-04-23 |
| PR | 0.00 | [CB] Refactor any model-related code in a separate class | | — | 2026-04-27 |
| PR | 0.00 | [CB] Better overall script and decode bucketting | | — | 2026-04-27 |
| PR | 0.00 | audio tester class | | — | 2026-04-13 |
| PR | 0.00 | Config can apply pyndatic validation without torch-dependenc | | — | 2026-04-02 |
| PR | 0.00 | 🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in | | — | 2026-03-21 |
| PR | 0.00 | Modular playground | | — | 2026-02-04 |
| PR | 0.00 | [`auto_docstring`] needs to be only run on __doc__ | | — | 2026-03-27 |
| PR | 0.00 | Fix colmodernvbert tests | | — | 2026-04-27 |
| PR | 0.00 | Remove attribute_map from GptOssConfig | | Contains technical details and links, in | 2026-04-22 |
| PR | 0.00 | Add AudioFlamingoNext model | | Direct and brief technical explanation, | 2026-03-18 |
| PR | 0.00 | Add DeepSeek V4 | | Only gives a redirect; no evidence of AI | 2026-04-24 |
| PR | 0.00 | [New Model] Add MiniCPM3 support | | Concise, technical phrasing; minimal det | 2026-04-23 |
| PR | 0.00 | Add new model: Kimi2-6 | | Extremely terse and minimal content; hum | 2026-04-24 |
| PR | 0.00 | Make patched testing debug logs xdist-safe | | Specific jargon, technical structure; no | 2026-04-24 |
| PR | 0.00 | CircleCI with torch 2.11 | | Short, casual, minor typo; natural human | 2026-04-24 |
| PR | 0.00 | Agent first cli with skill | | Very terse title, no formal AI hallmarks | 2026-04-23 |
| PR | 0.00 | Remove unnecessary generate warnings | | Very informal tone, domain references, a | 2026-04-24 |
| PR | 0.00 | Glm5 change | | Minimal description with informal, non-A | 2026-04-24 |