| COMMIT |
1.00 |
Fix WeightConverter regex incorrectly matching shared_expert |
|
Commit message contains explicit AI assi |
2026-05-06 |
| COMMIT |
1.00 |
Add Granite 4.1 Vision (granite4_vision) (#45597) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Unwrap `text_config` in `AutoModelFor*.from_config` (#45770) |
|
Commit message contains explicit AI assi |
2026-05-05 |
| COMMIT |
1.00 |
Add EXAONE 4.5 implementations (#45471) |
|
Commit message contains explicit AI assi |
2026-05-04 |
| COMMIT |
1.00 |
Add DeepSeek V4 (#45643) |
|
Commit message contains explicit AI assi |
2026-05-02 |
| COMMIT |
1.00 |
Support for a new Granite-Speech-Plus model (#45695) |
|
Commit message contains explicit AI assi |
2026-04-29 |
| COMMIT |
1.00 |
fixing more typos (#45689) |
|
Commit message contains explicit AI assi |
2026-04-28 |
| COMMIT |
1.00 |
Fix configuration reading and error handling for kernels (#4 |
|
Commit message contains explicit AI assi |
2026-04-23 |
| PR |
1.00 |
Extract dynamic vision/audio tensors into standalone pure fu |
|
PR body explicitly mentions AI collabora |
2026-04-13 |
| PR |
1.00 |
torch.backends.fp32_precision cascade conv/rnn so removing t |
|
PR body explicitly mentions AI collabora |
2026-05-04 |
| PR |
1.00 |
Cache `merged_typed_dict` to not break `validate_typed_dict` |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
WIP: Add support for Granite4VisionForConditionalGeneration |
|
PR body explicitly mentions AI collabora |
2026-04-09 |
| PR |
1.00 |
fix: use per-instance lru_cache in compile_compatible_method |
|
PR body explicitly mentions AI collabora |
2026-05-07 |
| PR |
1.00 |
fix: route Granite models to TokenizersBackend to preserve t |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
Add pretrained model-family distributed smoke matrix |
|
PR body explicitly mentions AI collabora |
2026-05-06 |
| PR |
1.00 |
fix: ModuleNotFoundError caused by distributed race conditio |
|
PR body explicitly mentions AI collabora |
2026-05-05 |
| PR |
1.00 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
PR body explicitly mentions AI collabora |
2026-03-26 |
| PR |
1.00 |
DeepGEMM BF16 + mixed FP8/FP4 + MegaMoE + refactor |
|
PR body explicitly mentions AI collabora |
2026-04-24 |
| PR |
1.00 |
Add Xiaomi MiMo-V2 |
|
PR body explicitly mentions AI collabora |
2026-03-31 |
| PR |
1.00 |
fix: correct spelling in continuous_api docstring |
|
PR body explicitly mentions AI collabora |
2026-05-03 |
| PR |
1.00 |
Fix link to modular transformers documentation |
|
PR body explicitly mentions AI collabora |
2026-05-02 |
| PR |
1.00 |
First model |
|
PR body explicitly mentions AI collabora |
2026-05-05 |
| PR |
1.00 |
docstring |
|
PR body explicitly mentions AI collabora |
2026-05-05 |
| PR |
1.00 |
fix: Added Mps support in float fallback backends list |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
1.00 |
Fix split batch size |
|
PR body explicitly mentions AI collabora |
2026-05-02 |
| PR |
1.00 |
.. |
|
PR body explicitly mentions AI collabora |
2026-05-05 |
| PR |
1.00 |
Exclude audio modules from conversion process |
|
PR body explicitly mentions AI collabora |
2026-04-28 |
| PR |
0.60 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Uses AI-typical phrasing like 'This PR a |
2026-04-27 |
| PR |
0.20 |
[codex] save codebase deep dive audio progress |
|
Concise changelog, slight formality but |
2026-05-05 |
| PR |
0.15 |
Improve explanation of pipelines for beginners |
|
Direct, minimal, clearly aimed at beginn |
2026-05-06 |
| PR |
0.15 |
Add RF-DETR |
|
List of tasks, domain-specific name, mod |
2025-03-21 |
| PR |
0.15 |
Fix UnboundLocalError for `is_updated` in encoder-decoder cr |
|
Technical bugfix language, mentions exce |
2026-05-04 |
| PR |
0.13 |
add HyperClovaX Vision |
|
Opening greeting, but technical context |
2026-02-27 |
| COMMIT |
0.10 |
[nemotron_h] respect _no_reinit flag on dt_bias and out_proj |
|
Commit contains domain context and code |
2026-05-01 |
| COMMIT |
0.10 |
Doc translate to Persian(farsi) (#45664) |
|
Slightly formal in part, but domain-spec |
2026-04-30 |
| COMMIT |
0.10 |
docs(README_zh-hans): clarify conditions for not using Trans |
|
Somewhat formal wording but still within |
2026-04-28 |
| PR |
0.10 |
[Fuyu] Remove FuyuBatchFeature subclass, use BatchFeature wi |
|
Mentions issue and contributor, concise |
2026-05-06 |
| PR |
0.10 |
feat: add bf16_loss training argument for VRAM-efficient QLo |
|
Domain-specific jargon and concise techn |
2026-05-03 |
| PR |
0.10 |
[CB] [Major] Add tensor paralellism |
|
Uses TP jargon and terse descriptions, n |
2026-05-07 |
| PR |
0.10 |
Get rid of deprecated use_return_dict call. |
|
Casual tone and specific error context, |
2026-05-07 |
| PR |
0.10 |
Fix shared config mutation issue in flash_attn_from_config |
|
Technical bug description and domain jar |
2026-04-28 |
| PR |
0.10 |
Modular playground |
|
Domain details, concise listing, lacks A |
2026-02-04 |
| PR |
0.10 |
Fix "AttributeError: NewTokenizer has no attribute special_a |
|
Technical problem explanation, domain ab |
2026-04-07 |
| PR |
0.10 |
Restore TokenizersBackend override for DeepSeek V3/R1 tokeni |
|
Direct technical problem, jargon, no exc |
2026-04-28 |
| PR |
0.10 |
Fix `torch.compile` graph breaks in Qwen2.5-Omni vision enco |
|
Technical references, concise error expl |
2026-05-07 |
| PR |
0.10 |
Add V-JEPA 2.1 inference support |
|
Technical, concise free text; domain ter |
2026-04-17 |
| PR |
0.10 |
Add FP8 kernel acceleration for compressed-tensors quantized |
|
Brief technical content and informal sty |
2026-04-29 |
| PR |
0.10 |
Keep deleting |
|
Brief, informal; mirrors human patch not |
2026-05-06 |
| PR |
0.10 |
Gate enable_gqa=True on actual flash-attention eligibility |
|
Presents issue/PR context with informal |
2026-05-04 |
| PR |
0.10 |
[generation] Encode multimodal data only once |
|
Technical, direct, includes concrete imp |
2026-05-05 |
| PR |
0.10 |
[CB] Refactor any model-related code in a separate class |
|
Concise, domain-specific, refactoring me |
2026-04-27 |
| PR |
0.10 |
Add Granite 4.1 Vision (granite4_vision) |
|
Product announcement tone, but domain-ri |
2026-04-23 |
| PR |
0.10 |
No agent PR descriptions |
|
Informal tone and incomplete sentences i |
2026-05-05 |
| PR |
0.10 |
fix: align attention_mask padding with appended eos token in |
|
Technical, terse summary and domain deta |
2026-05-03 |
| PR |
0.10 |
fix attribute access in PermuteForRope._apply |
|
Describes a specific bug with concise, d |
2026-05-03 |
| PR |
0.10 |
fix: return correct forward output in AriaTextForCausalLM |
|
Terse, technical, directly references mo |
2026-05-03 |
| PR |
0.10 |
Optimize Qwen3 RoPE: precompute cos/sin cache for static rop |
|
Technical content, direct reference to m |
2026-05-02 |
| PR |
0.10 |
Fix IndexError in sdpa_mask and flex_attention_mask for 0D t |
|
References a specific issue, clear techn |
2026-05-02 |
| PR |
0.10 |
fix: remove upper bound on tokenizers version constraint |
|
Specific dependency update with technica |
2026-05-05 |
| PR |
0.08 |
Add new model: Kimi2-6 |
|
Casual tone and domain-specific context |
2026-04-24 |
| PR |
0.07 |
Extended n-to-1 kernel fusion via `KernelConfig` |
|
Detailed technical explanation, informal |
2026-04-27 |
| PR |
0.06 |
Fix EP + DeepSpeed ZeRO-3 loading via accelerate launch |
|
Domain abbreviations and concise summary |
2026-04-21 |
| PR |
0.06 |
[CI] Replace PAT with GitHub App token in repo-consistency-b |
|
Technical details, domain abbreviations, |
2026-05-07 |
| COMMIT |
0.05 |
Add image processors refactor to v5 migration guide (#45556) |
|
Brief, changelog-focused; no AI hallmark |
2026-04-28 |
| COMMIT |
0.05 |
[docs] modular transformers (#45327) |
|
Standard PR commit log, minimal free tex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] dtype (#45659) |
|
Short, lacks AI-generated phrasing or ex |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cb memory management (#45587) |
|
Extremely brief chunked log, no AI phras |
2026-04-28 |
| COMMIT |
0.05 |
[docs] cpu offloading (#45660) |
|
Minimal, no formal language or AI stylis |
2026-04-28 |
| COMMIT |
0.05 |
Fix `x_clip`: 8 failed test cases (#45394) |
|
Test fix, highly specific, fully normal |
2026-04-28 |
| PR |
0.05 |
Drop `content=None` from messages in `apply_chat_template` |
|
Uses domain jargon and concise phrasing |
2026-04-14 |
| PR |
0.05 |
Qwen3 ASR and Forced Aligner |
|
Technical, checklist use, and terse doma |
2026-02-08 |
| PR |
0.05 |
feat: add crop() to StaticCache layers for assisted generati |
|
Technical language, domain-specific term |
2026-05-02 |
| PR |
0.05 |
[Weight converter] Revert unnecessary changes to `rename_sou |
|
Uses domain vernacular, concise explanat |
2026-05-07 |
| PR |
0.05 |
Fix `kernelize()` crash for gpt_oss: missing `@use_kernel_fu |
|
Technical, concise; uses PR numbers and |
2026-05-06 |
| PR |
0.05 |
fix(bitsandbytes): implement reverse_op for Bnb4bitDeseriali |
|
Technical subject, domain abbreviations, |
2026-05-02 |
| PR |
0.05 |
Fix import error in moe.py by providing explicit schema to c |
|
Detailed technical explanation; no AI ph |
2026-05-06 |
| PR |
0.05 |
gpt_oss multi-GPU AMD support |
|
Technical, uses domain vocab (multi-GPU, |
2026-04-30 |
| PR |
0.05 |
fix(qianfan_ocr): add XPU expectations |
|
Brief, informal, review request; clear h |
2026-04-24 |
| PR |
0.05 |
TP refactor for FSDP + TP integration |
|
Bullet point unfinished TODOs; informal |
2026-03-26 |
| PR |
0.05 |
refector: renamed file glob to cache to make it clearer |
|
Very concise file rename rationale, matc |
2026-04-30 |
| PR |
0.05 |
Fix decorator order |
|
Brief, informal; domain abbreviations su |
2026-05-06 |
| PR |
0.05 |
Fix WeightConverter regex incorrectly matching shared_expert |
|
Technical, concise explanation typical o |
2026-05-05 |
| PR |
0.05 |
Modularize `ProcessorMixin` into smaller components |
|
Technical, concise; domain-specific abbr |
2026-04-17 |
| PR |
0.05 |
[`Granite 4.1 Vision`] Fixup integration tests |
|
Informal tone; mentions rush and user ha |
2026-05-06 |
| PR |
0.05 |
Add Molmo2 |
|
Incomplete; starts technical, lacks AI s |
2026-01-23 |
| PR |
0.05 |
audio tester class |
|
Informal, project-specific references an |
2026-04-13 |
| PR |
0.05 |
Fix IndexError on 0-d tensor in sdpa_mask/flex_attention_mas |
|
Technical fix, precise language; domain |
2026-05-06 |
| PR |
0.05 |
TST Run fast PEFT tests in normal CI |
|
Informal, uses domain context, no AI hal |
2026-04-28 |
| PR |
0.04 |
[docs] contributing |
|
Brief, technical, and uses domain terms |
2026-04-15 |
| PR |
0.04 |
trainer: clear MPS graph cache after each optimizer step (py |
|
Brief technical summary, domain context, |
2026-05-07 |
| PR |
0.04 |
[CB] Fixes for SDPA and CPU offloading |
|
Includes domain jargon, terse explanatio |
2026-05-01 |
| PR |
0.04 |
Fix model parallel bugs for Gemma4 |
|
Terce, technical, includes error message |
2026-05-07 |
| PR |
0.03 |
Fix gemma4 with multi-gpu setup |
|
Terse, technical style: clear human sign |
2026-05-07 |
| PR |
0.03 |
fix: raise ValueError when num_beams × vocab_size exceeds to |
|
Uses code error details and direct expla |
2026-05-07 |
| PR |
0.03 |
fix(rf_detr): correct paper URL and stale checkpoint referen |
|
Brief, specific, uses standard commit ph |
2026-05-07 |
| PR |
0.03 |
Fix added-token prefix space in SentencePiece fast tokenizer |
|
Clear technical focus, issue referencing |
2026-05-07 |
| PR |
0.03 |
fix: correct typo 'seperate' -> 'separate' in comments acros |
|
Fix description, includes typo correctio |
2026-05-07 |
| PR |
0.02 |
Delete dead code in qwen-vl series |
|
Informal tone ('Dont review yet!'), doma |
2026-05-07 |
| COMMIT |
0.00 |
Fix gemma4 with multi-gpu setup (#45826) |
|
Brief and informal; no AI hallmarks pres |
2026-05-07 |
| COMMIT |
0.00 |
Add HyperCLOVAX SEED Think 14B (#44956) |
|
Commit trailers but no AI signals; conci |
2026-05-07 |
| COMMIT |
0.00 |
Fix `kernelize()` crash for gpt_oss: missing `@use_kernel_fu |
|
Technical fixes, domain jargon, no AI ph |
2026-05-07 |
| COMMIT |
0.00 |
Cache `merged_typed_dict` to not break `validate_typed_dict` |
|
Direct changelog with domain terms; lack |
2026-05-07 |
| COMMIT |
0.00 |
Get rid of deprecated use_return_dict call. (#45815) |
|
Minimal, terse, human-written style, no |
2026-05-07 |
| COMMIT |
0.00 |
fix(qianfan_ocr): add XPU expectations (#45615) |
|
Technical listing; informal, lacks AI te |
2026-05-07 |
| COMMIT |
0.00 |
Fix shared config mutation issue in flash_attn_from_config ( |
|
Domain-specific, concise, signed-off; no |
2026-05-07 |
| COMMIT |
0.00 |
Add RF-DETR (#36895) |
|
Changelog is technical and terse, lacks |
2026-05-07 |
| COMMIT |
0.00 |
[Weight Converter] More fine-grained mappings on classes, sc |
|
Highly technical, multiple fixes, domain |
2026-05-07 |
| COMMIT |
0.00 |
refector: renamed file glob to cache to make it clearer (#45 |
|
Brief, informal commit message indicates |
2026-05-06 |
| COMMIT |
0.00 |
Fix decorator order (#45806) |
|
Very terse, sometimes typoed commit mess |
2026-05-06 |
| COMMIT |
0.00 |
[`Granite 4.1 Vision`] Fixup integration tests (#45805) |
|
Terse commit messages, human abbreviatio |
2026-05-06 |
| COMMIT |
0.00 |
fix: validate special token ids against attribute values (#4 |
|
Technical, concise; shows informal engin |
2026-05-06 |
| COMMIT |
0.00 |
Blockwise mask fn as opt arg in all masking functions (#4547 |
|
Highly informal, fragmented notes; no si |
2026-05-06 |
| COMMIT |
0.00 |
[CB] Refactor any model-related code in a separate class (#4 |
|
Uses domain jargon, informal corrections |
2026-05-06 |
| COMMIT |
0.00 |
fix: forward use_cache kwarg to attention mixer in nemotron_ |
|
Brief technical commit messages with dom |
2026-05-05 |
| COMMIT |
0.00 |
fix: correct spelling in continuous_api docstring (#45749) |
|
Minimal, terse technical correction; typ |
2026-05-05 |
| COMMIT |
0.00 |
Fix link to modular transformers documentation (#45746) |
|
Direct update explanation; clear, concis |
2026-05-05 |
| COMMIT |
0.00 |
Gemma4: fix failed test cases (#45568) |
|
List of technical fixes and updates, sig |
2026-05-05 |
| COMMIT |
0.00 |
First model (#45788) |
|
Casual tone and specific jargon; reflect |
2026-05-05 |
| COMMIT |
0.00 |
Fix CI: Allow more artifacts to be download in CI (#45785) |
|
Debug note and domain-specific context, |
2026-05-05 |
| COMMIT |
0.00 |
Add `concurrency` to `PR CI` workflow file (`pr-ci-caller.ym |
|
Single word commit, terse; standard huma |
2026-05-05 |
| COMMIT |
0.00 |
Reorder decorators for autodoc and dataclass (#45702) |
|
Short, technical log phrases typical of |
2026-05-05 |
| COMMIT |
0.00 |
deepseek r1 distilled tokenizer fix for qwen2 mapping (#4574 |
|
Typo present, informal tone; clear human |
2026-05-05 |
| COMMIT |
0.00 |
fix: Added Mps support in float fallback backends list (#45 |
|
Structured code fixes, informal language |
2026-05-05 |
| COMMIT |
0.00 |
Github Actions PR CI (caller) (#45476) |
|
Commit message is terse and includes dom |
2026-05-04 |
| COMMIT |
0.00 |
make sure we call check_auto in CI (#45775) |
|
Informal tone; domain-specific and conci |
2026-05-04 |
| COMMIT |
0.00 |
Better Grouped GEMM + EP (#45621) |
|
Commit log is informal, terse, and techn |
2026-05-04 |
| COMMIT |
0.00 |
DeepSeek OCR specifies an incorrect tokenizer class on the H |
|
Human-written; issue title is concise an |
2026-05-04 |
| COMMIT |
0.00 |
Fix auto mapping script (#45774) |
|
Extremely terse and informal commit mess |
2026-05-04 |
| COMMIT |
0.00 |
PythonBackend slow tokenizer convert_ids_to_tokens fix (#457 |
|
Commit message is terse and lacks AI-sty |
2026-05-04 |
| COMMIT |
0.00 |
[MINISTRAL3] Fix conversion script yarn's apply_scale suppor |
|
Commit message is terse and technical; n |
2026-05-03 |
| COMMIT |
0.00 |
🚨 Get rid of most Apex references (#45723) |
|
Concise and informal message, domain-typ |
2026-05-01 |
| COMMIT |
0.00 |
fix(utils): Resolve backbone utils test regressions (#45594) |
|
Short, domain-specific commit message; t |
2026-05-01 |
| COMMIT |
0.00 |
[CB] Better overall script and decode bucketting (#45653) |
|
Informal, terse checklist style is human |
2026-05-01 |
| COMMIT |
0.00 |
[docs] model testing (#45152) |
|
Terese commit messages; human style; dom |
2026-04-30 |
| COMMIT |
0.00 |
update dev (#45726) |
|
Brief update message; lacks AI hallmarks |
2026-04-30 |
| COMMIT |
0.00 |
[`OAI Privacy Filter`] Add integration test (#45725) |
|
Short, action-based messages; human comm |
2026-04-30 |
| COMMIT |
0.00 |
Speedup Qwen2VLImageProcessor (#45719) |
|
Technical, terse commit messages; human |
2026-04-30 |
| COMMIT |
0.00 |
Remove dead beam-search dummies from dummy_pt_objects.py (#4 |
|
Single, precise technical commit message |
2026-04-30 |
| COMMIT |
0.00 |
[Model] Add PP-FormulaNet Model Support (#45626) |
|
Terse, domain-specific commits; includes |
2026-04-30 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 10 utility files (#4 |
|
Clear domain context and project-specifi |
2026-04-30 |
| COMMIT |
0.00 |
[serve] cb error (#45691) |
|
Brief commit log format, no AI signals. |
2026-04-29 |
| COMMIT |
0.00 |
Fix trust_remote_code local cache collisions for local model |
|
Technical commit log, concise, no AI hal |
2026-04-29 |
| COMMIT |
0.00 |
Llama3 video fix (#45040) |
|
Standard iterative commit log, domain te |
2026-04-29 |
| COMMIT |
0.00 |
[Fix Phi4 test] Fall back to model config for image processo |
|
Technical changelog, brief summary, no A |
2026-04-29 |
| COMMIT |
0.00 |
Fix custom-module copies inheriting read-only permissions (# |
|
Detailed technical context, not overly f |
2026-04-29 |
| COMMIT |
0.00 |
Python code in model docs (#45608) |
|
Informal, terse commit log, domain-speci |
2026-04-29 |
| COMMIT |
0.00 |
fix failed test cases for blt model (#45596) |
|
Technical commit log, signed off by huma |
2026-04-29 |
| COMMIT |
0.00 |
chore(typing): add ty type checking for 3 pipeline files (#4 |
|
Standard technical changelog and co-auth |
2026-04-29 |
| COMMIT |
0.00 |
change got reverted (#45680) |
|
Extremely terse, typical human revert me |
2026-04-28 |
| COMMIT |
0.00 |
fix padding side issue for fast_vlm tests (#45592) |
|
Terse, domain-specific, includes real na |
2026-04-28 |
| COMMIT |
0.00 |
zero_shot_object_detection ValueError fix for python 3.13 (# |
|
Very brief, precise, with clear domain r |
2026-04-28 |
| COMMIT |
0.00 |
Fix pageable H2D copies in Gated DeltaNet PyTorch fallback ( |
|
Concise, technical commit message with d |
2026-04-28 |
| COMMIT |
0.00 |
Fix UnboundLocalError in shard_and_distribute_module for rep |
|
Short, informal message; human co-author |
2026-04-28 |
| COMMIT |
0.00 |
No serving in quality docker image (#45677) |
|
Brief commit message, human co-author, n |
2026-04-28 |
| COMMIT |
0.00 |
Laguna XS.2 implementation (#45673) |
|
Minimal title-only commit, no sign of AI |
2026-04-28 |
| COMMIT |
0.00 |
[MistralCommonBackend] Soften validation mode and apply_chat |
|
Structured changelog, domain-specific co |
2026-04-28 |
| COMMIT |
0.00 |
Fix `NameError: PeftConfigLike` triggered by `PreTrainedMode |
|
Concise, code-centric, domain-specific c |
2026-04-27 |
| COMMIT |
0.00 |
Fix cross-attention cache layer type for T5Gemma2 long input |
|
Terse, technical, and human-typical with |
2026-04-27 |
| COMMIT |
0.00 |
chore(typing): added modeling_utils to ty (#45425) |
|
Informal, uses jargon and review summari |
2026-04-27 |
| COMMIT |
0.00 |
model: Add DEIMv2 to Transformers (#44339) |
|
Uses changelog format, dense with domain |
2026-04-27 |
| COMMIT |
0.00 |
[Qwen3.5] Fix GDN linear attention multi-token cached forwar |
|
Detailed description with human-like bug |
2026-04-27 |
| COMMIT |
0.00 |
[gemma4] infer from config instead of hardcoding (#45606) |
|
Informal, concise updates and normal cod |
2026-04-27 |
| COMMIT |
0.00 |
Update quants tests (#45480) |
|
Minimalist, lacks AI style, uses terse c |
2026-04-27 |
| COMMIT |
0.00 |
Fix GraniteMoeHybrid _update_mamba_mask crash on attention-o |
|
Technical summary with rationale, inform |
2026-04-27 |
| COMMIT |
0.00 |
🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in |
|
Patch details, domain-specific explanati |
2026-04-27 |
| COMMIT |
0.00 |
Fix colmodernvbert tests (#45652) |
|
Very terse, informal, intentionally mini |
2026-04-27 |
| COMMIT |
0.00 |
[CB] [Major] Add CPU request offloading (#45184) |
|
Commit messages are terse, informal, and |
2026-04-27 |
| COMMIT |
0.00 |
Fix peft constructors (#45622) |
|
Very terse and non-formal commit, human |
2026-04-27 |
| COMMIT |
0.00 |
chore: speedup modular converter (~30%) (#45046) |
|
Highly technical, terse, and informal; n |
2026-04-27 |
| COMMIT |
0.00 |
Fix whisper return language (#42227) |
|
Technical commit log with co-author trai |
2026-04-27 |
| COMMIT |
0.00 |
Add `supports_gradient_checkpointing` to `NemotronHPreTraine |
|
Short, direct commit style typical of hu |
2026-04-27 |
| COMMIT |
0.00 |
Raise clear error for `problem_type="single_label_classifica |
|
Explanation is technical with domain det |
2026-04-24 |
| COMMIT |
0.00 |
CircleCI with torch 2.11 (#45633) |
|
Repetitive commit summary, highly typica |
2026-04-24 |
| COMMIT |
0.00 |
chore: bump doc-builder SHA for main doc build workflow (#45 |
|
Standard terse chore commit, no AI style |
2026-04-24 |
| COMMIT |
0.00 |
Allow more artifacts to be download in CI (#45629) |
|
Sparse, informal; lacks signature AI phr |
2026-04-24 |
| COMMIT |
0.00 |
chore(qa): split pipeline and add type checking (#45432) |
|
Contains abbreviations and minimal phras |
2026-04-24 |
| COMMIT |
0.00 |
Skip failing offloading tests (#45624) |
|
Commit messages are brief and telegraphi |
2026-04-24 |
| COMMIT |
0.00 |
generate: drop stale num_return_sequences warning on continu |
|
Technical justification, abbreviations, |
2026-04-24 |
| COMMIT |
0.00 |
Remove unnecessary generate warnings (#45619) |
|
Brief, imperative commit style, no AI si |
2026-04-24 |
| COMMIT |
0.00 |
fix: compute auxiliary losses when denoising is disabled in |
|
Commit uses terse, technical language an |
2026-04-23 |
| COMMIT |
0.00 |
qa: bumped mlinter and allow local override (#45585) |
|
Informal commit lines and explicit human |
2026-04-23 |
| PR |
0.00 |
Add HyperCLOVAX SEED Think 14B |
|
PR template structure; actual content is |
2026-03-23 |
| PR |
0.00 |
[skills] model doc |
|
Concise, no AI-typical phrases; informal |
2026-04-29 |
| PR |
0.00 |
[docs] adding audio/video processors |
|
Informal, includes forum link and human |
2026-05-05 |
| PR |
0.00 |
Warn about forgetting attention mask functions |
|
Technical summary with detailed context, |
2026-05-06 |
| PR |
0.00 |
Fix WeightConverter substring match on leaf-style source pat |
|
Technical explanation with in-line code, |
2026-05-05 |
| PR |
0.00 |
fix: validate special token ids against attribute values |
|
Technical bugfix explanation, no AI phra |
2026-05-05 |
| PR |
0.00 |
Blockwise mask fn as opt arg in all masking functions |
|
Informal, to-the-point, abbreviations an |
2026-04-16 |
| PR |
0.00 |
[docs] paper cuts |
|
Very brief, direct, and informal languag |
2026-05-06 |
| PR |
0.00 |
fix: forward use_cache kwarg to attention mixer in nemotron_ |
|
Informal commit message, addresses typo, |
2026-05-05 |
| PR |
0.00 |
feat(llama): add has_weight parameter to LlamaRMSNorm for Fl |
|
Domain terms, technical context, and nat |
2026-05-02 |
| PR |
0.00 |
Fix xdist collisions for captured_info artifacts and preserv |
|
Bug context, references, and non-boilerp |
2026-04-25 |
| PR |
0.00 |
Unwrap `text_config` in `AutoModelFor*.from_config` |
|
PR uses template; filled content is dire |
2026-05-04 |
| PR |
0.00 |
Gemma4: fix failed test cases |
|
Bulleted list, informal typos, and techn |
2026-04-22 |
| PR |
0.00 |
fix: per-instance cache in compile_compatible_method_lru_cac |
|
Terse summary and technical details, wit |
2026-05-05 |
| PR |
0.00 |
fix(testing): use worker-specific captured_info files for py |
|
Problem statement and technical domain c |
2026-05-05 |
| PR |
0.00 |
Fix CI: Allow more artifacts to be download in CI |
|
Informal language and natural error expl |
2026-05-05 |
| PR |
0.00 |
Add `concurrency` to `PR CI` workflow file (`pr-ci-caller.ym |
|
Technical file/path reference and concis |
2026-05-05 |
| PR |
0.00 |
Reorder decorators for autodoc and dataclass |
|
Informal, mentions 'smth', personal tone |
2026-04-29 |
| PR |
0.00 |
deepseek r1 distilled tokenizer fix for qwen2 mapping |
|
Extremely terse, informal, no AI or temp |
2026-05-02 |