| Type | Score | Title | Reason | Date |
| --- | --- | --- | --- | --- |
| COMMIT | 1.00 | [Bugfix] Remove incorrect torchvision requirement from PIL b | Commit message contains explicit AI assi | 2026-03-30 |
| COMMIT | 1.00 | fix: protect torch imports in processing files and fix impor | Commit message contains explicit AI assi | 2026-03-27 |
| COMMIT | 1.00 | Fix release full (#45029) | Commit message contains explicit AI assi | 2026-03-27 |
| COMMIT | 1.00 | Add `base_model_tp_plan` to `OlmoeConfig` (#44668) | Commit message contains explicit AI assi | 2026-03-26 |
| COMMIT | 1.00 | Fix `maybe_autocast` crashing on meta device tensors (#44984 | Commit message contains explicit AI assi | 2026-03-25 |
| COMMIT | 1.00 | LwDetrImageLoss: Fix dtype casting to prevent crash when usi | Commit message contains explicit AI assi | 2026-03-24 |
| COMMIT | 1.00 | fix(i18n): replace broken relative links to awesome-transfor | Commit message contains explicit AI assi | 2026-03-23 |
| PR | 1.00 | Pass packed boundary metadata to Qwen3.5 linear-attention fa | PR body explicitly mentions AI collabora | 2026-03-26 |
| PR | 1.00 | Fix MoE routers returning probabilities instead of logits | PR body explicitly mentions AI collabora | 2026-03-30 |
| PR | 1.00 | Fix failing `XCLIPModelIntegrationTest` | PR body explicitly mentions AI collabora | 2026-03-27 |
| PR | 1.00 | Update accelerator_selection.md | PR body explicitly mentions AI collabora | 2026-03-29 |
| PR | 1.00 | fix: incomplete string literal causes syntax error in config | PR body explicitly mentions AI collabora | 2026-03-29 |
| PR | 1.00 | Add GDS support for safetensors loading | PR body explicitly mentions AI collabora | 2026-03-30 |
| PR | 1.00 | Add SAM 3.1 | PR body explicitly mentions AI collabora | 2026-03-30 |
| PR | 1.00 | Refactor/nemotron h inherit granitemoehybrid | PR body explicitly mentions AI collabora | 2026-03-30 |
| PR | 1.00 | fix: prefer registered config over remote code in AutoConfig | PR body explicitly mentions AI collabora | 2026-03-29 |
| PR | 1.00 | [fix] BC for legacy configs with top-level rope_theta when r | PR body explicitly mentions AI collabora | 2026-03-26 |
| PR | 1.00 | Add sarvam model | PR body explicitly mentions AI collabora | 2026-03-25 |
| COMMIT | 0.80 | docs: add PermuteForRope to conversion operations reverse ta | Formal explanatory tone with unusually p | 2026-03-26 |
| PR | 0.70 | Osman-Level Innovations: Hardware-Aware Advisor & Selective | Overly formal, self-promotional tone not | 2026-03-27 |
| PR | 0.50 | Add regression test for ByteLevel added-token Unicode decode | Neutral technical free text; some slight | 2026-03-27 |
| COMMIT | 0.30 | [CB] Persistent manager (#44435) | Stacked commits with brief human-like co | 2026-03-26 |
| PR | 0.30 | fix: correct type annotations across config classes for @str | More formal summary section but largely | 2026-03-25 |
| COMMIT | 0.20 | Add cohere asr (#45023) | Standard PR structure with commit messag | 2026-03-26 |
| COMMIT | 0.20 | Ensure final evaluation runs with step-based evaluation stra | Typical rebase/merge conflict pattern, h | 2026-03-26 |
| PR | 0.20 | [WIP] Add Ovis2.5 | Describes a new feature with typical fre | 2025-08-20 |
| PR | 0.20 | Fix PreTrainedConfig as Pydantic field type after dataclass | Has a bit more structure but clearly tec | 2026-03-28 |
| PR | 0.20 | [VidEoMT] Update conversion script | Brief, direct update note with link; no | 2026-03-28 |
| PR | 0.18 | Fix _get_feat_extract_output_lengths in qwen3_omni_moe | Technical with a problem statement; stru | 2026-03-29 |
| PR | 0.16 | Fix PretrainedConfig type checking with mypy | Domain-specific, describes regression, n | 2026-03-28 |
| PR | 0.16 | Fix TypeError when chat_template is None in VoxtralProcessor | Technical, describes problem; not overly | 2026-03-29 |
| PR | 0.15 | Fix auto_docstring crash with from __future__ import annotat | Technical, mentions specific decorator, | 2026-03-29 |
| PR | 0.15 | add HyperClovaX Vision | Direct, technical, proper names, minor i | 2026-02-27 |
| PR | 0.15 | Add old InternVL2-1B/2B support to the InternVL conversion s | Technical, specific, some template struc | 2026-03-29 |
| PR | 0.12 | Fix: Skip meta device initialization for remote code models | Technical, specifies bug and error, info | 2026-03-29 |
| COMMIT | 0.10 | post release tag update | Terse human comment about tag update. | 2026-03-27 |
| COMMIT | 0.10 | style was missing sorry @ydshieh :) (#45038) | Informal apology with GitHub mention, hu | 2026-03-27 |
| COMMIT | 0.10 | Add BC for `_further_process_kwargs` (#45033) | Standard BC fix with signed-off-by trail | 2026-03-26 |
| COMMIT | 0.10 | Use multi runners to check new failing tests in a CI run (#4 | Concise technical change with co‑authore | 2026-03-26 |
| COMMIT | 0.10 | [`fix`] Use the correct _tied_weights_keys for CamembertForC | Direct fix description, human technical | 2026-03-26 |
| COMMIT | 0.10 | change dev ver. we forgot to do this when we released 5.3.0 | Casual mention of forgotten version bump | 2026-03-26 |
| COMMIT | 0.10 | Fix missing post_processor in DebertaV2Tokenizer causing no | Detailed technical context, natural phra | 2026-03-24 |
| COMMIT | 0.10 | Add big angry code agent warnings! (#44890) | Commit messages use domain language and | 2026-03-22 |
| PR | 0.10 | Add RF-DETR | Very terse description; human style, not | 2025-03-21 |
| PR | 0.10 | Add Music Flamingo | Technical, concise and domain-specific; | 2026-01-27 |
| PR | 0.10 | [CB] Add warmup feature | Brief, technical, some spelling mistakes | 2026-03-30 |
| PR | 0.10 | model: Add DEIMv2 to Transformers | Bullet points, direct, uses jargon and a | 2026-02-27 |
| PR | 0.10 | Add THD support in ESM | Technical, direct explanation with a typ | 2026-02-19 |
| PR | 0.10 | http retries on audio file downloads | Direct, technical with parenthetical not | 2026-03-30 |
| PR | 0.10 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | Technical, domain-specific, uses abbrevi | 2026-03-30 |
| PR | 0.10 | Add full GGUF loading support for GPT‑OSS (fixes #43366) | Technical, concise, formatted like stand | 2026-03-30 |
| PR | 0.10 | Adding Omnilingual ASR models | Technical detail, items/checklists, info | 2026-01-13 |
| PR | 0.10 | fix(models): Fix dtype mismatch in SwitchTransformers and Ti | Technical, bullet points, domain referen | 2026-03-27 |
| PR | 0.10 | Refactor OwlViT to modular Transformers | Bulleted changelog, domain terms, terse | 2026-03-27 |
| PR | 0.10 | [PoC] HF exporters | Contains domain abbreviations and inform | 2025-11-03 |
| PR | 0.10 | Fix dtype mismatches in SwitchTransformers and TimmWrapperMo | Uses technical specifics, no AI phrasing | 2026-03-28 |
| PR | 0.10 | [CB] Add per-request logits processors | Technical summary and abbreviations; not | 2026-03-26 |
| PR | 0.10 | fix(testing): Fix Kyutai Speech-To-Text, LLaVA-OneVision, an | Technical, domain-specific language; not | 2026-03-14 |
| PR | 0.10 | Fix `text-to-speech` pipeline crash when generation config c | Uses domain-specific detail and issue re | 2026-03-30 |
| PR | 0.10 | [Bugfix] Remove incorrect torchvision requirement from PIL b | Brief, technical, mentions issue and PR | 2026-03-27 |
| PR | 0.10 | [WIP][Fix] GLM 5 set `apply_rotary_pos_emb` to `is_neox_styl | Domain jargon and terse style; aligned w | 2026-03-26 |
| PR | 0.10 | Add HyperCLOVAX model | Concise, technical, includes jargon; no | 2026-03-23 |
| PR | 0.10 | Fix @auto_docstring crash with from __future__ import annota | Technical free-text, specific error, not | 2026-03-29 |
| PR | 0.10 | [nemotron_h] Add support for MLP mixers | Domain language and informal phrasing; n | 2026-03-16 |
| PR | 0.10 | Add Deepseek-OCR-2 model | Concise, references, technical content; | 2026-03-27 |
| PR | 0.10 | fix AttributeError in _patch_mistral_regex | Terse explanation with jargon; strongly | 2026-03-28 |
| PR | 0.10 | Fix TypeError in RoPE validation when ignore_keys_at_rope_va | Contains domain details and technical sp | 2026-03-10 |
| PR | 0.10 | add `StaticLayer.crop()` to match `DynamicLayer` API | Technical reasoning, domain discussion; | 2026-03-20 |
| PR | 0.10 | Fix `_set_model_specific_special_tokens` to accept list-form | Mentions error detail and technical cont | 2026-03-17 |
| COMMIT | 0.07 | Fix: Update optimization.py (#44909) | Technical explanation and changelog; no | 2026-03-24 |
| COMMIT | 0.05 | Fix tie_word_embedding issues with `Qwen2VL` (#44976) | Commit messages are terse and domain-spe | 2026-03-24 |
| COMMIT | 0.05 | Support Modular (!!) + Configs in `check_auto_docstrings` (# | Brief, technical changelog; no AI text h | 2026-03-24 |
| PR | 0.05 | [docs] @auto_docstring decorator | Domain-specific, concise summary, no AI | 2026-03-30 |
| PR | 0.05 | Module Fusion API | Concise technical summary, bullet points | 2026-03-24 |
| PR | 0.05 | fix(config): annotate PreTrainedConfig.dtype as Any to fix p | Contains technical details and natural e | 2026-03-30 |
| PR | 0.05 | Fix: handle future annotations in _process_kwargs_parameters | Concise technical fix summary; test note | 2026-03-30 |
| PR | 0.05 | Fix few issues in Qwen_3_Omni_Moe | Concise fix description, references spec | 2026-03-19 |
| PR | 0.05 | [Qwen3.5 MoE] Add _tp_plan to ForConditionalGeneration | Uses domain-specific code context; struc | 2026-03-30 |
| PR | 0.05 | refactoring: speedup static checks with disk cache | Informal tone, mentions 'can be quite sl | 2026-03-25 |
| PR | 0.05 | :rotating_light: [`LightGlue`] Remove remote code execution | Technical, informal explanation, abrupt | 2026-03-30 |
| PR | 0.05 | perceptron: Isaac-0.1 implementation | Technical, domain links, human style sum | 2025-09-18 |
| PR | 0.05 | fix: prevent IndexError in Whisper timestamp decode on trail | Concise, technical summary, references i | 2026-03-25 |
| PR | 0.05 | [serving] Fix continuous batching JSON response serializatio | Concise, domain-specific language, no AI | 2026-03-27 |
| PR | 0.05 | fix: remove unsafe exec() in serve.py | Technical content, specific details, nor | 2026-03-30 |
| PR | 0.05 | Fix `Wav2Vec2Config.vocab_size` type to allow `None` | Technical explanation, informal, domain | 2026-03-30 |
| PR | 0.05 | Fix resized LM head weights being overwritten by post_init | Technical explanation, references issue, | 2026-03-28 |
| PR | 0.05 | fix: use sys.modules.get() to avoid KeyError in modeling_uti | Direct, technical, concise summary, lack | 2026-03-28 |
| PR | 0.05 | Fix double softmax in MoE router load-balancing loss | Stepwise technical explanation, domain-s | 2026-03-30 |
| PR | 0.05 | Fix: Preserve PreTrainedConfig __init__ signatures for type | Direct, technical, refers to type checke | 2026-03-30 |
| PR | 0.05 | Fix PIL backend fallback when torchvision is unavailable | Direct fix description, domain context, | 2026-03-27 |
| PR | 0.05 | update release workflow | Terse, informal; uses domain terms and m | 2026-02-18 |
| PR | 0.03 | Fix NaN weights on non-rank-0 FSDP processes | Brief, direct, references issues, typica | 2026-03-27 |
| PR | 0.03 | [Cache] Native mamba & hybrid cache | Casual tone, technical content, no AI ha | 2026-03-23 |
| COMMIT | 0.02 | refactor: mlinter as its own package (#44939) | Informal, domain-specific commit message | 2026-03-24 |
| COMMIT | 0.01 | [CB] [Minor] Simplify test suite (#44858) | Minimal, terse commit messages; highly h | 2026-03-24 |
| COMMIT | 0.01 | Allow arbitrary template kwargs in processors (#44881) | Commit messages are brief and informal; | 2026-03-24 |
| COMMIT | 0.01 | incorrect model list update (#44880) | Terse, casual commit history; human-writ | 2026-03-24 |
| COMMIT | 0.01 | [CB] Add an option to return logprobs (#44835) | Brief, informal commit messages; human s | 2026-03-23 |
| COMMIT | 0.01 | [docs] peft (#44804) | Very minimal and informal; no signs of A | 2026-03-23 |
| COMMIT | 0.01 | Continuous batching thread safety (#44924) | Informal commit log, technical focus; ty | 2026-03-23 |
| COMMIT | 0.01 | Add static FP8 expert support (#44895) | Highly terse, typical human commit patte | 2026-03-23 |
| COMMIT | 0.00 | CB improvements for serving (#45063) | All items are terse commit messages typi | 2026-03-30 |
| COMMIT | 0.00 | Add Music Flamingo (#43538) | Human-typical terse, technical commit me | 2026-03-30 |
| COMMIT | 0.00 | [docs] continuous batching (#44896) | Brief, informal, technical wording; no A | 2026-03-30 |
| COMMIT | 0.00 | Fix few issues in Qwen_3_Omni_Moe (#44848) | Commit messages are typical, concise dev | 2026-03-30 |
| COMMIT | 0.00 | Fix PP test_ocr_queries (#45123) | Short technical fix message, human style | 2026-03-30 |
| COMMIT | 0.00 | Fix TypeError in rope validation when ignore_keys is a list | Technical explanation with specific deta | 2026-03-30 |
| COMMIT | 0.00 | refactor: added cache in check_repo (#45012) | Follows conventional commit style; infor | 2026-03-30 |
| COMMIT | 0.00 | Remove unused TensorFlow env var (#45065) | Very terse, typical human commit style, | 2026-03-27 |
| COMMIT | 0.00 | fix: add identity reverse_op to dequantize ops for save_pret | Technical, domain-specific, informal, hu | 2026-03-27 |
| COMMIT | 0.00 | Fix when RoPE params are in kwargs (#45049) | Concise, technical phrasing, typical of | 2026-03-27 |
| COMMIT | 0.00 | chore: update update_metdata.yml (#45054) | - | 2026-03-27 |
| COMMIT | 0.00 | [`FA`] Fix BC support for a few versions + add deprecation c | Commit is terse, informal, and uses doma | 2026-03-27 |
| COMMIT | 0.00 | fix(testing): Fix Parakeet, Evolla, Pi0, and Phi-3 test fail | Commit and co-authored-by trailer show h | 2026-03-27 |
| COMMIT | 0.00 | Allow advanced users to override `model_type` in `AutoConfig | Brief technical summary; concise, domain | 2026-03-27 |
| COMMIT | 0.00 | Fix llama4 bnb mode (#44588) | Commit log style; terse, technical, no A | 2026-03-27 |
| COMMIT | 0.00 | Fix failing `SmolLM3IntegrationTest` (#45048) | Minimal content; repetitive test referen | 2026-03-27 |
| COMMIT | 0.00 | fix tests/quantization/fp_quant_integration/test_fp_quant.py | Technical jargon, short notes, domain-sp | 2026-03-27 |
| COMMIT | 0.00 | chore: remove old extras (#45024) | Concise, informal domain language; human | 2026-03-27 |
| COMMIT | 0.00 | Avoid `Image.open` failure (#44645) | Technical fixes, terse comments, human c | 2026-03-27 |
| COMMIT | 0.00 | chore: Fix mlinter cache location (#45052) | Brief summary; informal, domain-jargon, | 2026-03-27 |
| COMMIT | 0.00 | Embedding VLMs don't need a head (#45000) | Informal notes; terse commit style, doma | 2026-03-27 |
| COMMIT | 0.00 | Fix GraniteConfig type hints to accept int for multiplier fi | Standard technical explanation; concise, | 2026-03-27 |
| COMMIT | 0.00 | fix: preserve rotary_pct across save/load cycle in GPTNeoX c | Human commit style; technical details, p | 2026-03-27 |
| COMMIT | 0.00 | refactor: speed up docstring checker (#45009) | Commit messages are terse and domain-spe | 2026-03-27 |
| COMMIT | 0.00 | ci: add anti-slop action (#44847) | - | 2026-03-26 |
| COMMIT | 0.00 | Add doc page for capturing outputs (#44947) | - | 2026-03-26 |
| COMMIT | 0.00 | Dynamic weight conversion is recursive (#44300) | Commit messages are terse, informal, and | 2026-03-26 |
| COMMIT | 0.00 | Don't run `tests_hub` if no tests found (#45014) | Extremely terse commit messages, typical | 2026-03-26 |
| COMMIT | 0.00 | Fix type hint for `attention_chunk_size` in `Llama4TextConfi | Brief, domain-specific commit, no AI sig | 2026-03-25 |
| COMMIT | 0.00 | Fix AutoProcessor.from_pretrained silently dropping hub kwar | Detailed tech explanation, domain jargon | 2026-03-25 |
| COMMIT | 0.00 | Add VidEoMT (#44285) | Commit history is terse and technical, h | 2026-03-25 |
| COMMIT | 0.00 | fix: remove Copied from comments between @torch.jit.script a | Concise, domain-specific explanation, cl | 2026-03-25 |
| COMMIT | 0.00 | More small vllm fixes (#44990) | Commit messages are terse and informal; | 2026-03-25 |
| COMMIT | 0.00 | fix(models): Fix Perceiver interpolate_pos_encoding interpol | Commit messages use domain jargon and in | 2026-03-25 |
| COMMIT | 0.00 | Allow `mm_token_type` be non-padded lists (#44563) | Commit log is brief and contains human-l | 2026-03-25 |
| COMMIT | 0.00 | Fix CPU 16 bytes alignment issue using equivalent fallback ( | Commit messages are terse, technical, an | 2026-03-25 |
| COMMIT | 0.00 | refactor: unify QA calls (#44879) | Commit messages filled with informal ton | 2026-03-25 |
| COMMIT | 0.00 | [ `vllm x v5`] nit (#44971) | Very terse nits and technical jargon, ty | 2026-03-24 |
| COMMIT | 0.00 | [AMD CI] Gemma3/Gemma3n Expectations (#44972) | Direct, slangy commits and clear domain | 2026-03-24 |
| COMMIT | 0.00 | Officially launch parse_response (#44674) | - | 2026-03-24 |
| COMMIT | 0.00 | fix load_best_model_checkpoint_at_end do not load the best m | - | 2026-03-24 |
| COMMIT | 0.00 | fix: split MXFP4 dependency checks for specific error messag | - | 2026-03-24 |
| COMMIT | 0.00 | Fix failing `T5ModelIntegrationTest` (#44934) | - | 2026-03-24 |
| COMMIT | 0.00 | Config kwargs (#44953) | - | 2026-03-24 |
| COMMIT | 0.00 | Fix variable shadowing in pipeline example and typo in BART | Commit message is terse and domain-speci | 2026-03-23 |
| COMMIT | 0.00 | Fix failing job `Update Transformers metadata` after #43514 | Terse commit messages, minimal free-text | 2026-03-23 |
| COMMIT | 0.00 | Clearer type hints and fix rope validation in configs (#4494 | Casual phrasing, typos, domain-specific | 2026-03-23 |
| COMMIT | 0.00 | Correct docstrings for `from_pretrained` (url input deprecat | Technical, short, no ChatGPT markers, hu | 2026-03-23 |
| COMMIT | 0.00 | Fix backward compatibility for full path imports of Fast Ima | Technical changelog, informal tone, huma | 2026-03-23 |
| COMMIT | 0.00 | chore(typing): added rule 11 (#44865) | Informal commit titles, domain jargon, n | 2026-03-23 |
| COMMIT | 0.00 | fix: improve processor loading performance by avoiding redun | Structured technical changes, natural ph | 2026-03-23 |
| COMMIT | 0.00 | fix(camembert): add tie_word_embeddings=True to CamembertCon | Detailed technical context, some typos, | 2026-03-23 |
| COMMIT | 0.00 | Support SizeDict import in get_size_dict (#44903) | Short, direct, typical commit phrasing, | 2026-03-23 |
| COMMIT | 0.00 | fix `processing_utils.py`: avoid deepcopying tokenizer in `P | Concise, minimal, domain-specific, human | 2026-03-23 |
| COMMIT | 0.00 | fix: set `clean_up_tokenization_spaces=False` in Llama 3 tok | Clear technical explanation, not overly | 2026-03-23 |
| COMMIT | 0.00 | [docs] model cards (#44837) | Extremely terse and informal; signals hu | 2026-03-20 |
| COMMIT | 0.00 | [Model] Add UVDoc Model Support (#43385) | Fragmented, minimal commit message style | 2026-03-20 |
| COMMIT | 0.00 | Add backward compatibility for direct imports from legacy `i | Brief, domain-specific phrasing, no AI s | 2026-03-20 |
| COMMIT | 0.00 | [`FA4`] Add kernels fallback (#44797) | Informal, technical, and concise message | 2026-03-20 |
| COMMIT | 0.00 | Bump kernels version dependency to avoid crashes (#44887) | Very terse commit messages with co-autho | 2026-03-20 |
| COMMIT | 0.00 | [Model] Add SLANeXt Model Support (#43707) | Informal, many quick fixes, joking ('it | 2026-03-20 |
| COMMIT | 0.00 | Fix core dumped when `NemotronH` is torch compiled (#44854) | Commit messages are terse with typical h | 2026-03-20 |
| COMMIT | 0.00 | Fix several based models' pipeline parallel support (#44699) | Pragmatic one-line descriptions and stan | 2026-03-20 |
| COMMIT | 0.00 | fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures | Concise technical message; style is typi | 2026-03-20 |
| COMMIT | 0.00 | Fix dtype guessing from state dict (#44883) | Very short, domain-specific commit title | 2026-03-20 |
| COMMIT | 0.00 | Add missing dunder methods to `SizeDict` (#44884) | Standard minimal commit summary; no AI h | 2026-03-20 |
| COMMIT | 0.00 | Fix VL model rope_deltas batch size mismatch in online RL tr | Short, technical, human-style summary an | 2026-03-20 |
| COMMIT | 0.00 | Fix `layer_types` type hint for `AFMoE` and `Llama4` (#44874 | Standard type hint update, signed by use | 2026-03-20 |
| COMMIT | 0.00 | Align lfm2 cache to other mamba caches (#44866) | Minimal, direct messages with informal c | 2026-03-20 |
| COMMIT | 0.00 | Fix nemotron config docstrings (#44878) | Terse domain description, matches human | 2026-03-20 |
| PR | 0.00 | Fix: Remove double softmax in MoE router load-balancing loss | PR template; user content is technical a | 2026-03-30 |
| PR | 0.00 | [WIP] Add CharacterBERT model | PR template only; WIP marker and user co | 2023-10-05 |
| PR | 0.00 | throw error when conversion required | PR template; user content is brief and t | 2026-03-27 |
| PR | 0.00 | Fix T5Attention shape mismatch under Tensor Parallelism | Technical language, informal, clear manu | 2026-03-30 |
| PR | 0.00 | Add SAM3-LiteText | Technical jargon, includes typos and hum | 2026-02-27 |
| PR | 0.00 | [3dconv][qwenvl] convert patch embed forward to linear | Domain abbreviations, informal, human-ex | 2026-03-27 |
| PR | 0.00 | Add Videoprism | Concise, dev-specific context, not overl | 2025-08-04 |
| PR | 0.00 | [refactor] Serving into proper modules | Terse, informal, with human-like structu | 2026-03-17 |
| PR | 0.00 | CB improvements for serving | Direct, technical, informal, no AI patte | 2026-03-27 |
| PR | 0.00 | Refactor CLIP-like models | Informal language and abbreviations sign | 2026-03-04 |
| PR | 0.00 | [docs] continuous batching | Informal, concise list format, some lowe | 2026-03-20 |
| PR | 0.00 | Fix PP test_ocr_queries | Casual tone, exclamation, direct referen | 2026-03-30 |
| PR | 0.00 | DeepGEMM | Template section left in, no additional | 2026-03-18 |
| PR | 0.00 | Add Molmo2 | Template section only, no filled free te | 2026-01-23 |
| PR | 0.00 | Copy the template resolution logic from the base apply_chat_ | Direct, domain-specific; uses shorthand | 2026-03-30 |
| PR | 0.00 | fix audio encoder output length formula in qwen3_omni_moe | Concise, technical, uses domain-specific | 2026-03-28 |
| PR | 0.00 | Fix TypeError in rope validation when ignore_keys is a list | Succinct bug report with technical detai | 2026-03-27 |
| PR | 0.00 | fix: lets fix all doctests | Informal language and brief editing; typ | 2026-03-30 |
| PR | 0.00 | Use doc-builder runnable example for GLM-ASR | Direct and short technical update; clear | 2026-02-25 |
| PR | 0.00 | feature: added import complexity checker | Bullet points, informal spelling errors, | 2026-03-26 |
| PR | 0.00 | Avoid device sync in training loss accumulation | Terse technical justification, domain la | 2026-02-18 |
| PR | 0.00 | refactor: added cache in check_repo | Very informal, minimal, with technical s | 2026-03-26 |
| PR | 0.00 | Adding support for Nandi Models | Extremely brief and minimal; typical for | 2026-03-29 |
| PR | 0.00 | fix bug for janus model image generation | Informal, contains direct user mentions | 2026-03-27 |
| PR | 0.00 | [`auto_docstring`] needs to be only run on __doc__ | Informal and terse; shows human-like bre | 2026-03-27 |
| PR | 0.00 | Fix whisper return language | Minimal content with domain reference, t | 2025-11-16 |
| PR | 0.00 | Modular playground | Update list format, informal tone, and t | 2026-02-04 |
| PR | 0.00 | fix: pin 69 unpinned action(s),extract 2 unsafe expression(s | Main content begins in user’s own voice, | 2026-03-26 |
| PR | 0.00 | fix: pin 50 unpinned actions to commit SHA, extract 1 secret | Brief, informal, mentions previous PR, n | 2026-03-27 |