| COMMIT |
1.00 |
Update AFMoE architecture to use v5-style MoE impl (#44063) |
|
Commit message contains explicit AI assi |
2026-03-19 |
| COMMIT |
1.00 |
Sdpa for owlvit (#42136) |
|
Commit message contains explicit AI assi |
2026-03-17 |
| COMMIT |
1.00 |
:rotating_light: Validate config attributes (#41250) |
|
Commit message contains explicit AI assi |
2026-03-16 |
| PR |
1.00 |
Fix #44155: [AudioFlamingo3] Batched inference produces inco |
|
PR body explicitly mentions AI collabora |
2026-02-21 |
| PR |
0.60 |
🚨 Refactor ViT to updated standards |
|
Contains ChatGPT-like phrasing: 'This PR |
2025-10-17 |
| PR |
0.50 |
🚨🚨 Refactor Image Processors to support different backends |
|
Summary style is structured but not clea |
2026-01-27 |
| PR |
0.50 |
Update AFMoE architecture to use v5-style MoE impl |
|
Well-structured but not overly formal; t |
2026-02-17 |
| PR |
0.40 |
[CB] Add an option to return logprobs |
|
Slightly formal but concise and technica |
2026-03-18 |
| PR |
0.30 |
[generate] Never use `cache_position` anymore in generation |
|
Human-like summary, direct and informal |
2026-03-18 |
| PR |
0.20 |
Add THD support in ESM |
|
Somewhat formal, mostly technical, not A |
2026-02-19 |
| PR |
0.20 |
Internalise the NomicBERT model |
|
Mainly technical content and references. |
2025-12-29 |
| PR |
0.20 |
FSDP2 native support in transformers |
|
Technical summary, domain terms, not ove |
2026-02-17 |
| PR |
0.20 |
[PoC] HF exporters |
|
Slightly more formal, but has informal a |
2025-11-03 |
| PR |
0.20 |
add HyperClovaX Vision |
|
Slightly more verbose, but addresses tea |
2026-02-27 |
| PR |
0.20 |
fix(testing): Fix Kyutai Speech-To-Text, LLaVA-OneVision, an |
|
Uses headings and detailed explanation, |
2026-03-14 |
| PR |
0.20 |
Fix: Update outdated sampler comment in generation/utils.py |
|
Uses headings and proper structure, but |
2026-03-20 |
| PR |
0.20 |
[Docs] Update DeiT model card to new format |
|
Standard doc update with technical focus |
2026-03-19 |
| PR |
0.20 |
perceptron: Isaac-0.1 implementation |
|
Contains model/jargon, summary is specif |
2025-09-18 |
| PR |
0.20 |
Fix KeyError in convert_to_native_format for dict vocab |
|
Technical, problem-focused, uses domain- |
2026-03-05 |
| PR |
0.20 |
Use `index_select` instead of advanced indexing in `batched_ |
|
Technical, detailed, direct issue refere |
2026-03-13 |
| PR |
0.15 |
fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures |
|
Lists failures with terse, technical phr |
2026-03-16 |
| PR |
0.15 |
refactor: improved the cli server module code organization |
|
Terse, technical reorg summary, real cod |
2026-03-20 |
| PR |
0.15 |
Fix VL model rope_deltas batch size mismatch in online RL tr |
|
Starts with problem header, technical co |
2026-03-20 |
| PR |
0.15 |
Add Music Flamingo |
|
Technical tone with domain-specific refe |
2026-01-27 |
| PR |
0.15 |
Add Tamil README.md Documentation. |
|
Brief, direct wording without AI-like st |
2025-10-18 |
| PR |
0.15 |
Refactor gptj output tracing to use standardized decorators |
|
Technical, well-structured but not AI-ty |
2026-03-15 |
| PR |
0.13 |
fix: prevent IndexError in Whisper word timestamp decode |
|
Describes specific corner-case bug, dire |
2026-03-20 |
| PR |
0.12 |
fix: mark new lm_head params as `_is_hf_initialized` after ` |
|
Informal fix message, references issue, |
2026-03-14 |
| PR |
0.12 |
fix: ensure prediction_step returns tensor for logits, not t |
|
Refers to type hints, clear problem/fix, |
2026-03-20 |
| PR |
0.10 |
[Misc] add enable_thinking to template kwargs |
|
Contains domain-specific abbreviations, |
2026-03-18 |
| PR |
0.10 |
Fix llama4 bnb mode |
|
Has jargon and technical details; inform |
2026-03-11 |
| PR |
0.10 |
fix load_best_model_checkpoint_at_end do not load the best m |
|
Contains repetition, typo, and terse exp |
2026-03-10 |
| PR |
0.10 |
Fix core dumped when `NemotronH` is torch compiled |
|
Uses real-world test output and casual l |
2026-03-19 |
| PR |
0.10 |
Bump kernels version dependency to avoid crashes |
|
Informal tone, emojis, includes example |
2026-03-20 |
| PR |
0.10 |
LwDetrImageLoss: Fix dtype casting to prevent crash when usi |
|
Brief, domain-specific, uses informal gr |
2026-03-20 |
| PR |
0.10 |
Fix several based models' pipeline parallel support |
|
Domain jargon, uses abbreviation, inform |
2026-03-14 |
| PR |
0.10 |
Dynamic weight conversion is recursive |
|
Explains motivation, refers to related P |
2026-02-26 |
| PR |
0.10 |
Pass packed boundary metadata to Qwen3.5 linear-attention fa |
|
Domain-specific phrasing and concise pro |
2026-03-19 |
| PR |
0.10 |
Fix dtype guessing from state dict |
|
Direct and minimal with use of domain re |
2026-03-20 |
| PR |
0.10 |
Add missing dunder methods to `SizeDict` |
|
Succinct description with technical cont |
2026-03-20 |
| PR |
0.10 |
Support Modular (!!) + Configs in `check_auto_docstrings` |
|
Informal emphasis, punctuation, and clea |
2026-03-17 |
| PR |
0.10 |
[model] Add PenguinVL implementation |
|
Factual reference to resources, no AI pa |
2026-03-13 |
| PR |
0.10 |
fix: handle list-type _tied_weights_keys in _get_tied_weight |
|
Terse, technical commit; includes domain |
2026-03-19 |
| PR |
0.10 |
feat: added cache to the model linter |
|
Technical summary with Makefile mention; |
2026-03-17 |
| PR |
0.10 |
ci: add anti-slop action |
|
Brief, uses bullet points and domain lin |
2026-03-19 |
| PR |
0.10 |
Add AudioFlamingoNext model |
|
Direct technical description; concise ch |
2026-03-18 |
| PR |
0.10 |
chore(typing): extend typing to `src/transformers/cli` |
|
Technical, with link reference; natural |
2026-03-10 |
| PR |
0.10 |
model: Add DEIMv2 to Transformers |
|
Concise, domain-specific language; no AI |
2026-02-27 |
| PR |
0.10 |
Fix Mllama torch.compile failure caused by new attention mas |
|
Clear, technical explanation; lacks AI-t |
2026-03-19 |
| PR |
0.10 |
Officially launch parse_response |
|
Informal, clear explanation lacking AI i |
2026-03-13 |
| PR |
0.10 |
Fix glm dsa |
|
Terse message with informal structure, l |
2026-03-10 |
| PR |
0.10 |
refactor: rope in model, flatten vision, rely on qwen3 backo |
|
Minimal, specific commit-style phrasing; |
2026-03-19 |
| PR |
0.10 |
fix: handle unpicklable tokenizers in ProcessorMixin.to_dict |
|
Brief description, some minor template e |
2026-03-19 |
| PR |
0.10 |
Add xcodec2 model |
|
Informal, brief checklist and WIP signal |
2026-02-20 |
| PR |
0.10 |
[`Mllama`] Fix workaround compile |
|
Casual tone, 'tbh' abbreviation, technic |
2026-03-19 |
| PR |
0.10 |
Fix Zamba2MambaMixer ignoring use_mamba_kernels=False |
|
Domain-specific, direct reference to cod |
2026-03-19 |
| PR |
0.10 |
Fix AutoImageProcessor URL loading regression |
|
Concise, technical, direct style; no AI- |
2026-03-19 |
| PR |
0.10 |
[CB] Better parametrization for compile |
|
Technical, concise summary, and informal |
2026-03-10 |
| PR |
0.09 |
Proposal to add Qwen3-ASR support [WIP] |
|
Direct, domain jargon, filled template i |
2026-02-08 |
| PR |
0.08 |
Ensure final evaluation runs with step-based evaluation stra |
|
Technical detail and casual tone, human- |
2026-02-19 |
| PR |
0.07 |
[refactor] Serving into proper modules |
|
Technical, casual language, some typos ( |
2026-03-17 |
| PR |
0.07 |
Switch FP8 per tensor quant to use `torch._scaled_mm` |
|
Technical, mentions quirks, slight gramm |
2026-03-19 |
| PR |
0.07 |
Fix failing `Qwen3OmniModelIntegrationTests` |
|
Direct, context-rich, references actions |
2026-03-19 |
| PR |
0.06 |
Update some type hints |
|
Terse, domain-specific, informal (also/N |
2026-03-19 |
| COMMIT |
0.05 |
Fix unexpected `position_ids` keys when loading OwlViT model |
|
Uses domain language and concise technic |
2026-03-18 |
| COMMIT |
0.05 |
feat(integration): Add KubeflowCallback to enable automatic |
|
Standard signed-off commits; technical a |
2026-03-18 |
| COMMIT |
0.05 |
Centralize AI agent templates in `.ai` (#44489) |
|
Varsity of edits, casual phrases like 't |
2026-03-18 |
| PR |
0.05 |
Fix missing post_processor in DebertaV2Tokenizer causing no |
|
Minimal template completion, no AI-style |
2026-03-10 |
| PR |
0.05 |
Fix `layer_types` type hint for `AFMoE` and `Llama4` |
|
Direct, technical and includes reference |
2026-03-20 |
| PR |
0.05 |
Allow arbitrary template kwargs in processors |
|
Short, informal rationale; no markers of |
2026-03-20 |
| PR |
0.05 |
[docs] peft |
|
Changelog-like bulleted list with domain |
2026-03-18 |
| PR |
0.05 |
Fix `AutoImageProcessor` to correctly detect local implement |
|
Casual tone and '@' mention suggest huma |
2026-03-13 |
| PR |
0.05 |
Fix how PreTrainedModel checks annotations on Python 3.14+ |
|
Explains Python implementation detail; c |
2026-02-20 |
| PR |
0.05 |
[docs] training on specific hardware |
|
Terse, list-based description; highly hu |
2026-03-17 |
| PR |
0.05 |
[Mistral] Fix query scaling for Mistral4 and Ministral3 |
|
Domain-specific, terse, references offli |
2026-03-19 |
| PR |
0.05 |
Propagate the model loading from transformers serve to chat |
|
Technical, informal phrasing, references |
2026-03-16 |
| PR |
0.05 |
enable tp for benchmark |
|
Very terse, domain jargon, informal. |
2026-02-05 |
| PR |
0.05 |
Goodbye cache position |
|
Informal and terse, mentions WIP, has ty |
2026-03-13 |
| COMMIT |
0.00 |
Fix core dumped when `NemotronH` is torch compiled (#44854) |
|
Commit messages are terse with typical h |
2026-03-20 |
| COMMIT |
0.00 |
Fix several based models' pipeline parallel support (#44699) |
|
Pragmatic one-line descriptions and stan |
2026-03-20 |
| COMMIT |
0.00 |
fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures |
|
Concise technical message; style is typi |
2026-03-20 |
| COMMIT |
0.00 |
Fix dtype guessing from state dict (#44883) |
|
Very short, domain-specific commit title |
2026-03-20 |
| COMMIT |
0.00 |
Add missing dunder methods to `SizeDict` (#44884) |
|
Standard minimal commit summary; no AI h |
2026-03-20 |
| COMMIT |
0.00 |
Fix VL model rope_deltas batch size mismatch in online RL tr |
|
Short, technical, human-style summary an |
2026-03-20 |
| COMMIT |
0.00 |
Fix `layer_types` type hint for `AFMoE` and `Llama4` (#44874 |
|
Standard type hint update, signed by use |
2026-03-20 |
| COMMIT |
0.00 |
Align lfm2 cache to other mamba caches (#44866) |
|
Minimal, direct messages with informal c |
2026-03-20 |
| COMMIT |
0.00 |
Fix nemotron config docstrings (#44878) |
|
Terse domain description, matches human |
2026-03-20 |
| COMMIT |
0.00 |
Fix nemotron_h modular (#44876) |
|
Extremely minimal, rushed style; typical |
2026-03-20 |
| COMMIT |
0.00 |
feat: added cache to the model linter (#44790) |
|
Terse commit messages; no AI traits. |
2026-03-20 |
| COMMIT |
0.00 |
[Model] Add PP-Chart2Table Model Support (#43767) |
|
Minimal messages, human style, no AI hal |
2026-03-19 |
| COMMIT |
0.00 |
[Mistral] Fix query scaling for Mistral4 and Ministral3 (#44 |
|
Extremely brief message, typical human s |
2026-03-19 |
| COMMIT |
0.00 |
Propagate the model loading from transformers serve to chat |
|
Normal human commit structure and tone. |
2026-03-19 |
| COMMIT |
0.00 |
Update some type hints (#44851) |
|
Short, informal commit messages with hum |
2026-03-19 |
| COMMIT |
0.00 |
enable tp for benchmark (#43750) |
|
Informal tone, short commits, human-writ |
2026-03-19 |
| COMMIT |
0.00 |
Fix glm dsa (#44564) |
|
Single-word commit log, human-written. |
2026-03-19 |
| COMMIT |
0.00 |
🚨🚨 Refactor Image Processors to support different backends ( |
|
Short updates, human workflow on large P |
2026-03-19 |
| COMMIT |
0.00 |
[generate] Never use `cache_position` anymore in generation |
|
Human iterative commit process, no forma |
2026-03-19 |
| COMMIT |
0.00 |
Fix KeyError in convert_to_native_format for dict vocab (#44 |
|
Technical explanation, informal, human-w |
2026-03-19 |
| COMMIT |
0.00 |
fix: XLNet: relative_positional_encoding computes on CPU eve |
|
Concise commit messages with clear domai |
2026-03-19 |
| COMMIT |
0.00 |
Fix annotations reader for python 3.14 in `PreTrainedModel` |
|
Brief messages with specific version tar |
2026-03-19 |
| COMMIT |
0.00 |
[CB] Better parametrization for compile (#44578) |
|
Casual language, informal notes, and min |
2026-03-19 |
| COMMIT |
0.00 |
Fix `KeyError` when patching mistral regex (#43376) |
|
Succinct, technical commit logs; include |
2026-03-19 |
| COMMIT |
0.00 |
Correct code block formatting in weightconverter.md (#44839) |
|
Straightforward edit description typical |
2026-03-19 |
| COMMIT |
0.00 |
deepseek_v2, deepseek_v3, and modernbert fix for having inco |
|
Informal PR structure and terse notes su |
2026-03-18 |
| COMMIT |
0.00 |
[Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod |
|
Sequence of brief, domain-specific commi |
2026-03-18 |
| COMMIT |
0.00 |
Add `Jina-Embeddings-V3` Model (#44251) |
|
Modular commit breakdown with terse and |
2026-03-18 |
| COMMIT |
0.00 |
feat(ci): added a network debug report (#44636) |
|
Changelog includes domain jargon and inf |
2026-03-18 |
| COMMIT |
0.00 |
Add GreedyLR adaptive learning rate scheduler (#44271) |
|
Detailed, technical changelog with human |
2026-03-18 |
| COMMIT |
0.00 |
Update more modular examples (#44834) |
|
One-word human-typical commit message an |
2026-03-18 |
| COMMIT |
0.00 |
Fix and re-run modular converter on examples (#44833) |
|
Short, informal commit messages with typ |
2026-03-18 |
| COMMIT |
0.00 |
Remove cache_position in more models (4 and last one) (#4482 |
|
Terse, informal, and non-AI phrasing lik |
2026-03-18 |
| COMMIT |
0.00 |
Fix loading issue in Sam3 (#44831) |
|
Minimal human-typical 'fix loading issue |
2026-03-18 |
| COMMIT |
0.00 |
Add GGUF support for MiniMax-M2.1 model (#44526) |
|
No free-text, only PR title; human typic |
2026-03-18 |
| COMMIT |
0.00 |
support xxxFast alias in v5 tokenizers (#44766) |
|
Domain-typical short test/dev commit mes |
2026-03-18 |
| COMMIT |
0.00 |
Remove cache_position in more models (3) (#44759) |
|
Natural, informal, and technical commit |
2026-03-18 |
| COMMIT |
0.00 |
Fix `supports_{tp/pp}_plan` (#44696) |
|
Commit uses informal, terse messages wit |
2026-03-18 |
| COMMIT |
0.00 |
[CI] Temporarily skip Mistral4 tests as they almost all fail |
|
Extremely minimal message; classic human |
2026-03-18 |
| COMMIT |
0.00 |
update flex attention to use `return_aux` instead of `return |
|
Contains typos and informal language, ty |
2026-03-18 |
| COMMIT |
0.00 |
[Gemma] Update conversion scripts for Transformers v5 Comapt |
|
Direct, domain-specific commit messages; |
2026-03-18 |
| COMMIT |
0.00 |
fix bug embedding_size mismatch with hidden_size in electra |
|
Commit message is terse, with a typical |
2026-03-18 |
| COMMIT |
0.00 |
Fix pegasus conversion (#44571) |
|
Brief, technical, and mentions force mer |
2026-03-18 |
| COMMIT |
0.00 |
Fix repo-check bot (#44812) |
|
Single-word, informal message; clearly h |
2026-03-18 |
| COMMIT |
0.00 |
[docs] is_causal feature (#44777) |
|
Extremely terse, no AI markers, human co |
2026-03-17 |
| COMMIT |
0.00 |
docs(tasks): remove references to removed question-answering |
|
Detailed but natural explanation with do |
2026-03-17 |
| COMMIT |
0.00 |
Fix configs with `@strict` (#44770) |
|
Informal, expressive language; clear sig |
2026-03-17 |
| COMMIT |
0.00 |
[AMD CI] Fix test failures across important models (#44632) |
|
Commit messages are terse, use abbreviat |
2026-03-17 |
| COMMIT |
0.00 |
Move VLM conversions to the main mapping (#44627) |
|
Short, informal commit messages and huma |
2026-03-17 |
| COMMIT |
0.00 |
Fix config loading issues (type issues) (#44789) |
|
All messages are single word 'fix' or eq |
2026-03-17 |
| COMMIT |
0.00 |
Remove `is_causal` from `EuroBertConfig` (#44774) |
|
Very brief informal message; no AI style |
2026-03-17 |
| COMMIT |
0.00 |
model-linter: Added rule 10 (#44761) |
|
Terse summary; no signs of AI tone or ph |
2026-03-17 |
| COMMIT |
0.00 |
[fix] mistral 4 docs (#44776) |
|
Single word commit; no evidence of AI st |
2026-03-16 |
| COMMIT |
0.00 |
Add Mistral 4 (#44760) |
|
Uses typical human commit structure, wit |
2026-03-16 |
| COMMIT |
0.00 |
Fix: Eurobert model was missing @strict decorator and invali |
|
Contains technical explanation with doma |
2026-03-16 |
| COMMIT |
0.00 |
fix: sig lip import (#44764) |
|
Short, practical summary; not AI-like. |
2026-03-16 |
| COMMIT |
0.00 |
Disable async loading when quantizing on the fly (#44576) |
|
Informal style and suggestions; normal h |
2026-03-16 |
| COMMIT |
0.00 |
Bump torchao >=0.15 and fix quantization CI (#44604) |
|
Concise commit messages with domain-spec |
2026-03-16 |
| COMMIT |
0.00 |
Fix tensor indexing crash in serve generate_response KV cach |
|
Technical explanation with direct style |
2026-03-16 |
| COMMIT |
0.00 |
[MistralCommonBackend] Upgrade mistral-common to v1.10.0 (#4 |
|
Standard PR format with technical conten |
2026-03-16 |
| COMMIT |
0.00 |
Fix `mlcd` auto config/model/mapping issues (#44730) |
|
Short, informal commit messages with dom |
2026-03-16 |
| COMMIT |
0.00 |
Fix bug and add XPU Expectations for qwen2 and jamba tests ( |
|
Technical content with repeated signed-o |
2026-03-16 |
| COMMIT |
0.00 |
Add model lerobot PI0 to transformers (#44160) |
|
Informal commit style, domain abbreviati |
2026-03-16 |
| COMMIT |
0.00 |
[medasr] doc update (#44633) |
|
Simple doc update with direct co-authors |
2026-03-16 |
| COMMIT |
0.00 |
Idefics3 without cache fix (#44607) |
|
Technical fixes with direct notes and ex |
2026-03-16 |
| COMMIT |
0.00 |
Add XPU Expectations for vibe voice acoustic tokenizer tests |
|
Domain-specific content, formatted and s |
2026-03-16 |
| COMMIT |
0.00 |
Fix transformers serve's 422 unprocessable entity (#44620) |
|
Direct revert and terse technical descri |
2026-03-16 |
| COMMIT |
0.00 |
Fix missing / incorrect `config` class in some model class d |
|
Terse commit messages and domain-specifi |
2026-03-15 |
| COMMIT |
0.00 |
Update Nvidia CI docker file to use torch 2.10 (#44712) |
|
Direct, technical changelog with minimal |
2026-03-14 |
| COMMIT |
0.00 |
[`FA`] Fix fa detection (#44703) |
|
Short, domain-specific phrasing and fix |
2026-03-14 |
| COMMIT |
0.00 |
Fix `set_encoder` (#44698) |
|
Minimal message with domain context and |
2026-03-14 |
| COMMIT |
0.00 |
[docs] cb config (#44675) |
|
Extremely brief, informal commit message |
2026-03-13 |
| COMMIT |
0.00 |
Fix more model tester missing `parent` issue (#44685) |
|
Single-word message indicates typical hu |
2026-03-13 |
| COMMIT |
0.00 |
:rotating_light: [`FA4`] Initial support (#42435) |
|
Numerous terse, technical commit lines a |
2026-03-13 |
| COMMIT |
0.00 |
Add register method for `ParallelInterface` (#44640) |
|
Domain term 'feat' and concise summary; |
2026-03-13 |
| COMMIT |
0.00 |
[CB] [Bug] Fix crashes when running without cuda (#44673) |
|
Technical, non-formal phrasing and bulle |
2026-03-13 |
| COMMIT |
0.00 |
Another (small) set of fixes required for tiny model creatio |
|
Very brief and vague commit messages; co |
2026-03-13 |
| COMMIT |
0.00 |
Fix CookieCutter (#44334) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
Fix AWQ tests for GPTQModel migration (#44654) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
[Model] Add PP-OCRV5_mobile_det Model Support (#43247) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
pipelines do not have modelcard (#44621) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
[`Chmv2`] Fix conversion after capture refactor (#44665) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
fix(models, testing): Fix Llama4 vision rotary meta tensor i |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
[CB] Add dedicated config (#44434) |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
fix(models): Forward timm model kwargs to timm.create_model |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
Ensure same `dtype` for subconfig when `_from_config` (#4462 |
|
— |
2026-03-13 |
| COMMIT |
0.00 |
Remove `cache_position` in more models (2) (#44602) |
|
— |
2026-03-12 |
| COMMIT |
0.00 |
fix: cast to proper dtype in EmbeddingParallel (#44612) |
|
Concise commit message with domain jargo |
2026-03-12 |
| COMMIT |
0.00 |
Allow to disable stdout hiding for TP (#44608) |
|
Brief, technical commit messages using p |
2026-03-12 |
| COMMIT |
0.00 |
Remove many output_attentions and other traced outputs on 10 |
|
Informal, terse messages with typos and |
2026-03-12 |
| COMMIT |
0.00 |
[Model] Add PP-OCRV5_server_det Model Support (#43274) |
|
Patch summary uses informal, minimal sty |
2026-03-12 |
| COMMIT |
0.00 |
fix: raise error if mm_token_type_ids not supplied (#44433) |
|
Short, technical commit messages with do |
2026-03-12 |
| COMMIT |
0.00 |
Fix output capturing for Backbones (#44638) |
|
Minimal messages, domain specific, no AI |
2026-03-12 |
| COMMIT |
0.00 |
Fix lfm2 kernel path (#44634) |
|
Very terse, informal commit messages—str |
2026-03-12 |
| PR |
0.00 |
incorrect model list update |
|
Free-text is minimal and informal; templ |
2026-03-20 |
| PR |
0.00 |
[Model] Add SLANeXt Model Support |
|
Free-text concise, domain-appropriate, n |
2026-02-03 |
| PR |
0.00 |
chore(typing): added rule 11 |
|
Patch described with project jargon and |
2026-03-19 |
| PR |
0.00 |
Remove explicit cuda stream in nemotron_h |
|
Terse, domain-specific, and informal; no |
2026-03-20 |
| PR |
0.00 |
[Model] Add UVDoc Model Support |
|
Template partially filled, minimal info; |
2026-01-21 |
| PR |
0.00 |
refactor: unify QA calls |
|
Very terse, heavily abbreviated and info |
2026-03-20 |
| PR |
0.00 |
fix config type |
|
Terse title only, accompanied by informa |
2026-03-20 |
| PR |
0.00 |
DeepGEMM |
|
No free text authored—only template and |
2026-03-18 |
| PR |
0.00 |
Align lfm2 cache to other mamba caches |
|
Very terse, just references the title; d |
2026-03-19 |
| PR |
0.00 |
Add Mistral 4 |
|
Mostly template text, no filled-in conte |
2026-03-16 |
| PR |
0.00 |
Fix Mistral4 tests |
|
Just a link and terse title, clear human |
2026-03-18 |
| PR |
0.00 |
[docs] model cards |
|
Casual phrasing and specificity, typical |
2026-03-18 |
| PR |
0.00 |
Fix nemotron config docstrings |
|
Terse, references code issue; highly lik |
2026-03-20 |
| PR |
0.00 |
Fix nemotron_h modular |
|
No additional info beyond template; huma |
2026-03-20 |
| PR |
0.00 |
Fix CircleCI summary report not showing due to missing depen |
|
Informal, with typographic expressivenes |
2026-03-11 |
| PR |
0.00 |
[qwen3] fix generation tests |
|
No free-text content; not enough to judg |
2025-03-31 |
| PR |
0.00 |
[WIP] Add CharacterBERT model |
|
No free-text content; only template pres |
2023-10-05 |
| PR |
0.00 |
[Model] Add PP-Chart2Table Model Support |
|
No free-text content; only template pres |
2026-02-05 |
| PR |
0.00 |
Dequant fix |
|
No non-template content present for anal |
2026-03-18 |
| PR |
0.00 |
fix: XLNet: relative_positional_encoding computes on CPU eve |
|
Technical, concise, domain-specific bugf |
2026-03-17 |
| PR |
0.00 |
Fix annotations reader for python 3.14 in `PreTrainedModel` |
|
Brief, factual, and issue-targeted with |
2026-03-13 |
| PR |
0.00 |
fix: allow AutoImageProcessor to load from URL |
|
Clear technical focus and succinct expla |
2026-03-18 |
| PR |
0.00 |
[CB] [Minor] Simplify test suite |
|
Describes test suite simplification, inf |
2026-03-19 |
| PR |
0.00 |
fix: Add MXFP4 MoE/attention backward kernels |
|
Domain-specific summary, technical langu |
2026-02-05 |
| PR |
0.00 |
deepseek_v2, deepseek_v3, and modernbert fix for having inco |
|
Direct mention of versions and fixes; in |
2026-03-17 |
| PR |
0.00 |
fix: move comments before @torch.jit.script decorator for Py |
|
Technical Python version note, concise w |
2026-03-19 |
| PR |
0.00 |
Fix DEIM config export and public API |
|
Technical changelog; uses bulleted summa |
2026-03-19 |
| PR |
0.00 |
Add /v1/completions endpoint (OpenAI legacy completions API) |
|
Concise technical description, domain-sp |
2026-03-10 |