| COMMIT | 1.00 | Add expert parallelism (EP) config support for Qwen3 MoE (# | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | [`Privacy Filter`] Add model (#45580) | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add ForSequenceClassification heads for the OLMo family (#45 | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add /v1/completions endpoint (OpenAI legacy completions API) | Commit message contains explicit AI assi | 2026-04-22 |
| COMMIT | 1.00 | Add Neuron to auto-compile hardware list (#44757) | Commit message contains explicit AI assi | 2026-04-16 |
| COMMIT | 1.00 | Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex | Commit message contains explicit AI assi | 2026-04-13 |
| COMMIT | 1.00 | fix(serving): resolve rust tokenizer from ProcessorMixin in | Commit message contains explicit AI assi | 2026-04-13 |
| PR | 1.00 | fix(DSV3): parity between native `DeepseekV3MoE` and remote | PR body explicitly mentions AI collabora | 2026-04-14 |
| PR | 1.00 | Fix local_files_only tokenizer fallback when tokenizer files | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | fix transformers + torchao nvfp4 serialization | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | Add Xiaomi MiMo-V2 | PR body explicitly mentions AI collabora | 2026-03-31 |
| PR | 1.00 | Fix whisper long-form generation when eos_token_id is a list | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | Extract dynamic vision/audio tensors into standalone pure fu | PR body explicitly mentions AI collabora | 2026-04-13 |
| PR | 1.00 | Pass packed boundary metadata to Qwen3.5 linear-attention fa | PR body explicitly mentions AI collabora | 2026-03-26 |
| PR | 1.00 | Add automated reviewer assignment script | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | Update assign_reviewers.py | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | Remove warnings for modernbert | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | Proper nemotron H and 3 and 2 | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | Add runner selection for mi325 GPU type | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | perf: avoid recomputing rotary_emb for each layer in some Go | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | refactor(Dots1): drop Dots1MoE override to `pass` (inherits | PR body explicitly mentions AI collabora | 2026-04-22 |
| PR | 1.00 | [WIP] Check docstring runs on modular files | PR body explicitly mentions AI collabora | 2026-01-29 |
| PR | 1.00 | feat: make timesfm2_5 onnx export compatible | PR body explicitly mentions AI collabora | 2026-04-04 |
| PR | 1.00 | add expert parallelism for gemma-4-26B-A4B-it | PR body explicitly mentions AI collabora | 2026-04-07 |
| PR | 1.00 | Update Gemma4 weight conversion script | PR body explicitly mentions AI collabora | 2026-04-08 |
| PR | 1.00 | fix: apply channel averaging correctly in audio feature extr | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | Add EXAONE 4.5 implementations | PR body explicitly mentions AI collabora | 2026-04-16 |
| PR | 1.00 | T5Gemma2: fix `prepare_decoder_input_ids_from_labels` | PR body explicitly mentions AI collabora | 2026-04-19 |
| PR | 1.00 | feat: Add GGUF loading support for Llama 4 (text) | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 1.00 | Fix cross-attention cache layer type for T5Gemma2 long input | PR body explicitly mentions AI collabora | 2026-04-21 |
| PR | 0.70 | 🚨 Refactor ViT to updated standards | Phrases like 'This PR aims at...' and fo | 2025-10-17 |
| PR | 0.20 | Add expert parallelism (EP) config support for Qwen3 MoE | Detailed changelog with technical jargon | 2026-04-14 |
| PR | 0.20 | Update torchao usage for XPU and CPU | Technical changelog with domain jargon; | 2026-04-22 |
| PR | 0.15 | Add RF-DETR | Domain-specific terminology, no AI phras | 2025-03-21 |
| PR | 0.15 | Qwen3 ASR and Forced Aligner | Domain-specific description; direct and | 2026-02-08 |
| PR | 0.15 | Add IndexCache support for GLM5 DSA | Technical, but more formal; still human | 2026-04-14 |
| PR | 0.15 | fix: raise clear error when tokenizer config uses v5 list fo | Structured fix explanation; minor formal | 2026-04-22 |
| PR | 0.15 | [Sam3LiteText] Remove unnecessary modules/configs | Mentions specific PRs and users, with hu | 2026-04-20 |
| COMMIT | 0.10 | [Doc] Fix 'tokenized' -> 'tokenizer' typo in streamer docstr | Polished but references real authors; cl | 2026-04-20 |
| COMMIT | 0.10 | [Gemma4] Add docstrings for Per-Layer Embeddings (PLE) pipel | Uses technical detail and casual, non-fo | 2026-04-14 |
| COMMIT | 0.10 | fix: prevent accelerate from splitting vision encoder by set | Human, domain-focused writing and inform | 2026-04-14 |
| COMMIT | 0.10 | [Doc] MoE routing capture and replay recipe (#44925) | Informal style with partial sentences an | 2026-04-14 |
| PR | 0.10 | [WIP] Major processing refactor | Concise technical description, clear dom | 2026-04-17 |
| PR | 0.10 | 🚨 [ALM] Add base model without head | Technical, concise, informal; lacks AI p | 2026-04-20 |
| PR | 0.10 | [docs] multi-turn tool calling | Brief, specific changelog, no AI or gene | 2026-04-21 |
| PR | 0.10 | Allow for registered experts from kernels hub | Technical explanation with domain jargon | 2026-04-22 |
| PR | 0.10 | generate: drop stale num_return_sequences warning on continu | Technical description with domain specif | 2026-04-22 |
| PR | 0.10 | Add ForSequenceClassification heads for the OLMo family | Technical explanation, domain-specific, | 2026-04-21 |
| PR | 0.10 | Remove attribute_map from GptOssConfig | Technical with repo/context refs, inform | 2026-04-22 |
| PR | 0.10 | Fix EP: RouterParallel shape, tp_plan property, grouped_mm s | Informal, technical, and context-specifi | 2026-04-16 |
| PR | 0.10 | Fix EP + DeepSpeed ZeRO-3 loading via accelerate launch | Technical summary, domain references, hu | 2026-04-21 |
| PR | 0.10 | docs(pipeline): fix num_workers docstring default from 8 to | Technical, direct, with refs to implemen | 2026-04-22 |
| PR | 0.10 | Drop noisy generate warnings when do_sample=False (or num_be | Technical, domain-specific explanations; | 2026-04-22 |
| PR | 0.10 | [nemotron_h] Add support for MLP mixers | Describes technical addition, direct and | 2026-03-16 |
| PR | 0.10 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | Technical task breakdown; no AI-specific | 2026-04-18 |
| PR | 0.10 | Add Videoprism | Direct, terse domain reference; human re | 2025-08-04 |
| PR | 0.10 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | Duplicate content; technical, domain-spe | 2026-04-18 |
| PR | 0.10 | Add disable_mmap kwarg to from_pretrained with hf-mount auto | Feature explanation uses domain terms an | 2026-04-21 |
| PR | 0.10 | fix(testing_utils): guard get_device_capability with torch.c | Domain-specific, technical context; lack | 2026-04-09 |
| PR | 0.10 | [modular] Fix modular logic broken in #45045 | Technical rollback with explicit context | 2026-04-21 |
| PR | 0.10 | Add HyperCLOVAX SEED Think 14B | Descriptive, contains domain details/nam | 2026-03-23 |
| PR | 0.10 | Modular playground | Uses informal tone and domain abbreviati | 2026-02-04 |
| PR | 0.10 | cache_utils: fix QuantizedLayer to correctly propagate reord | Technical content, domain detail, not ge | 2026-04-19 |
| PR | 0.10 | feat: add Gemma4ForSequenceClassification | Short, domain-specific summary without A | 2026-04-07 |
| PR | 0.10 | NVFP4 quantization: streaming loader, fused MoE experts (Qwe | Brief, technical, PR title-like content | 2026-04-20 |
| PR | 0.10 | Add dtype config options for Four Over Six | Brief, domain-specific, and mentions ano | 2026-04-11 |
| COMMIT | 0.05 | perf: avoid recomputing rotary_emb for each layer in some Go | Commit messages are terse, technical, an | 2026-04-22 |
| COMMIT | 0.05 | Gemma4 training with text-only samples (#45454) | Brief, informal, and technical commit me | 2026-04-22 |
| COMMIT | 0.05 | [nemotron_h] Add support for MLP mixers (#44763) | Short, informal commit messages with dom | 2026-04-22 |
| COMMIT | 0.05 | add expert parallelism for gemma-4-26B-A4B-it (#45279) | Technical commit messages and Signed-off | 2026-04-22 |
| COMMIT | 0.05 | Add full GGUF loading support for GPT‑OSS (fixes #43366, sup | Technical, detail-oriented commit messag | 2026-04-22 |
| COMMIT | 0.05 | Update Gemma4 weight conversion script (#45328) | Technical commit messages, informal lang | 2026-04-22 |
| COMMIT | 0.05 | fix table update versions (#45544) | Very brief, technical, and informal comm | 2026-04-22 |
| COMMIT | 0.05 | Add disable_mmap kwarg to from_pretrained with hf-mount auto | Technical, template-driven changes, huma | 2026-04-22 |
| COMMIT | 0.05 | fix(DSV3): parity between native `DeepseekV3MoE` and remote | Technical description, terse phrasing, d | 2026-04-22 |
| PR | 0.05 | do not index past decoded chars with special tokens | Brief, domain-specific phrasing and issu | 2026-04-14 |
| PR | 0.05 | TP refactor for FSDP + TP integration | Informal, technical TODOs; human-written | 2026-03-26 |
| PR | 0.05 | [Model] Add SLANet Model Support | Minimal free-text, domain-specific revie | 2026-04-20 |
| PR | 0.05 | Gemma3n and Gemma4 cannot use rotary kernel | Brief, technical, informal phrasing; hum | 2026-04-22 |
| PR | 0.05 | fix: remove stale num_return_sequences warning in paged gene | Domain-specific, concise, no AI-like phr | 2026-04-22 |
| PR | 0.05 | fix(generation): remove stale warning for num_return_sequenc | Technical and concise, no AI hallmarks | 2026-04-22 |
| PR | 0.05 | feat(trainer): log individual losses from loss_dict | Domain jargon and informal tone, human s | 2026-04-22 |
| PR | 0.05 | [loading] Clean way to add/remove full parts in checkpoint n | Informal, terse, uses domain language | 2026-04-15 |
| PR | 0.05 | Add /v1/completions endpoint (OpenAI legacy completions API) | Technical, terse style, fits human autho | 2026-03-10 |
| PR | 0.05 | Updated the image cache for Paddle models according to the l | Brief, domain-specific and casual, human | 2026-04-22 |
| PR | 0.05 | Use correct mask for packed inputs in Qwen-VL | Direct, technical, brief; no AI signals | 2026-02-19 |
| PR | 0.05 | Gemma4 training with text-only samples | Domain-specific, informal tone, human st | 2026-04-15 |
| PR | 0.05 | fix: prevent IndexError in Whisper timestamp decode on trail | Concise, technical language, human-autho | 2026-03-25 |
| PR | 0.05 | Gemma4: fix failed test cases | Terse, bug-focused; uses human-like brev | 2026-04-22 |
| PR | 0.05 | Move some conversion mappings to PrefixChange | Brief, informal, includes cc and verific | 2026-04-22 |
| PR | 0.05 | Align gemma3n cache sharing to gemma4 | Concise, domain-specific, references PRs | 2026-04-17 |
| PR | 0.05 | sam3_lite_text: skip flash_attn_2_can_dispatch_composite_mod | Very brief, uses informal request; human | 2026-04-16 |
| PR | 0.05 | Fix conditional check for float formatting | Brief, direct technical explanation; hum | 2026-03-04 |
| COMMIT | 0.00 | do not index past decoded chars with special tokens (#45435) | Informal tone, domain-specific phrasing, | 2026-04-22 |
| COMMIT | 0.00 | Update dev version (#45583) | Brief, informal; lacks AI stylistic cues | 2026-04-22 |
| COMMIT | 0.00 | Update torchao usage for XPU and CPU (#45560) | Very terse, technical, humanlike style. | 2026-04-22 |
| COMMIT | 0.00 | [docs] per-request sampling params (#45553) | Minimal, informal; clear domain-specific | 2026-04-22 |
| COMMIT | 0.00 | Add IndexCache support for GLM5 DSA (#45424) | Casual tone, domain jargon, human abbrev | 2026-04-22 |
| COMMIT | 0.00 | Fix redundant logic in video processing SmolVLM (#45272) | Brief, casual style, domain-specific wor | 2026-04-22 |
| COMMIT | 0.00 | Fix typos (#45574) | Domain jargon, typos, terse tone; human | 2026-04-22 |
| COMMIT | 0.00 | Updated the image cache for Paddle models according to the l | Terse style, domain language, minimal ex | 2026-04-22 |
| COMMIT | 0.00 | [Model] Add SLANet Model Support (#45532) | Brief changelog, domain signals, informa | 2026-04-22 |
| COMMIT | 0.00 | refactor(Dots1): drop Dots1MoE override to `pass` (inherits | Domain jargon, concise, no AI signals de | 2026-04-22 |
| COMMIT | 0.00 | Move some conversion mappings to PrefixChange (#45567) | Extremely terse commit; no AI signals, h | 2026-04-22 |
| COMMIT | 0.00 | Align gemma3n cache sharing to gemma4 (#45489) | Terse, informal commit messages; lacks A | 2026-04-22 |
| COMMIT | 0.00 | [modular] Fix modular logic broken in #45045 (#45539) | Very brief, informal with typos; clearly | 2026-04-22 |
| COMMIT | 0.00 | Fix: propagate quantization_config to text sub-config for co | Commit message uses terse technical lang | 2026-04-21 |
| COMMIT | 0.00 | T5Gemma2: fix `prepare_decoder_input_ids_from_labels` (#4551 | Technical and succinct commit summary, n | 2026-04-21 |
| COMMIT | 0.00 | [Trainer] Add ddp_static_graph option (#45519) | Detailed, technical explanation; no AI m | 2026-04-21 |
| COMMIT | 0.00 | Add dtype config options for Four Over Six (#45367) | Technical message with co-author trailer | 2026-04-21 |
| COMMIT | 0.00 | [Sam3LiteText] Remove unnecessary modules/configs (#45535) | Brief, domain-specific commit messages, | 2026-04-21 |
| COMMIT | 0.00 | Fix conditional check for float formatting (#44425) | Minimal, technical statement; no AI sign | 2026-04-21 |
| COMMIT | 0.00 | Fix AMD CI: rebuild torchvision with libjpeg + refresh expec | Terse, technical, with human-typical sty | 2026-04-21 |
| COMMIT | 0.00 | Fix EP: RouterParallel shape, tp_plan property, grouped_mm s | Highly technical, includes code/jargon; | 2026-04-21 |
| COMMIT | 0.00 | feat[vLLM × v5]: Add vLLM compatibility for audio models (#4 | Consistent with human commit conventions | 2026-04-21 |
| COMMIT | 0.00 | Reapply modular to examples (#45527) | Short, domain-specific phrase, likely hu | 2026-04-21 |
| COMMIT | 0.00 | qa: re-run modular converter when the script itself is modif | Concise, domain-specific commit; informa | 2026-04-20 |
| COMMIT | 0.00 | [GGUF] Reduce peak RAM usage by casting dequantized tensors | Standard PR/commit with domain details a | 2026-04-20 |
| COMMIT | 0.00 | Fix CSM `TextToAudioPipeline` missing `` token (#45525) | Terse, human shorthand, dev email signat | 2026-04-20 |
| COMMIT | 0.00 | Raise 400 on model mismatch when `transformers serve` is pin | Brief, domain-specific terms, minimal ex | 2026-04-20 |
| COMMIT | 0.00 | [serve] Update tool call to switch to `parse_response` (#454 | Direct, brief language with dev collabor | 2026-04-20 |
| COMMIT | 0.00 | [`Conversion Mapping`] Small fixups (#45483) | Single-word commit messages, clearly hum | 2026-04-20 |
| COMMIT | 0.00 | :rotating_light: [`Kernels`] Fix kernel function registratio | Informal, terse, and domain-focused chan | 2026-04-20 |
| COMMIT | 0.00 | Revert "Fix: modular image processors (#45492)" (#45531) | Standard revert message auto-generated b | 2026-04-20 |
| COMMIT | 0.00 | fix: return empty tuple from import_protobuf_decode_error wh | Detailed technical bug explanation, info | 2026-04-20 |
| COMMIT | 0.00 | throw error when conversion required (#45078) | Multiple small, informal commit messages | 2026-04-20 |
| COMMIT | 0.00 | chore: bump doc-builder SHA for PR upload workflow (#45450) | Standard commit message format, no AI si | 2026-04-20 |
| COMMIT | 0.00 | xpu output align with cuda in test case (#45526) | Domain-specific terms, signed by a human | 2026-04-20 |
| COMMIT | 0.00 | chore(qa): split out mlinter (#45475) | Terse, informative, and domain-specific | 2026-04-20 |
| COMMIT | 0.00 | Fix: modular image processors (#45492) | Single word message, clearly human brevi | 2026-04-20 |
| COMMIT | 0.00 | [loading] Clean way to add/remove full parts in checkpoint n | Terse, includes typos and informal comme | 2026-04-20 |
| COMMIT | 0.00 | Fix Zamba2MambaMixer ignoring use_mamba_kernels=False (#4485 | Terse commits, edits, and references; no | 2026-04-20 |
| COMMIT | 0.00 | revert sha commit pointing to main for transformers_amd_ci_ | Brief, technical, terse revert commit ty | 2026-04-17 |
| COMMIT | 0.00 | Fix ZeRO-3 from_pretrained: load registered buffers in _load | Technical language, specific errors, dir | 2026-04-17 |
| COMMIT | 0.00 | Remove redundant condition checks in `get_image_size` method | Terse bullet points, technical, informal | 2026-04-17 |
| COMMIT | 0.00 | add Qianfan-OCR model definition (#45280) | Informal, iterative fix messages, domain | 2026-04-17 |
| COMMIT | 0.00 | Add check-auto in repo-consistency and fix sorting (#45481) | Colloquial wording, speculative ('maybe? | 2026-04-17 |
| COMMIT | 0.00 | Fix typos in src/transformers/utils/output_capturing.py (#45 | Concise typo fix, no free-text, purely a | 2026-04-17 |
| COMMIT | 0.00 | typing: rule 15 - checks for tie_word_embeddings presence (# | Brief update notes, technical context, c | 2026-04-17 |
| COMMIT | 0.00 | [CB] Fix capture of max_seqlen (#45323) | Informal commit titles, multiple granula | 2026-04-17 |
| COMMIT | 0.00 | Fix response api support (#45463) | Commit messages are terse, informal, sho | 2026-04-16 |
| COMMIT | 0.00 | Minor update (#45484) | Minimal human-written commit; contains C | 2026-04-16 |
| COMMIT | 0.00 | Allow loading Qwen Thinker 'base' models without generative | Technical explanation, domain language, | 2026-04-16 |
| COMMIT | 0.00 | [`fix`] Always early return for non-Mistral models in _patch | Human-style summary and messages, includ | 2026-04-16 |
| COMMIT | 0.00 | Fix spurious position_ids warnings for at least 40 architect | Structured, technical commit explanation | 2026-04-16 |
| COMMIT | 0.00 | [`fix`] Make Qwen2_5OmniProcessor warning a lot less noisy v | Multiple edits and reverts; informal sty | 2026-04-16 |
| COMMIT | 0.00 | Dynamic auto mapping (#45018) | Terse, messy progression; highly informa | 2026-04-16 |
| COMMIT | 0.00 | [serve] Forward `tool_calls`/`tool_call_id` in processor inp | Commit message is concise, technical, an | 2026-04-15 |
| COMMIT | 0.00 | [docs] vlm addition (#45271) | Short, informal commit messages with min | 2026-04-15 |
| COMMIT | 0.00 | fix: dont download artifacts from the test hub (#45319) | Commit log has informal, domain-specific | 2026-04-15 |
| COMMIT | 0.00 | refactor(qa): extend extras so ty can run on server modules | Brief, domain-specific commit message wi | 2026-04-15 |
| COMMIT | 0.00 | fix(clipseg): fix 2 failing tests (#45403) | Technical, informal bullet points, signe | 2026-04-15 |
| COMMIT | 0.00 | [docs] @auto_docstring decorator (#45130) | Concise, informal commit messages typica | 2026-04-15 |
| COMMIT | 0.00 | Fix Sam3Processor missing input_boxes_labels for padded None | Detailed technical explanation using dom | 2026-04-15 |
| COMMIT | 0.00 | Multimodal serve support (#45220) | Informal, domain-based commit history, h | 2026-04-15 |
| COMMIT | 0.00 | better grad acc tests (#45434) | Terse commit message with domain abbrevi | 2026-04-15 |
| COMMIT | 0.00 | avoid wrap 4bit-quantized model into DP (#45407) | Signed by human; no AI indicators in con | 2026-04-15 |
| COMMIT | 0.00 | Add example for iterative chatting with MLLMs (#45398) | Commit message is terse and has co-autho | 2026-04-15 |
| COMMIT | 0.00 | Gemma4 resizing per layer inputs (#45324) | Commit message is short, uses domain jar | 2026-04-15 |
| COMMIT | 0.00 | Add `step3_vl` to `MODELS_WITH_INCORRECT_HUB_TOKENIZER_CLASS | Commit is a standard changelog with clea | 2026-04-15 |
| COMMIT | 0.00 | Update workflow references to new commit hash (#45442) | Standard commit message; minimal human w | 2026-04-14 |
| COMMIT | 0.00 | [Doc] Correct checkpoint path in Dinov2 model_docs (#45430) | Commit is concise and fixes a specific t | 2026-04-14 |
| COMMIT | 0.00 | Fix ty for transformers cli (#45190) | Casual, terse commit messages typical of | 2026-04-14 |
| COMMIT | 0.00 | fix(models): Resolve regressions in Wav2Vec2PhonemeCTCTokeni | Natural, technical commit flow; includes | 2026-04-14 |
| COMMIT | 0.00 | Fix Qwen2.5VL temporal grid positions (#45400) | Commit messages are brief, casual, and h | 2026-04-14 |
| COMMIT | 0.00 | [`fix`] PEFT integration fixes preventing save/load & integr | Technical, concise, with human co-author | 2026-04-14 |
| COMMIT | 0.00 | Fix the response schema for the gemma4 converter (#45411) | Short, direct PR summary; no AI-style ph | 2026-04-14 |
| COMMIT | 0.00 | Fix `apply_chat_template` crash on `tool_call` messages with | Terse commit messages with domain terms; | 2026-04-13 |
| COMMIT | 0.00 | Add SAM3-LiteText (#44320) | Very terse, incremental commit messages; | 2026-04-13 |
| COMMIT | 0.00 | Fix IndexError with DeepSpeed ZeRO-3 when kernels rotary is | Detailed explanation with domain context | 2026-04-13 |
| COMMIT | 0.00 | [AMD CI] Fix torch.compile/export failures on AMD CI due to | Brief, informal commit messages; typical | 2026-04-13 |
| COMMIT | 0.00 | [inference_fusion] convert conv3d patch embed to linear (#45 | Informal commit messages with technical | 2026-04-13 |
| COMMIT | 0.00 | Fix #45305 + add regression test GAS (#45349) | Terse, informal tone with inline technic | 2026-04-13 |
| PR | 0.00 | [docs] cb memory management | Short, informal follow-up; human phrasin | 2026-04-22 |
| PR | 0.00 | Fix void segmentation map label reduction | Direct changelog, uses domain terms; lac | 2026-04-14 |
| PR | 0.00 | 🔴🔴🔴 fix: skip `clean_up_tokenization` for BPE tokenizers in | Contains direct technical details and co | 2026-03-21 |
| PR | 0.00 | Fix: Import torch.nn for type hints even when PyTorch is dis | Specific technical error fix; informal t | 2026-02-06 |
| PR | 0.00 | Add Audio-Visual Flamingo model | Contains project-specific context; lacks | 2026-04-22 |
| PR | 0.00 | Add Mistral tokenizer missing methods | Domain-specific explanation; no AI-style | 2025-10-25 |
| PR | 0.00 | audio tester class | Technical, informal explanation with dom | 2026-04-13 |
| PR | 0.00 | qa: bumped mlinter and allow local override | Terse, changelog-style content; uses dom | 2026-04-22 |
| PR | 0.00 | Update dev version | Extremely terse, informal message; certa | 2026-04-22 |
| PR | 0.00 | [docs] model testing | Terse, domain-focused, informal structur | 2026-03-31 |
| PR | 0.00 | [docs] per-request sampling params | Short, minimal, specific to human entry | 2026-04-21 |
| PR | 0.00 | [`Privacy Filter`] Add model | Very terse, informal ('As per title'), c | 2026-04-22 |
| PR | 0.00 | Fix redundant logic in video processing SmolVLM | Very terse and personalized with CC; hum | 2026-04-06 |
| PR | 0.00 | model: Add DEIMv2 to Transformers | Enthusiastic, informal, direct links; hu | 2026-02-27 |
| PR | 0.00 | Fix typos | Super terse, minimal content; clearly hu | 2026-04-22 |
| PR | 0.00 | Add Deepseek-OCR-2 model | Direct, brief, domain references; no AI | 2026-03-27 |
| PR | 0.00 | Add Molmo2 | Cut off before any free-text; cannot ass | 2026-01-23 |
| PR | 0.00 | fix table update versions | Informal tone, specific technical refere | 2026-04-21 |
| PR | 0.00 | Add image processors refactor to v5 migration guide | Concise, human casual reference, interna | 2026-04-21 |
| PR | 0.00 | [Tests] [Bugfix] Make weights tied for `dynamic_tied_weights | Domain-specific, terse, human-context ex | 2025-07-28 |
| PR | 0.00 | [Bugfix] Fix `AutoModel.from_pretrained(..., quantization_co | Detailed, technical, with error tracebac | 2025-07-29 |
| PR | 0.00 | [docs] modular transformers | Terse, structured as changelog, technica | 2026-04-08 |
| PR | 0.00 | [docs] contributing | Bulleted, brief, informal doc update not | 2026-04-15 |
| PR | 0.00 | [docs] distributed training | Bulleted, direct, pragmatic style, no AI | 2026-03-03 |
| PR | 0.00 | [CB] Changes for long generation | Extremely minimal, no signals of AI-gene | 2026-04-20 |
| PR | 0.00 | Fix: propagate quantization_config to text sub-config for co | Detailed, technical, with typos/awkward | 2026-04-17 |
| PR | 0.00 | [Trainer] Add ddp_static_graph option | Technical summary, concise, not overly f | 2026-04-20 |