GitHub AI Radar

Type	AI Score	Description	Actor	Reason	Date
COMMIT	1.00	Update AFMoE architecture to use v5-style MoE impl (#44063)	AutumnAurelium	Commit message contains explicit AI assi	2026-03-19
COMMIT	1.00	Sdpa for owlvit (#42136)	Aravind-11	Commit message contains explicit AI assi	2026-03-17
COMMIT	1.00	:rotating_light: Validate config attributes (#41250)	zucchini-nlp	Commit message contains explicit AI assi	2026-03-16
COMMIT	1.00	Fix off-by-one in decode_spans boundary check (#44584)	mvanhorn	Commit message contains explicit AI assi	2026-03-12
PR	1.00	Fix #44155: [AudioFlamingo3] Batched inference produces inco	danielalanbates	PR body explicitly mentions AI collabora	2026-02-21
COMMIT	0.00	Fix glm dsa (#44564)	ArthurZucker	—	2026-03-19
COMMIT	0.00	🚨🚨 Refactor Image Processors to support different backends (	yonigozlan	—	2026-03-19
COMMIT	0.00	[generate] Never use `cache_position` anymore in generation	Cyrilvallez	—	2026-03-19
COMMIT	0.00	Fix KeyError in convert_to_native_format for dict vocab (#44	weiguangli-io	—	2026-03-19
COMMIT	0.00	fix: XLNet: relative_positional_encoding computes on CPU eve	JiwaniZakir	—	2026-03-19
COMMIT	0.00	Fix annotations reader for python 3.14 in `PreTrainedModel`	neo	—	2026-03-19
COMMIT	0.00	[CB] Better parametrization for compile (#44578)	remi-or	—	2026-03-19
COMMIT	0.00	Fix `KeyError` when patching mistral regex (#43376)	LeonardoEmili	—	2026-03-19
COMMIT	0.00	Correct code block formatting in weightconverter.md (#44839)	zhulinchng	—	2026-03-19
COMMIT	0.00	deepseek_v2, deepseek_v3, and modernbert fix for having inco	itazap	—	2026-03-18
COMMIT	0.00	[Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod	zhang-prog	—	2026-03-18
COMMIT	0.00	Add `Jina-Embeddings-V3` Model (#44251)	Sai-Suraj-27	—	2026-03-18
COMMIT	0.00	feat(ci): added a network debug report (#44636)	tarekziade	—	2026-03-18
COMMIT	0.00	Add GreedyLR adaptive learning rate scheduler (#44271)	balak4	—	2026-03-18
COMMIT	0.00	Fix unexpected `position_ids` keys when loading OwlViT model	KartikPawade	—	2026-03-18
COMMIT	0.00	Update more modular examples (#44834)	Cyrilvallez	—	2026-03-18
COMMIT	0.00	Fix and re-run modular converter on examples (#44833)	Cyrilvallez	—	2026-03-18
COMMIT	0.00	Remove cache_position in more models (4 and last one) (#4482	Cyrilvallez	—	2026-03-18
COMMIT	0.00	Fix loading issue in Sam3 (#44831)	zucchini-nlp	—	2026-03-18
COMMIT	0.00	feat(integration): Add KubeflowCallback to enable automatic	abhijeet-dhumal	—	2026-03-18
COMMIT	0.00	Add GGUF support for MiniMax-M2.1 model (#44526)	JoursBleu	—	2026-03-18
COMMIT	0.00	Centralize AI agent templates in `.ai` (#44489)	tarekziade	—	2026-03-18
COMMIT	0.00	support xxxFast alias in v5 tokenizers (#44766)	itazap	—	2026-03-18
COMMIT	0.00	Remove cache_position in more models (3) (#44759)	Cyrilvallez	—	2026-03-18
COMMIT	0.00	Fix `supports_{tp/pp}_plan` (#44696)	hmellor	—	2026-03-18
COMMIT	0.00	[CI] Temporarily skip Mistral4 tests as they almost all fail	Cyrilvallez	—	2026-03-18
COMMIT	0.00	update flex attention to use `return_aux` instead of `return	ntenenz	—	2026-03-18
COMMIT	0.00	[Gemma] Update conversion scripts for Transformers v5 Comapt	RyanMullins	—	2026-03-18
COMMIT	0.00	fix bug embedding_size mismatch with hidden_size in electra	kaixuanliu	—	2026-03-18
COMMIT	0.00	Fix pegasus conversion (#44571)	ArthurZucker	—	2026-03-18
COMMIT	0.00	Fix repo-check bot (#44812)	ydshieh	—	2026-03-18
COMMIT	0.00	[docs] is_causal feature (#44777)	stevhliu	—	2026-03-17
COMMIT	0.00	docs(tasks): remove references to removed question-answering	BillionClaw	—	2026-03-17
COMMIT	0.00	Fix configs with `@strict` (#44770)	zucchini-nlp	—	2026-03-17
COMMIT	0.00	[AMD CI] Fix test failures across important models (#44632)	Abdennacer-Badaoui	—	2026-03-17
COMMIT	0.00	Move VLM conversions to the main mapping (#44627)	zucchini-nlp	—	2026-03-17
COMMIT	0.00	Fix config loading issues (type issues) (#44789)	ydshieh	—	2026-03-17
COMMIT	0.00	Remove `is_causal` from `EuroBertConfig` (#44774)	ydshieh	—	2026-03-17
COMMIT	0.00	model-linter: Added rule 10 (#44761)	tarekziade	—	2026-03-17
COMMIT	0.00	[fix] mistral 4 docs (#44776)	stevhliu	—	2026-03-16
COMMIT	0.00	Add Mistral 4 (#44760)	juliendenize	—	2026-03-16
COMMIT	0.00	Fix: Eurobert model was missing @strict decorator and invali	tarekziade	—	2026-03-16
COMMIT	0.00	fix: sig lip import (#44764)	tarekziade	—	2026-03-16
COMMIT	0.00	Disable async loading when quantizing on the fly (#44576)	SunMarc	—	2026-03-16
COMMIT	0.00	Bump torchao >=0.15 and fix quantization CI (#44604)	SunMarc	—	2026-03-16
COMMIT	0.00	Fix tensor indexing crash in serve generate_response KV cach	mango766	—	2026-03-16
COMMIT	0.00	[MistralCommonBackend] Upgrade mistral-common to v1.10.0 (#4	juliendenize	—	2026-03-16
COMMIT	0.00	Fix `mlcd` auto config/model/mapping issues (#44730)	ydshieh	—	2026-03-16
COMMIT	0.00	Fix bug and add XPU Expectations for qwen2 and jamba tests (	kaixuanliu	—	2026-03-16
COMMIT	0.00	Add model lerobot PI0 to transformers (#44160)	molbap	—	2026-03-16
COMMIT	0.00	[medasr] doc update (#44633)	eustlb	—	2026-03-16
COMMIT	0.00	Idefics3 without cache fix (#44607)	gabe-l-hart	—	2026-03-16
COMMIT	0.00	Add XPU Expectations for vibe voice acoustic tokenizer tests	kaixuanliu	—	2026-03-16
COMMIT	0.00	Fix transformers serve's 422 unprocessable entity (#44620)	LysandreJik	—	2026-03-16
COMMIT	0.00	Fix missing / incorrect `config` class in some model class d	ydshieh	—	2026-03-15
COMMIT	0.00	Update Nvidia CI docker file to use torch 2.10 (#44712)	ydshieh	—	2026-03-14
COMMIT	0.00	[`FA`] Fix fa detection (#44703)	vasqu	—	2026-03-14
COMMIT	0.00	Fix `set_encoder` (#44698)	hmellor	—	2026-03-14
COMMIT	0.00	[docs] cb config (#44675)	stevhliu	—	2026-03-13
COMMIT	0.00	Fix more model tester missing `parent` issue (#44685)	ydshieh	—	2026-03-13
COMMIT	0.00	:rotating_light: [`FA4`] Initial support (#42435)	vasqu	—	2026-03-13
COMMIT	0.00	Add register method for `ParallelInterface` (#44640)	michaelbenayoun	—	2026-03-13
COMMIT	0.00	[CB] [Bug] Fix crashes when running without cuda (#44673)	remi-or	—	2026-03-13
COMMIT	0.00	Another (small) set of fixes required for tiny model creatio	ydshieh	—	2026-03-13
COMMIT	0.00	Fix CookieCutter (#44334)	NielsRogge	—	2026-03-13
COMMIT	0.00	Fix AWQ tests for GPTQModel migration (#44654)	jiqing-feng	—	2026-03-13
COMMIT	0.00	[Model] Add PP-OCRV5_mobile_det Model Support (#43247)	XingweiDeng	—	2026-03-13
COMMIT	0.00	pipelines do not have modelcard (#44621)	KoichiYasuoka	—	2026-03-13
COMMIT	0.00	[`Chmv2`] Fix conversion after capture refactor (#44665)	vasqu	—	2026-03-13
COMMIT	0.00	fix(models, testing): Fix Llama4 vision rotary meta tensor i	harshaljanjani	—	2026-03-13
COMMIT	0.00	[CB] Add dedicated config (#44434)	remi-or	—	2026-03-13
COMMIT	0.00	fix(models): Forward timm model kwargs to timm.create_model	harshaljanjani	—	2026-03-13
COMMIT	0.00	Ensure same `dtype` for subconfig when `_from_config` (#4462	zucchini-nlp	—	2026-03-13
COMMIT	0.00	Remove `cache_position` in more models (2) (#44602)	Cyrilvallez	—	2026-03-12
COMMIT	0.00	fix: cast to proper dtype in EmbeddingParallel (#44612)	michaelbenayoun	—	2026-03-12
COMMIT	0.00	Allow to disable stdout hiding for TP (#44608)	michaelbenayoun	—	2026-03-12
COMMIT	0.00	Remove many output_attentions and other traced outputs on 10	molbap	—	2026-03-12
COMMIT	0.00	[Model] Add PP-OCRV5_server_det Model Support (#43274)	XingweiDeng	—	2026-03-12
COMMIT	0.00	fix: raise error if mm_token_type_ids not supplied (#44433)	leopold-tzafon	—	2026-03-12
COMMIT	0.00	Fix output capturing for Backbones (#44638)	Cyrilvallez	—	2026-03-12
COMMIT	0.00	Fix lfm2 kernel path (#44634)	Cyrilvallez	—	2026-03-12
COMMIT	0.00	Fix for `VibeVoiceAcousticTokenizer` (#44628)	ydshieh	—	2026-03-12
COMMIT	0.00	Add an integration test for LASR using pipe and chunked deco	kho	—	2026-03-12
COMMIT	0.00	Fix more wrong HF hub checkpoint names (#44624)	ydshieh	—	2026-03-12
COMMIT	0.00	Update agentic contributions guidelines in AGENTS.md to forc	burtenshaw	—	2026-03-12
COMMIT	0.00	Expand model-structure lint rules with a fast AST-based, ruf	tarekziade	—	2026-03-12
COMMIT	0.00	feat: add neuron in tensor parallelism initialization (#4449	michaelbenayoun	—	2026-03-11
COMMIT	0.00	[WIP] FIX Make Mixtral LoRA loading work (#44478)	BenjaminBossan	—	2026-03-11
COMMIT	0.00	Fix Llava tests for torch too! (#44476)	Rocketknight1	—	2026-03-11
COMMIT	0.00	Fix training ci and clean some tests (#44491)	SunMarc	—	2026-03-11
COMMIT	0.00	Add CHMv2 (#44595)	yonigozlan	—	2026-03-11
COMMIT	0.00	Remove useless identity assignment (#44600)	Cyrilvallez	—	2026-03-11
COMMIT	0.00	Add Yoni to run-slow workflow (#44598)	vasqu	—	2026-03-11
COMMIT	0.00	Add shared VLM tests (#42964)	Rocketknight1	—	2026-03-11
COMMIT	0.00	Fix wrong (non-existing) checkpoints (#44549)	ydshieh	—	2026-03-11
COMMIT	0.00	Remove `cache_position` in more models (#44330)	Cyrilvallez	—	2026-03-11
PR	0.00	Switch FP8 per tensor quant to use `torch._scaled_mm`	SunMarc	—	2026-03-19
PR	0.00	DeepGEMM	IlyasMoutawwakil	—	2026-03-18
PR	0.00	Update some type hints	zucchini-nlp	—	2026-03-19
PR	0.00	Proposal to add Qwen3-ASR support [WIP]	mbtariq82	—	2026-02-08
PR	0.00	[Model] Add PP-Chart2Table Model Support	XingweiDeng	—	2026-02-05
PR	0.00	Dequant fix	ArthurZucker	—	2026-03-18
PR	0.00	[Model] Add SLANeXt Model Support	liu-jiaxuan	—	2026-02-03
PR	0.00	🚨 Refactor ViT to updated standards	yonigozlan	—	2025-10-17
PR	0.00	Add THD support in ESM	balvisio	—	2026-02-19
PR	0.00	[Model] Add UVDoc Model Support	XingweiDeng	—	2026-01-21
PR	0.00	feat: added cache to the model linter	tarekziade	—	2026-03-17
PR	0.00	Propagate the model loading from transformers serve to chat	LysandreJik	—	2026-03-16
PR	0.00	chore(typing): extend typing to `src/transformers/cli`	tarekziade	—	2026-03-10
PR	0.00	Fix core dumped when `NemotronH` is torch compiled	ydshieh	—	2026-03-19
PR	0.00	Officially launch parse_response	Rocketknight1	—	2026-03-13
PR	0.00	[CB] Add an option to return logprobs	remi-or	—	2026-03-18
PR	0.00	fix: handle list-type _tied_weights_keys in _get_tied_weight	gh-wf	—	2026-03-19
PR	0.00	Fix glm dsa	ArthurZucker	—	2026-03-10
PR	0.00	[PoC] HF exporters	IlyasMoutawwakil	—	2025-11-03
PR	0.00	[Mistral] Fix query scaling for Mistral4 and Ministral3	Cyrilvallez	—	2026-03-19
PR	0.00	Fix several based models' pipeline parallel support	hmellor	—	2026-03-14
PR	0.00	Support Modular (!!) + Configs in `check_auto_docstrings`	yonigozlan	—	2026-03-17
PR	0.00	Fix failing `Qwen3OmniModelIntegrationTests`	Sai-Suraj-27	—	2026-03-19
PR	0.00	🚨🚨 Refactor Image Processors to support different backends	yonigozlan	—	2026-01-27
PR	0.00	Dynamic weight conversion is recursive	zucchini-nlp	—	2026-02-26
PR	0.00	FSDP2 native support in transformers	3outeille	—	2026-02-17
PR	0.00	[generate] Never use `cache_position` anymore in generation	Cyrilvallez	—	2026-03-18
PR	0.00	add HyperClovaX Vision	jp1924	—	2026-02-27
PR	0.00	perceptron: Isaac-0.1 implementation	AkshatSh	—	2025-09-18
PR	0.00	refactor: rope in model, flatten vision, rely on qwen3 backo	philippguevorguian	—	2026-03-19
PR	0.00	enable tp for benchmark	sywangyi	—	2026-02-05
PR	0.00	Update AFMoE architecture to use v5-style MoE impl	AutumnAurelium	—	2026-02-17
PR	0.00	Fix KeyError in convert_to_native_format for dict vocab	weiguangli-io	—	2026-03-05
PR	0.00	Use `index_select` instead of advanced indexing in `batched_	dacorvo	—	2026-03-13
PR	0.00	fix: XLNet: relative_positional_encoding computes on CPU eve	JiwaniZakir	—	2026-03-17
PR	0.00	Fix annotations reader for python 3.14 in `PreTrainedModel`	neo	—	2026-03-13
PR	0.00	fix: allow AutoImageProcessor to load from URL	BillionClaw	—	2026-03-18
PR	0.00	Add Music Flamingo	lashahub	—	2026-01-27
PR	0.00	[CB] [Minor] Simplify test suite	remi-or	—	2026-03-19
PR	0.00	fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures	harshaljanjani	—	2026-03-16
PR	0.00	fix: Add MXFP4 MoE/attention backward kernels	leoneperdigao	—	2026-02-05
PR	0.00	fix: handle unpicklable tokenizers in ProcessorMixin.to_dict	themavik	—	2026-03-19
PR	0.00	deepseek_v2, deepseek_v3, and modernbert fix for having inco	itazap	—	2026-03-17
PR	0.00	fix: move comments before @torch.jit.script decorator for Py	hkc5	—	2026-03-19
PR	0.00	Fix DEIM config export and public API	sahilleth	—	2026-03-19
PR	0.00	Add /v1/completions endpoint (OpenAI legacy completions API)	rain-1	—	2026-03-10
PR	0.00	[Misc] add enable_thinking to template kwargs	JJJYmmm	—	2026-03-18
PR	0.00	model: Add DEIMv2 to Transformers	harshaljanjani	—	2026-02-27
PR	0.00	Add xcodec2 model	ebezzam	—	2026-02-20
PR	0.00	[`Mllama`] Fix workaround compile	vasqu	—	2026-03-19
PR	0.00	Fix Zamba2MambaMixer ignoring use_mamba_kernels=False	sergiopaniego	—	2026-03-19
PR	0.00	Fix AutoImageProcessor URL loading regression	omyaaa1	—	2026-03-19
PR	0.00	Goodbye cache position	zucchini-nlp	—	2026-03-13
PR	0.00	[CB] Better parametrization for compile	remi-or	—	2026-03-10
PR	0.00	Allow kernel modules to declare their preferred mask functio	dacorvo	—	2026-03-13
PR	0.00	[Model] Add PP-OCRV5_mobile_rec Model Support	liu-jiaxuan	—	2026-02-06
PR	0.00	Fix AutoImageProcessor.from_pretrained failing with URL inpu	xr843	—	2026-03-18
PR	0.00	Fix whisper return language	FredHaa	—	2025-11-16
PR	0.00	fix(flaky): use a fixture for `set_seed` and single-threadin	tarekziade	—	2026-02-07
PR	0.00	Add `Jina-Embeddings-V3` Model	Sai-Suraj-27	—	2026-02-24
PR	0.00	[docs] training on specific hardware	stevhliu	—	2026-03-17
PR	0.00	Fix `AutoImageProcessor` to correctly detect local implement	kaixuanliu	—	2026-03-13
PR	0.00	Use doc-builder runnable example for GLM-ASR	tarekziade	—	2026-02-25
PR	0.00	Fix Mllama torch.compile failure caused by new attention mas	jiqing-feng	—	2026-03-19
PR	0.00	Fix `KeyError` when patching mistral regex	LeonardoEmili	—	2026-01-20
PR	0.00	ci: add anti-slop action	tarekziade	—	2026-03-19
PR	0.00	Correct code block formatting in weightconverter.md	zhulinchng	—	2026-03-19
PR	0.00	[Docs] Update DeiT model card to new format	RicardoLee510520	—	2026-03-19
PR	0.00	Fix llama4 bnb mode	jiqing-feng	—	2026-03-11
PR	0.00	Add cu_seqlens support to OlmoHybridGatedDeltaNet for packed	tyler-romero	—	2026-03-18
PR	0.00	Internalise the NomicBERT model	ed22699	—	2025-12-29
PR	0.00	[docs] optimizers, hyperparam search, training features	stevhliu	—	2026-02-26
PR	0.00	[docs] model cards	stevhliu	—	2026-03-18
PR	0.00	Fix Mistral4 tests	3outeille	—	2026-03-18
PR	0.00	[Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod	zhang-prog	—	2026-03-18
PR	0.00	small cleaning of quantization class	SunMarc	—	2025-12-04
PR	0.00	feat(ci): added a network debug report	tarekziade	—	2026-03-12
PR	0.00	Add GreedyLR adaptive learning rate scheduler	balak4	—	2026-02-25
PR	0.00	Fix unexpected `position_ids` keys when loading OwlViT model	KartikPawade	—	2026-03-06
PR	0.00	Add Mistral 4	juliendenize	—	2026-03-16
PR	0.00	Add `base_model_tp_plan` to `OlmoeConfig`	dacorvo	—	2026-03-13
PR	0.00	Update more modular examples	Cyrilvallez	—	2026-03-18
PR	0.00	fix(gpt2): Resolve NaN/Inf issue in lm_head on Python 3.13 w	JokeYoonic	—	2026-03-13
PR	0.00	Fix and re-run modular converter on examples	Cyrilvallez	—	2026-03-18
PR	0.00	[Model] Add PP-OCRv5_server_rec Model Support	liu-jiaxuan	—	2026-02-06
PR	0.00	fix: add Float8 dtype fallback in modeling_utils.py	s-zx	—	2026-03-11
PR	0.00	Remove cache_position in more models (4 and last one)	Cyrilvallez	—	2026-03-18
PR	0.00	docs(pipelines): remove outdated question-answering example	BillionClaw	—	2026-03-17
PR	0.00	Fix loading issue in Sam3	zucchini-nlp	—	2026-03-18
PR	0.00	docs(quicktour): remove question-answering pipeline from qui	BillionClaw	—	2026-03-18
PR	0.00	fix: handle dict vocab in CamembertTokenizer for tokenizer.j	aayushbaluni	—	2026-03-17
PR	0.00	Add MPS (Apple Silicon) example and documentation	divyanks	—	2026-03-17
PR	0.00	fix: Cache XLNet relative_positional_encoding to avoid CPU c	BillionClaw	—	2026-03-16
PR	0.00	fix: resolve false-positive regex warning for non-mistral mo	yunhaoli24	—	2026-03-16
PR	0.00	Fix: propagate interpolate_pos_encoding through PixioEmbeddi	aashirpersonal	—	2026-03-15
PR	0.00	feat(integration): Add KubeflowCallback to enable automatic	abhijeet-dhumal	—	2026-03-06
PR	0.00	Add AudioFlamingoNext model	lashahub	—	2026-03-18
PR	0.00	fix series of failed test case for janus model	kaixuanliu	—	2026-03-16
PR	0.00	Add GGUF support for MiniMax-M2.1 model	JoursBleu	—	2026-03-08

COMMIT

1.00

Update AFMoE architecture to use v5-style MoE impl (#44063)

AutumnAurelium

Commit message contains explicit AI assi

2026-03-19

COMMIT

1.00

Sdpa for owlvit (#42136)

Aravind-11

Commit message contains explicit AI assi

2026-03-17

COMMIT

1.00

:rotating_light: Validate config attributes (#41250)

zucchini-nlp

Commit message contains explicit AI assi

2026-03-16

COMMIT

1.00

Fix off-by-one in decode_spans boundary check (#44584)

mvanhorn

Commit message contains explicit AI assi

2026-03-12

PR

1.00

Fix #44155: [AudioFlamingo3] Batched inference produces inco

danielalanbates

PR body explicitly mentions AI collabora

2026-02-21

COMMIT

0.00

Fix glm dsa (#44564)

ArthurZucker

—

2026-03-19

COMMIT

0.00

🚨🚨 Refactor Image Processors to support different backends (

yonigozlan

—

2026-03-19

COMMIT

0.00

[generate] Never use `cache_position` anymore in generation

Cyrilvallez

—

2026-03-19

COMMIT

0.00

Fix KeyError in convert_to_native_format for dict vocab (#44

weiguangli-io

—

2026-03-19

COMMIT

0.00

fix: XLNet: relative_positional_encoding computes on CPU eve

JiwaniZakir

—

2026-03-19

COMMIT

0.00

Fix annotations reader for python 3.14 in `PreTrainedModel`

neo

—

2026-03-19

COMMIT

0.00

[CB] Better parametrization for compile (#44578)

remi-or

—

2026-03-19

COMMIT

0.00

Fix `KeyError` when patching mistral regex (#43376)

LeonardoEmili

—

2026-03-19

COMMIT

0.00

Correct code block formatting in weightconverter.md (#44839)

zhulinchng

—

2026-03-19

COMMIT

0.00

deepseek_v2, deepseek_v3, and modernbert fix for having inco

itazap

—

2026-03-18

COMMIT

0.00

[Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod

zhang-prog

—

2026-03-18

COMMIT

0.00

Add `Jina-Embeddings-V3` Model (#44251)

Sai-Suraj-27

—

2026-03-18

COMMIT

0.00

feat(ci): added a network debug report (#44636)

tarekziade

—

2026-03-18

COMMIT

0.00

Add GreedyLR adaptive learning rate scheduler (#44271)

balak4

—

2026-03-18

COMMIT

0.00

Fix unexpected `position_ids` keys when loading OwlViT model

KartikPawade

—

2026-03-18

COMMIT

0.00

Update more modular examples (#44834)

Cyrilvallez

—

2026-03-18

COMMIT

0.00

Fix and re-run modular converter on examples (#44833)

Cyrilvallez

—

2026-03-18

COMMIT

0.00

Remove cache_position in more models (4 and last one) (#4482

Cyrilvallez

—

2026-03-18

COMMIT

0.00

Fix loading issue in Sam3 (#44831)

zucchini-nlp

—

2026-03-18

COMMIT

0.00

feat(integration): Add KubeflowCallback to enable automatic

abhijeet-dhumal

—

2026-03-18

COMMIT

0.00

Add GGUF support for MiniMax-M2.1 model (#44526)

JoursBleu

—

2026-03-18

COMMIT

0.00

Centralize AI agent templates in `.ai` (#44489)

tarekziade

—

2026-03-18

COMMIT

0.00

support xxxFast alias in v5 tokenizers (#44766)

itazap

—

2026-03-18

COMMIT

0.00

Remove cache_position in more models (3) (#44759)

Cyrilvallez

—

2026-03-18

COMMIT

0.00

Fix `supports_{tp/pp}_plan` (#44696)

hmellor

—

2026-03-18

COMMIT

0.00

[CI] Temporarily skip Mistral4 tests as they almost all fail

Cyrilvallez

—

2026-03-18

COMMIT

0.00

update flex attention to use `return_aux` instead of `return

ntenenz

—

2026-03-18

COMMIT

0.00

[Gemma] Update conversion scripts for Transformers v5 Comapt

RyanMullins

—

2026-03-18

COMMIT

0.00

fix bug embedding_size mismatch with hidden_size in electra

kaixuanliu

—

2026-03-18

COMMIT

0.00

Fix pegasus conversion (#44571)

ArthurZucker

—

2026-03-18

COMMIT

0.00

Fix repo-check bot (#44812)

ydshieh

—

2026-03-18

COMMIT

0.00

[docs] is_causal feature (#44777)

stevhliu

—

2026-03-17

COMMIT

0.00

docs(tasks): remove references to removed question-answering

BillionClaw

—

2026-03-17

COMMIT

0.00

Fix configs with `@strict` (#44770)

zucchini-nlp

—

2026-03-17

COMMIT

0.00

[AMD CI] Fix test failures across important models (#44632)

Abdennacer-Badaoui

—

2026-03-17

COMMIT

0.00

Move VLM conversions to the main mapping (#44627)

zucchini-nlp

—

2026-03-17

COMMIT

0.00

Fix config loading issues (type issues) (#44789)

ydshieh

—

2026-03-17

COMMIT

0.00

Remove `is_causal` from `EuroBertConfig` (#44774)

ydshieh

—

2026-03-17

COMMIT

0.00

model-linter: Added rule 10 (#44761)

tarekziade

—

2026-03-17

COMMIT

0.00

[fix] mistral 4 docs (#44776)

stevhliu

—

2026-03-16

COMMIT

0.00

Add Mistral 4 (#44760)

juliendenize

—

2026-03-16

COMMIT

0.00

Fix: Eurobert model was missing @strict decorator and invali

tarekziade

—

2026-03-16

COMMIT

0.00

fix: sig lip import (#44764)

tarekziade

—

2026-03-16

COMMIT

0.00

Disable async loading when quantizing on the fly (#44576)

SunMarc

—

2026-03-16

COMMIT

0.00

Bump torchao >=0.15 and fix quantization CI (#44604)

SunMarc

—

2026-03-16

COMMIT

0.00

Fix tensor indexing crash in serve generate_response KV cach

mango766

—

2026-03-16

COMMIT

0.00

[MistralCommonBackend] Upgrade mistral-common to v1.10.0 (#4

juliendenize

—

2026-03-16

COMMIT

0.00

Fix `mlcd` auto config/model/mapping issues (#44730)

ydshieh

—

2026-03-16

COMMIT

0.00

Fix bug and add XPU Expectations for qwen2 and jamba tests (

kaixuanliu

—

2026-03-16

COMMIT

0.00

Add model lerobot PI0 to transformers (#44160)

molbap

—

2026-03-16

COMMIT

0.00

[medasr] doc update (#44633)

eustlb

—

2026-03-16

COMMIT

0.00

Idefics3 without cache fix (#44607)

gabe-l-hart

—

2026-03-16

COMMIT

0.00

Add XPU Expectations for vibe voice acoustic tokenizer tests

kaixuanliu

—

2026-03-16

COMMIT

0.00

Fix transformers serve's 422 unprocessable entity (#44620)

LysandreJik

—

2026-03-16

COMMIT

0.00

Fix missing / incorrect `config` class in some model class d

ydshieh

—

2026-03-15

COMMIT

0.00

Update Nvidia CI docker file to use torch 2.10 (#44712)

ydshieh

—

2026-03-14

COMMIT

0.00

[`FA`] Fix fa detection (#44703)

vasqu

—

2026-03-14

COMMIT

0.00

Fix `set_encoder` (#44698)

hmellor

—

2026-03-14

COMMIT

0.00

[docs] cb config (#44675)

stevhliu

—

2026-03-13

COMMIT

0.00

Fix more model tester missing `parent` issue (#44685)

ydshieh

—

2026-03-13

COMMIT

0.00

:rotating_light: [`FA4`] Initial support (#42435)

vasqu

—

2026-03-13

COMMIT

0.00

Add register method for `ParallelInterface` (#44640)

michaelbenayoun

—

2026-03-13

COMMIT

0.00

[CB] [Bug] Fix crashes when running without cuda (#44673)

remi-or

—

2026-03-13

COMMIT

0.00

Another (small) set of fixes required for tiny model creatio

ydshieh

—

2026-03-13

COMMIT

0.00

Fix CookieCutter (#44334)

NielsRogge

—

2026-03-13

COMMIT

0.00

Fix AWQ tests for GPTQModel migration (#44654)

jiqing-feng

—

2026-03-13

COMMIT

0.00

[Model] Add PP-OCRV5_mobile_det Model Support (#43247)

XingweiDeng

—

2026-03-13

COMMIT

0.00

pipelines do not have modelcard (#44621)

KoichiYasuoka

—

2026-03-13

COMMIT

0.00

[`Chmv2`] Fix conversion after capture refactor (#44665)

vasqu

—

2026-03-13

COMMIT

0.00

fix(models, testing): Fix Llama4 vision rotary meta tensor i

harshaljanjani

—

2026-03-13

COMMIT

0.00

[CB] Add dedicated config (#44434)

remi-or

—

2026-03-13

COMMIT

0.00

fix(models): Forward timm model kwargs to timm.create_model

harshaljanjani

—

2026-03-13

COMMIT

0.00

Ensure same `dtype` for subconfig when `_from_config` (#4462

zucchini-nlp

—

2026-03-13

COMMIT

0.00

Remove `cache_position` in more models (2) (#44602)

Cyrilvallez

—

2026-03-12

COMMIT

0.00

fix: cast to proper dtype in EmbeddingParallel (#44612)

michaelbenayoun

—

2026-03-12

COMMIT

0.00

Allow to disable stdout hiding for TP (#44608)

michaelbenayoun

—

2026-03-12

COMMIT

0.00

Remove many output_attentions and other traced outputs on 10

molbap

—

2026-03-12

COMMIT

0.00

[Model] Add PP-OCRV5_server_det Model Support (#43274)

XingweiDeng

—

2026-03-12

COMMIT

0.00

fix: raise error if mm_token_type_ids not supplied (#44433)

leopold-tzafon

—

2026-03-12

COMMIT

0.00

Fix output capturing for Backbones (#44638)

Cyrilvallez

—

2026-03-12

COMMIT

0.00

Fix lfm2 kernel path (#44634)

Cyrilvallez

—

2026-03-12

COMMIT

0.00

Fix for `VibeVoiceAcousticTokenizer` (#44628)

ydshieh

—

2026-03-12

COMMIT

0.00

Add an integration test for LASR using pipe and chunked deco

kho

—

2026-03-12

COMMIT

0.00

Fix more wrong HF hub checkpoint names (#44624)

ydshieh

—

2026-03-12

COMMIT

0.00

Update agentic contributions guidelines in AGENTS.md to forc

burtenshaw

—

2026-03-12

COMMIT

0.00

Expand model-structure lint rules with a fast AST-based, ruf

tarekziade

—

2026-03-12

COMMIT

0.00

feat: add neuron in tensor parallelism initialization (#4449

michaelbenayoun

—

2026-03-11

COMMIT

0.00

[WIP] FIX Make Mixtral LoRA loading work (#44478)

BenjaminBossan

—

2026-03-11

COMMIT

0.00

Fix Llava tests for torch too! (#44476)

Rocketknight1

—

2026-03-11

COMMIT

0.00

Fix training ci and clean some tests (#44491)

SunMarc

—

2026-03-11

COMMIT

0.00

Add CHMv2 (#44595)

yonigozlan

—

2026-03-11

COMMIT

0.00

Remove useless identity assignment (#44600)

Cyrilvallez

—

2026-03-11

COMMIT

0.00

Add Yoni to run-slow workflow (#44598)

vasqu

—

2026-03-11

COMMIT

0.00

Add shared VLM tests (#42964)

Rocketknight1

—

2026-03-11

COMMIT

0.00

Fix wrong (non-existing) checkpoints (#44549)

ydshieh

—

2026-03-11

COMMIT

0.00

Remove `cache_position` in more models (#44330)

Cyrilvallez

—

2026-03-11

PR

0.00

Switch FP8 per tensor quant to use `torch._scaled_mm`

SunMarc

—

2026-03-19

PR

0.00

DeepGEMM

IlyasMoutawwakil

—

2026-03-18

PR

0.00

Update some type hints

zucchini-nlp

—

2026-03-19

PR

0.00

Proposal to add Qwen3-ASR support [WIP]

mbtariq82

—

2026-02-08

PR

0.00

[Model] Add PP-Chart2Table Model Support

XingweiDeng

—

2026-02-05

PR

0.00

Dequant fix

ArthurZucker

—

2026-03-18

PR

0.00

[Model] Add SLANeXt Model Support

liu-jiaxuan

—

2026-02-03

PR

0.00

🚨 Refactor ViT to updated standards

yonigozlan

—

2025-10-17

PR

0.00

Add THD support in ESM

balvisio

—

2026-02-19

PR

0.00

[Model] Add UVDoc Model Support

XingweiDeng

—

2026-01-21

PR

0.00

feat: added cache to the model linter

tarekziade

—

2026-03-17

PR

0.00

Propagate the model loading from transformers serve to chat

LysandreJik

—

2026-03-16

PR

0.00

chore(typing): extend typing to `src/transformers/cli`

tarekziade

—

2026-03-10

PR

0.00

Fix core dumped when `NemotronH` is torch compiled

ydshieh

—

2026-03-19

PR

0.00

Officially launch parse_response

Rocketknight1

—

2026-03-13

PR

0.00

[CB] Add an option to return logprobs

remi-or

—

2026-03-18

PR

0.00

fix: handle list-type _tied_weights_keys in _get_tied_weight

gh-wf

—

2026-03-19

PR

0.00

Fix glm dsa

ArthurZucker

—

2026-03-10

PR

0.00

[PoC] HF exporters

IlyasMoutawwakil

—

2025-11-03

PR

0.00

[Mistral] Fix query scaling for Mistral4 and Ministral3

Cyrilvallez

—

2026-03-19

PR

0.00

Fix several based models' pipeline parallel support

hmellor

—

2026-03-14

PR

0.00

Support Modular (!!) + Configs in `check_auto_docstrings`

yonigozlan

—

2026-03-17

PR

0.00

Fix failing `Qwen3OmniModelIntegrationTests`

Sai-Suraj-27

—

2026-03-19

PR

0.00

🚨🚨 Refactor Image Processors to support different backends

yonigozlan

—

2026-01-27

PR

0.00

Dynamic weight conversion is recursive

zucchini-nlp

—

2026-02-26

PR

0.00

FSDP2 native support in transformers

3outeille

—

2026-02-17

PR

0.00

[generate] Never use `cache_position` anymore in generation

Cyrilvallez

—

2026-03-18

PR

0.00

add HyperClovaX Vision

jp1924

—

2026-02-27

PR

0.00

perceptron: Isaac-0.1 implementation

AkshatSh

—

2025-09-18

PR

0.00

refactor: rope in model, flatten vision, rely on qwen3 backo

philippguevorguian

—

2026-03-19

PR

0.00

enable tp for benchmark

sywangyi

—

2026-02-05

PR

0.00

Update AFMoE architecture to use v5-style MoE impl

AutumnAurelium

—

2026-02-17

PR

0.00

Fix KeyError in convert_to_native_format for dict vocab

weiguangli-io

—

2026-03-05

PR

0.00

Use `index_select` instead of advanced indexing in `batched_

dacorvo

—

2026-03-13

PR

0.00

fix: XLNet: relative_positional_encoding computes on CPU eve

JiwaniZakir

—

2026-03-17

PR

0.00

Fix annotations reader for python 3.14 in `PreTrainedModel`

neo

—

2026-03-13

PR

0.00

fix: allow AutoImageProcessor to load from URL

BillionClaw

—

2026-03-18

PR

0.00

Add Music Flamingo

lashahub

—

2026-01-27

PR

0.00

[CB] [Minor] Simplify test suite

remi-or

—

2026-03-19

PR

0.00

fix(testing): Fix PaliGemma 2 and PaddleOCR-VL test failures

harshaljanjani

—

2026-03-16

PR

0.00

fix: Add MXFP4 MoE/attention backward kernels

leoneperdigao

—

2026-02-05

PR

0.00

fix: handle unpicklable tokenizers in ProcessorMixin.to_dict

themavik

—

2026-03-19

PR

0.00

deepseek_v2, deepseek_v3, and modernbert fix for having inco

itazap

—

2026-03-17

PR

0.00

fix: move comments before @torch.jit.script decorator for Py

hkc5

—

2026-03-19

PR

0.00

Fix DEIM config export and public API

sahilleth

—

2026-03-19

PR

0.00

Add /v1/completions endpoint (OpenAI legacy completions API)

rain-1

—

2026-03-10

PR

0.00

[Misc] add enable_thinking to template kwargs

JJJYmmm

—

2026-03-18

PR

0.00

model: Add DEIMv2 to Transformers

harshaljanjani

—

2026-02-27

PR

0.00

Add xcodec2 model

ebezzam

—

2026-02-20

PR

0.00

[`Mllama`] Fix workaround compile

vasqu

—

2026-03-19

PR

0.00

Fix Zamba2MambaMixer ignoring use_mamba_kernels=False

sergiopaniego

—

2026-03-19

PR

0.00

Fix AutoImageProcessor URL loading regression

omyaaa1

—

2026-03-19

PR

0.00

Goodbye cache position

zucchini-nlp

—

2026-03-13

PR

0.00

[CB] Better parametrization for compile

remi-or

—

2026-03-10

PR

0.00

Allow kernel modules to declare their preferred mask functio

dacorvo

—

2026-03-13

PR

0.00

[Model] Add PP-OCRV5_mobile_rec Model Support

liu-jiaxuan

—

2026-02-06

PR

0.00

Fix AutoImageProcessor.from_pretrained failing with URL inpu

xr843

—

2026-03-18

PR

0.00

Fix whisper return language

FredHaa

—

2025-11-16

PR

0.00

fix(flaky): use a fixture for `set_seed` and single-threadin

tarekziade

—

2026-02-07

PR

0.00

Add `Jina-Embeddings-V3` Model

Sai-Suraj-27

—

2026-02-24

PR

0.00

[docs] training on specific hardware

stevhliu

—

2026-03-17

PR

0.00

Fix `AutoImageProcessor` to correctly detect local implement

kaixuanliu

—

2026-03-13

PR

0.00

Use doc-builder runnable example for GLM-ASR

tarekziade

—

2026-02-25

PR

0.00

Fix Mllama torch.compile failure caused by new attention mas

jiqing-feng

—

2026-03-19

PR

0.00

Fix `KeyError` when patching mistral regex

LeonardoEmili

—

2026-01-20

PR

0.00

ci: add anti-slop action

tarekziade

—

2026-03-19

PR

0.00

Correct code block formatting in weightconverter.md

zhulinchng

—

2026-03-19

PR

0.00

[Docs] Update DeiT model card to new format

RicardoLee510520

—

2026-03-19

PR

0.00

Fix llama4 bnb mode

jiqing-feng

—

2026-03-11

PR

0.00

Add cu_seqlens support to OlmoHybridGatedDeltaNet for packed

tyler-romero

—

2026-03-18

PR

0.00

Internalise the NomicBERT model

ed22699

—

2025-12-29

PR

0.00

[docs] optimizers, hyperparam search, training features

stevhliu

—

2026-02-26

PR

0.00

[docs] model cards

stevhliu

—

2026-03-18

PR

0.00

Fix Mistral4 tests

3outeille

—

2026-03-18

PR

0.00

[Model] Add PP-OCRv5_server_rec and PP-OCRv5_mobile_rec mod

zhang-prog

—

2026-03-18

PR

0.00

small cleaning of quantization class

SunMarc

—

2025-12-04

PR

0.00

feat(ci): added a network debug report

tarekziade

—

2026-03-12

PR

0.00

Add GreedyLR adaptive learning rate scheduler

balak4

—

2026-02-25

PR

0.00

Fix unexpected `position_ids` keys when loading OwlViT model

KartikPawade

—

2026-03-06

PR

0.00

Add Mistral 4

juliendenize

—

2026-03-16

PR

0.00

Add `base_model_tp_plan` to `OlmoeConfig`

dacorvo

—

2026-03-13

PR

0.00

Update more modular examples

Cyrilvallez

—

2026-03-18

PR

0.00

fix(gpt2): Resolve NaN/Inf issue in lm_head on Python 3.13 w

JokeYoonic

—

2026-03-13

PR

0.00

Fix and re-run modular converter on examples

Cyrilvallez

—

2026-03-18

PR

0.00

[Model] Add PP-OCRv5_server_rec Model Support

liu-jiaxuan

—

2026-02-06

PR

0.00

fix: add Float8 dtype fallback in modeling_utils.py

s-zx

—

2026-03-11

PR

0.00

Remove cache_position in more models (4 and last one)

Cyrilvallez

—

2026-03-18

PR

0.00

docs(pipelines): remove outdated question-answering example

BillionClaw

—

2026-03-17

PR

0.00

Fix loading issue in Sam3

zucchini-nlp

—

2026-03-18

PR

0.00

docs(quicktour): remove question-answering pipeline from qui

BillionClaw

—

2026-03-18

PR

0.00

fix: handle dict vocab in CamembertTokenizer for tokenizer.j

aayushbaluni

—

2026-03-17

PR

0.00

Add MPS (Apple Silicon) example and documentation

divyanks

—

2026-03-17

PR

0.00

fix: Cache XLNet relative_positional_encoding to avoid CPU c

BillionClaw

—

2026-03-16

PR

0.00

fix: resolve false-positive regex warning for non-mistral mo

yunhaoli24

—

2026-03-16

PR

0.00

Fix: propagate interpolate_pos_encoding through PixioEmbeddi

aashirpersonal

—

2026-03-15

PR

0.00

feat(integration): Add KubeflowCallback to enable automatic

abhijeet-dhumal

—

2026-03-06

PR

0.00

Add AudioFlamingoNext model

lashahub

—

2026-03-18

PR

0.00

fix series of failed test case for janus model

kaixuanliu

—

2026-03-16

PR

0.00

Add GGUF support for MiniMax-M2.1 model

JoursBleu

—

2026-03-08

huggingface/transformers