GitHub AI Radar

Type	AI Score	Description	Actor	Reason	Date
COMMIT	0.20	[fix] harmonize template	jingyaogong	Uses technical term 'harmonize' but in b	2025-11-06
COMMIT	0.15	fix: attn_forwad when is_causal=True assert attn_mask is Non	yuyu5333	Slightly more descriptive but remains te	2025-11-18
COMMIT	0.10	[update] params log	jingyaogong	Bare-bones commit messages typical of hu	2026-01-07
COMMIT	0.10	[update] mask log	jingyaogong	Bare-bones commit messages typical of hu	2026-01-07
COMMIT	0.10	[update] readme	jingyaogong	Bare-bones commit messages typical of hu	2026-01-06
COMMIT	0.10	[update] simplify loader	jingyaogong	Bare-bones commit messages typical of hu	2026-01-05
COMMIT	0.10	[update] readme	jingyaogong	Bare-bones commit messages typical of hu	2026-01-05
COMMIT	0.10	[update] rename train tokenizer	jingyaogong	Bare-bones commit messages typical of hu	2026-01-05
COMMIT	0.10	[update] readme	jingyaogong	Bare-bones commit messages typical of hu	2026-01-05
COMMIT	0.10	[fix] messages num	jingyaogong	Bare-bones commit messages typical of hu	2026-01-04
COMMIT	0.10	[fix] dist cleanup	jingyaogong	Bare-bones commit messages typical of hu	2026-01-02
COMMIT	0.10	[feat] update yarn	jingyaogong	Extremely terse commit-style messages wi	2025-12-01
COMMIT	0.10	[feat] release memory	jingyaogong	Very brief technical phrasing typical of	2025-11-27
COMMIT	0.10	[fix] ppo mask	jingyaogong	Minimal message uses domain abbreviation	2025-11-19
COMMIT	0.10	[fix] model attn_mask	jingyaogong	Concise technical fix reference with dom	2025-11-19
COMMIT	0.10	[fix] update model	jingyaogong	Simple two-word technical instruction wi	2025-11-18
COMMIT	0.10	[fix] prompt length calculate	jingyaogong	Brief domain-specific fix reference with	2025-11-15
COMMIT	0.10	[fix] model-name	jingyaogong	Minimal hyphenated fix reference typical	2025-11-07
COMMIT	0.10	[feat] update requirements	jingyaogong	Extremely terse technical commit message	2025-10-23
COMMIT	0.10	[feat] update readme	jingyaogong	Minimal commit message with specific tec	2025-10-23
COMMIT	0.10	[feat] update readme	jingyaogong	Brief, repetitive commit message typical	2025-10-23
COMMIT	0.10	[feat] repetition-penalty	jingyaogong	Specialized ML term with no polite or fo	2025-10-23
COMMIT	0.10	[feat] convert2llama	jingyaogong	Technical shorthand conversion label wit	2025-10-23
COMMIT	0.10	[feat] shuffle data	jingyaogong	Concise data operation command, not AI-s	2025-10-23
COMMIT	0.10	[fix] loss-issues-430	jingyaogong	Issue-specific fix reference with techni	2025-10-23
COMMIT	0.10	[fix] restore	jingyaogong	Single-word commit message - too minimal	2025-10-23
COMMIT	0.10	[fix] issue-431	jingyaogong	Issue number reference in terse human-st	2025-10-23
COMMIT	0.10	[fix] sampler-ddp	jingyaogong	Technical DDP sampler fix with domain-sp	2025-10-23
COMMIT	0.00	[update] random seed	jingyaogong	Very terse, standard commit message styl	2026-03-27
COMMIT	0.00	[update] fp16 inference	jingyaogong	Brief, domain-specific commit message.	2026-03-27
COMMIT	0.00	[update] image	jingyaogong	Terse, mechanical commit; typical human	2026-03-26
COMMIT	0.00	[update] change default seq_len	jingyaogong	Concise and technical; common human comm	2026-03-26
COMMIT	0.00	[update] minimind-3	jingyaogong	Very brief, informal; no AI hallmarks pr	2026-03-24
COMMIT	0.00	[fix] align log/save last-step check and ETA with 1-indexed	readlnh	Specific technical fix; human-like jargo	2026-03-24
COMMIT	0.00	[fix] gradient accumulation step alignment	readlnh	Direct technical fix; human engineering	2026-03-24
COMMIT	0.00	[update] empty_think_ratio	jingyaogong	Minimal, to-the-point; lacks AI politene	2026-02-06
COMMIT	0.00	[update] empty_think_ratio	jingyaogong	Repetitive, brief update; likely human c	2026-02-05
COMMIT	0.00	[feat] data process	jingyaogong	Ambiguous but informal; typical human sh	2026-02-05
COMMIT	0.00	[update] save interval	jingyaogong	Concise and clear; common human commit m	2026-01-30
COMMIT	0.00	[update] safe half	jingyaogong	Very brief, technical term; no AI stylis	2026-01-30
COMMIT	0.00	[fix] data skip	jingyaogong	Extremely terse and informal phrasing ty	2026-01-18
COMMIT	0.00	[update] shuffle data	jingyaogong	Brief, informal update note with common	2026-01-18
COMMIT	0.00	[fix] max length	jingyaogong	Minimalist technical fix; lacks any poli	2026-01-17
COMMIT	0.00	[update] pretrain load	jingyaogong	Concise, technical update note with stan	2026-01-17
COMMIT	0.00	[update] align mask	jingyaogong	Very short and specific; common ML/engin	2026-01-15
COMMIT	0.00	[update] align loss	jingyaogong	Terse technical update; no hallmark AI p	2026-01-14
COMMIT	0.00	[fix] compile unpack	jingyaogong	Brief human-style fix message ('fix comp	2026-01-14
COMMIT	0.00	[feat] add compile	jingyaogong	Succinct feature addition; common Git co	2026-01-14
COMMIT	0.00	[update] prompt prefill	jingyaogong	Direct technical update; uses specific j	2026-01-13
COMMIT	0.00	[update] show speed	jingyaogong	Short, informal update typical of human	2026-01-07
COMMIT	0.00	[update] rename reason	jingyaogong	Bare-bones commit messages typical of hu	2026-01-05
COMMIT	0.00	[update] aux loss	jingyaogong	Terse commit-style messages with technic	2026-01-01
COMMIT	0.00	[fix] experts unused	jingyaogong	Concise fix notation typical of develope	2025-12-31
COMMIT	0.00	[fix] layers set 8	jingyaogong	Very brief technical notation lacking AI	2025-12-31
COMMIT	0.00	[fix] moe unused	jingyaogong	Short technical fix message with domain-	2025-12-31
COMMIT	0.00	[feat] get params	jingyaogong	Simple feature description with no AI-st	2025-12-31
COMMIT	0.00	[feat] get params	jingyaogong	Identical to previous item; typical huma	2025-12-31
COMMIT	0.00	[feat] update config	jingyaogong	Brief config update notation without AI	2025-12-31
COMMIT	0.00	[feat] update lr	jingyaogong	Abbreviated technical term (lr) with no	2025-12-31
COMMIT	0.00	[feat] compatible tokenizer	jingyaogong	Concise feature description using domain	2025-12-31
COMMIT	0.00	[feat] stream load data	jingyaogong	Terse technical notation typical of deve	2025-12-28
COMMIT	0.00	[feat] remove empty_cache	jingyaogong	Terse, technical message with typical Gi	2025-12-26
COMMIT	0.00	[feat] explicit left padding	jingyaogong	Concise, technical message with standard	2025-12-23
COMMIT	0.00	[fix] lora weight	jingyaogong	Minimal, direct technical fix descriptio	2025-12-22
COMMIT	0.00	Fix: support loading DDP-saved LoRA weights for inference	whiteswordLI	Direct technical description with specif	2025-12-22
COMMIT	0.00	[feat] adjust seq length	jingyaogong	Very brief, technical update typical of	2025-12-14
COMMIT	0.00	[feat] update readme	jingyaogong	Very brief update message, common for Gi	2025-12-11
COMMIT	0.00	[fix] dtype & lr	jingyaogong	Extremely concise technical fix with com	2025-12-09
COMMIT	0.00	[fix] Refactor get_lr function to include min_lr calculation	dyhuachi	Detailed, specific technical explanation	2025-12-06
COMMIT	0.00	[fix] reduce aux_loss_alpha	jingyaogong	Minimal technical message about a parame	2025-12-05
COMMIT	0.00	[fix] cuda memory #559	jingyaogong	Terse, context-specific technical messag	2025-12-01
COMMIT	0.00	[feat] add MNN support to README.	wangzhaode	Specific feature addition but remains co	2025-11-10
COMMIT	0.00	[feat] clear cache	jingyaogong	Terse, informal commit style typical of	2025-11-06
COMMIT	0.00	[fix] harmonize template	jingyaogong	Concise technical wording, lacks AI form	2025-11-02
COMMIT	0.00	[feat] update import	jingyaogong	Brief and direct, common in human commit	2025-10-31
COMMIT	0.00	[feat] update readme	jingyaogong	Minimal update note, no AI stylistic mar	2025-10-30
COMMIT	0.00	[feat] update readme	jingyaogong	Repetitive simple update, typical human	2025-10-30
COMMIT	0.00	[feat] update readme	jingyaogong	Same as previous, no signs of AI generat	2025-10-30
COMMIT	0.00	[feat] update datasets	jingyaogong	Direct technical term, no AI phrasing.	2025-10-30
COMMIT	0.00	[feat] update args	jingyaogong	Short, informal, lacks any AI hallmarks.	2025-10-30
COMMIT	0.00	[feat] add args	jingyaogong	Minimalist and terse, characteristic of	2025-10-30
COMMIT	0.00	[feat] update readme	jingyaogong	Identical to other updates, clearly huma	2025-10-29
COMMIT	0.00	[fix] model device	jingyaogong	Very terse, technical commit format with	2025-10-29
COMMIT	0.00	[feat] update trainer	jingyaogong	Minimal, repetitive commit title typical	2025-10-28
COMMIT	0.00	[feat] update trainer	jingyaogong	Identical to previous; likely batch huma	2025-10-28
COMMIT	0.00	[feat] update readme	jingyaogong	Brief, standard commit to update documen	2025-10-27
COMMIT	0.00	[feat] update readme	jingyaogong	Repetitive format, no AI stylistic phras	2025-10-26
COMMIT	0.00	[feat] update eval-llm	jingyaogong	Abbreviated, informal module name like '	2025-10-26
COMMIT	0.00	[feat] pause-training	jingyaogong	Hyphenated, descriptive feature name com	2025-10-26
COMMIT	0.00	[feat] update readme	jingyaogong	Repetitive, minimal content typical of m	2025-10-23
COMMIT	0.00	[feat] update readme	jingyaogong	Identical pattern; no AI hallmarks like	2025-10-23
COMMIT	0.00	[feat] update requirements	jingyaogong	Terse, technical commit typical of depen	2025-10-23
COMMIT	0.00	[fix] graph-oom & ddp-pos_cis	jingyaogong	Extremely terse and informal, typical of	2025-10-23
COMMIT	0.00	[fix] git track	jingyaogong	Short, minimal text with informal phrasi	2025-10-21
COMMIT	0.00	[feat] update readme	jingyaogong	Brief and typical human update to docume	2025-10-21
COMMIT	0.00	[feat] minimind-2510	jingyaogong	Concise, likely a project-specific refer	2025-10-21
COMMIT	0.00	[feat] update eval	jingyaogong	Very terse; lacks any AI-style politenes	2025-10-17
COMMIT	0.00	[feat] update requirements	jingyaogong	Minimal description, common for dependen	2025-10-16
COMMIT	0.00	[fix] update model	jingyaogong	Too brief and direct to be AI-generated.	2025-10-16
COMMIT	0.00	[fix] update model	jingyaogong	Identical to previous; no AI hallmarks.	2025-10-16
PR	0.00	使用einops进一步提升代码可读性	wizardforcel	Chinese text, concise and topic-specific	2026-03-27
PR	0.00	[feat] add dapo algorithm	LittleExian	Detailed technical phrasing, human tone,	2026-03-27
PR	0.00	merge redundant forward passes for logps and aux_loss (in tr	Dxpsk	—	2026-03-24
PR	0.00	添加数据集加载逻辑、网页内容抓取与数据预处理逻辑	DiracSeas	—	2025-12-16
PR	0.00	[docs] Fix wording in RLHF section of README.md file	vanking20000918	—	2026-01-27
PR	0.00	[docs]: clarify pretraining data format in README	dyhuachi	—	2025-12-31
PR	0.00	refactor: optimize tensor wrapping in lm_dataset.py	Dear47	—	2025-12-20
PR	0.00	[fix] 修复训练脚本中 1-indexed step 与 0-indexed 逻辑混用的问题	readlnh	—	2026-03-24
PR	0.00	Fix SFT resume with torch.compile enabledfix sft resume with	DeliWang	—	2026-03-19
PR	0.00	新建test分支	upwardflow	—	2026-03-22
PR	0.00	[mod & add] fix spo algorithm, add dapo and cispo algorithm	vanking20000918	—	2026-01-30
PR	0.00	Mega	yangnianboy	—	2025-06-26
PR	0.00	Add dynamic growth pipeline, eval tooling, and overnight run	spectramaster	—	2026-02-22
PR	0.00	更新了 model / trainer 中的注释 & PyTorch 新版本 Automatic Mixed Preci	Bader-CN	—	2026-02-04
PR	0.00	Update requirements.txt	CharlieZhuang-Code	—	2024-10-28
PR	0.00	Auto tokenizer name path fix	krmst	—	2024-10-03
PR	0.00	Update requirements	krmst	—	2024-10-01
PR	0.00	[add] add gating term on po algorithm	vanking20000918	—	2026-02-03
PR	0.00	add muon optimizer	guo-sj	—	2026-02-02
PR	0.00	modified: .gitignore	FWJ321	—	2026-01-19
PR	0.00	Fix DPO loss_mask boundary (include first assistant token)	xiao-baia	—	2026-01-07
PR	0.00	[feat] Support Minimind retrieval-augmented generation (RAG)	ztzhu1	—	2025-11-16
PR	0.00	perf: merge LoRA weights into model for inference	whiteswordLI	—	2025-12-23
PR	0.00	Create RESOURCES.md for MiniMind project	VinodHatti-AI-Developer	—	2025-10-24
PR	0.00	Perf/merge lora weights	whiteswordLI	—	2025-12-23
PR	0.00	Fix: support loading DDP-saved LoRA weights for inference	whiteswordLI	—	2025-12-22
PR	0.00	[feat] add interactive notebook	Nijikadesu	—	2025-02-23
PR	0.00	[feat] Add Training Web UI	yuyu5333	—	2025-11-06
PR	0.00	fix: 调整model_lora.py里面lora作用的对象	Litmeb	—	2025-12-05
PR	0.00	Add attention gate	Peace-Howard-Wang	—	2025-12-12
PR	0.00	feat: 增加 LoRA alpha 缩放系数及命令行支持	Peace-Howard-Wang	—	2025-12-12
PR	0.00	[fix] Refactor get_lr function to include min_lr calculation	dyhuachi	—	2025-12-06
PR	0.00	feat: add merge_lora.py to support merging LoRA weights into	dyhuachi	—	2025-12-05
PR	0.00	第一次尝试	wzyandyzw	—	2025-11-21
PR	0.00	[Security] Fix HIGH vulnerability: trailofbits.python.pickle	orbisai0security	—	2025-11-19
PR	0.00	fix: attn_forwad when is_causal=True assert attn_mask is Non	yuyu5333	—	2025-11-18
PR	0.00	Train_Grpo 添加注释	chengyuZou	—	2025-10-26
PR	0.00	[feat] update model install method	Explorer-Dong	—	2025-11-05
PR	0.00	[feat] add MNN support to README.	wangzhaode	—	2025-11-10
PR	0.00	fix: Loading LoRA parameters which saved from multi-card tra	yuyu5333	—	2025-11-06
PR	0.00	Update eval_llm.py	AnHeXi	—	2025-11-03
PR	0.00	Merge pull request #1 from jingyaogong/master	AlvinScrp	—	2025-10-29
PR	0.00	Hope for Integrate swanlab希望集成SwanLab实验跟踪工具	ShaohonChen	—	2025-02-28
PR	0.00	新增注释，解释 Attention Trainer 细节	zhenyu-02	—	2025-08-15
PR	0.00	取消模型上下文限制，增加模型动态长度扩展机制，并保持前向兼容性	hujiyo	—	2025-07-09
PR	0.00	增加可选的MLA支持、修复模型内部精度一致，优化代码add mla, fix model dtype, improve	Zephor5	—	2025-02-28
PR	0.00	Improve training performance with torch.compile and torch.am	Gouryella	—	2025-05-21
PR	0.00	升腾NPU适配	adenzhou1350	—	2025-05-16
PR	0.00	完善注释及训练脚本	llIlIllIIlIIllIl	—	2025-05-08
PR	0.00	修改 serve_openai_api.py 的默认参数	screnwei	—	2025-04-30
PR	0.00	Hotfix/issues 382	screnwei	—	2025-04-29
PR	0.00	Update eval_model.py	howard0su	—	2025-04-26
PR	0.00	sft should use pretrain model	zachzwy	—	2025-04-26
PR	0.00	完善 README 中关于加载已有模型的说明	llxxbb	—	2025-04-20
PR	0.00	chore: auto detect mps for pre train	zwpaper	—	2025-04-05
PR	0.00	Add Load ckpt	LH-and-FPGA	—	2025-04-03
PR	0.00	Little typo of readme	HaiHui886	—	2025-03-09
PR	0.00	Add the interface testing interface for model API deployment	jingsongliujing	—	2025-02-21
PR	0.00	add smart gradient accumulation	powermano	—	2025-02-21
PR	0.00	Add ckp_dir and tokenizer path	xunuohope1107	—	2025-02-18
PR	0.00	修正了训练tokenizer中的chat_template中的逻辑,以及修正了tokenizer_config.json	Singularity-M	—	2024-11-07
PR	0.00	Stabilize full SFT	Sensente	—	2025-10-23
PR	0.00	接续训练	zisu09	—	2025-05-28
PR	0.00	Fix Flash Attention attn_mask and is_causal conflict in Atte	Peace-Howard-Wang	—	2025-10-18
PR	0.00	Minimind	Tawns-lab	—	2025-10-10
PR	0.00	主要增加了直接使用huggingface模型的适配	math-zhuxy	—	2025-05-25
PR	0.00	update	zyren123	—	2025-06-28
PR	0.00	Fix bug #329 top_p 参数由 int 类型调整为 float	cn-farmer	—	2025-04-09
PR	0.00	123	happly-plane	—	2025-04-17
PR	0.00	移除构建输入文本时在开头和末尾重复添加的和	cn-farmer	—	2025-04-02
PR	0.00	feat: 优化导入和代码风格	sanshi42	—	2025-02-11
PR	0.00	Update README.md	a67793581	—	2025-02-07
PR	0.00	fix weight initialization for residual block	CohleM	—	2025-02-03
PR	0.00	fix 5-dpo train	guomin	—	2025-01-31
PR	0.00	Remove unnecessary code.	yym68686	—	2024-12-03
PR	0.00	Update 5-dpo_train.py	leoz9	—	2024-11-14
PR	0.00	fix 5-dpo_train.py bugs	StudyingLover	—	2024-10-11
PR	0.00	修复wandb bug & 添加了argparse	iomgaa-ycz	—	2024-09-24
PR	0.00	添加了wandb	iomgaa-ycz	—	2024-09-23
PR	0.00	修复了data_process.py文件的bug	iomgaa-ycz	—	2024-09-23
PR	0.00	Update requirements.txt	MuWinds	—	2024-09-05

COMMIT

0.20

[fix] harmonize template

jingyaogong

Uses technical term 'harmonize' but in b

2025-11-06

COMMIT

0.15

fix: attn_forwad when is_causal=True assert attn_mask is Non

yuyu5333

Slightly more descriptive but remains te

2025-11-18

COMMIT

0.10

[update] params log

jingyaogong

Bare-bones commit messages typical of hu

2026-01-07

COMMIT

0.10

[update] mask log

jingyaogong

Bare-bones commit messages typical of hu

2026-01-07

COMMIT

0.10

[update] readme

jingyaogong

Bare-bones commit messages typical of hu

2026-01-06

COMMIT

0.10

[update] simplify loader

jingyaogong

Bare-bones commit messages typical of hu

2026-01-05

COMMIT

0.10

[update] readme

jingyaogong

Bare-bones commit messages typical of hu

2026-01-05

COMMIT

0.10

[update] rename train tokenizer

jingyaogong

Bare-bones commit messages typical of hu

2026-01-05

COMMIT

0.10

[update] readme

jingyaogong

Bare-bones commit messages typical of hu

2026-01-05

COMMIT

0.10

[fix] messages num

jingyaogong

Bare-bones commit messages typical of hu

2026-01-04

COMMIT

0.10

[fix] dist cleanup

jingyaogong

Bare-bones commit messages typical of hu

2026-01-02

COMMIT

0.10

[feat] update yarn

jingyaogong

Extremely terse commit-style messages wi

2025-12-01

COMMIT

0.10

[feat] release memory

jingyaogong

Very brief technical phrasing typical of

2025-11-27

COMMIT

0.10

[fix] ppo mask

jingyaogong

Minimal message uses domain abbreviation

2025-11-19

COMMIT

0.10

[fix] model attn_mask

jingyaogong

Concise technical fix reference with dom

2025-11-19

COMMIT

0.10

[fix] update model

jingyaogong

Simple two-word technical instruction wi

2025-11-18

COMMIT

0.10

[fix] prompt length calculate

jingyaogong

Brief domain-specific fix reference with

2025-11-15

COMMIT

0.10

[fix] model-name

jingyaogong

Minimal hyphenated fix reference typical

2025-11-07

COMMIT

0.10

[feat] update requirements

jingyaogong

Extremely terse technical commit message

2025-10-23

COMMIT

0.10

[feat] update readme

jingyaogong

Minimal commit message with specific tec

2025-10-23

COMMIT

0.10

[feat] update readme

jingyaogong

Brief, repetitive commit message typical

2025-10-23

COMMIT

0.10

[feat] repetition-penalty

jingyaogong

Specialized ML term with no polite or fo

2025-10-23

COMMIT

0.10

[feat] convert2llama

jingyaogong

Technical shorthand conversion label wit

2025-10-23

COMMIT

0.10

[feat] shuffle data

jingyaogong

Concise data operation command, not AI-s

2025-10-23

COMMIT

0.10

[fix] loss-issues-430

jingyaogong

Issue-specific fix reference with techni

2025-10-23

COMMIT

0.10

[fix] restore

jingyaogong

Single-word commit message - too minimal

2025-10-23

COMMIT

0.10

[fix] issue-431

jingyaogong

Issue number reference in terse human-st

2025-10-23

COMMIT

0.10

[fix] sampler-ddp

jingyaogong

Technical DDP sampler fix with domain-sp

2025-10-23

COMMIT

0.00

[update] random seed

jingyaogong

Very terse, standard commit message styl

2026-03-27

COMMIT

0.00

[update] fp16 inference

jingyaogong

Brief, domain-specific commit message.

2026-03-27

COMMIT

0.00

[update] image

jingyaogong

Terse, mechanical commit; typical human

2026-03-26

COMMIT

0.00

[update] change default seq_len

jingyaogong

Concise and technical; common human comm

2026-03-26

COMMIT

0.00

[update] minimind-3

jingyaogong

Very brief, informal; no AI hallmarks pr

2026-03-24

COMMIT

0.00

[fix] align log/save last-step check and ETA with 1-indexed

readlnh

Specific technical fix; human-like jargo

2026-03-24

COMMIT

0.00

[fix] gradient accumulation step alignment

readlnh

Direct technical fix; human engineering

2026-03-24

COMMIT

0.00

[update] empty_think_ratio

jingyaogong

Minimal, to-the-point; lacks AI politene

2026-02-06

COMMIT

0.00

[update] empty_think_ratio

jingyaogong

Repetitive, brief update; likely human c

2026-02-05

COMMIT

0.00

[feat] data process

jingyaogong

Ambiguous but informal; typical human sh

2026-02-05

COMMIT

0.00

[update] save interval

jingyaogong

Concise and clear; common human commit m

2026-01-30

COMMIT

0.00

[update] safe half

jingyaogong

Very brief, technical term; no AI stylis

2026-01-30

COMMIT

0.00

[fix] data skip

jingyaogong

Extremely terse and informal phrasing ty

2026-01-18

COMMIT

0.00

[update] shuffle data

jingyaogong

Brief, informal update note with common

2026-01-18

COMMIT

0.00

[fix] max length

jingyaogong

Minimalist technical fix; lacks any poli

2026-01-17

COMMIT

0.00

[update] pretrain load

jingyaogong

Concise, technical update note with stan

2026-01-17

COMMIT

0.00

[update] align mask

jingyaogong

Very short and specific; common ML/engin

2026-01-15

COMMIT

0.00

[update] align loss

jingyaogong

Terse technical update; no hallmark AI p

2026-01-14

COMMIT

0.00

[fix] compile unpack

jingyaogong

Brief human-style fix message ('fix comp

2026-01-14

COMMIT

0.00

[feat] add compile

jingyaogong

Succinct feature addition; common Git co

2026-01-14

COMMIT

0.00

[update] prompt prefill

jingyaogong

Direct technical update; uses specific j

2026-01-13

COMMIT

0.00

[update] show speed

jingyaogong

Short, informal update typical of human

2026-01-07

COMMIT

0.00

[update] rename reason

jingyaogong

Bare-bones commit messages typical of hu

2026-01-05

COMMIT

0.00

[update] aux loss

jingyaogong

Terse commit-style messages with technic

2026-01-01

COMMIT

0.00

[fix] experts unused

jingyaogong

Concise fix notation typical of develope

2025-12-31

COMMIT

0.00

[fix] layers set 8

jingyaogong

Very brief technical notation lacking AI

2025-12-31

COMMIT

0.00

[fix] moe unused

jingyaogong

Short technical fix message with domain-

2025-12-31

COMMIT

0.00

[feat] get params

jingyaogong

Simple feature description with no AI-st

2025-12-31

COMMIT

0.00

[feat] get params

jingyaogong

Identical to previous item; typical huma

2025-12-31

COMMIT

0.00

[feat] update config

jingyaogong

Brief config update notation without AI

2025-12-31

COMMIT

0.00

[feat] update lr

jingyaogong

Abbreviated technical term (lr) with no

2025-12-31

COMMIT

0.00

[feat] compatible tokenizer

jingyaogong

Concise feature description using domain

2025-12-31

COMMIT

0.00

[feat] stream load data

jingyaogong

Terse technical notation typical of deve

2025-12-28

COMMIT

0.00

[feat] remove empty_cache

jingyaogong

Terse, technical message with typical Gi

2025-12-26

COMMIT

0.00

[feat] explicit left padding

jingyaogong

Concise, technical message with standard

2025-12-23

COMMIT

0.00

[fix] lora weight

jingyaogong

Minimal, direct technical fix descriptio

2025-12-22

COMMIT

0.00

Fix: support loading DDP-saved LoRA weights for inference

whiteswordLI

Direct technical description with specif

2025-12-22

COMMIT

0.00

[feat] adjust seq length

jingyaogong

Very brief, technical update typical of

2025-12-14

COMMIT

0.00

[feat] update readme

jingyaogong

Very brief update message, common for Gi

2025-12-11

COMMIT

0.00

[fix] dtype & lr

jingyaogong

Extremely concise technical fix with com

2025-12-09

COMMIT

0.00

[fix] Refactor get_lr function to include min_lr calculation

dyhuachi

Detailed, specific technical explanation

2025-12-06

COMMIT

0.00

[fix] reduce aux_loss_alpha

jingyaogong

Minimal technical message about a parame

2025-12-05

COMMIT

0.00

[fix] cuda memory #559

jingyaogong

Terse, context-specific technical messag

2025-12-01

COMMIT

0.00

[feat] add MNN support to README.

wangzhaode

Specific feature addition but remains co

2025-11-10

COMMIT

0.00

[feat] clear cache

jingyaogong

Terse, informal commit style typical of

2025-11-06

COMMIT

0.00

[fix] harmonize template

jingyaogong

Concise technical wording, lacks AI form

2025-11-02

COMMIT

0.00

[feat] update import

jingyaogong

Brief and direct, common in human commit

2025-10-31

COMMIT

0.00

[feat] update readme

jingyaogong

Minimal update note, no AI stylistic mar

2025-10-30

COMMIT

0.00

[feat] update readme

jingyaogong

Repetitive simple update, typical human

2025-10-30

COMMIT

0.00

[feat] update readme

jingyaogong

Same as previous, no signs of AI generat

2025-10-30

COMMIT

0.00

[feat] update datasets

jingyaogong

Direct technical term, no AI phrasing.

2025-10-30

COMMIT

0.00

[feat] update args

jingyaogong

Short, informal, lacks any AI hallmarks.

2025-10-30

COMMIT

0.00

[feat] add args

jingyaogong

Minimalist and terse, characteristic of

2025-10-30

COMMIT

0.00

[feat] update readme

jingyaogong

Identical to other updates, clearly huma

2025-10-29

COMMIT

0.00

[fix] model device

jingyaogong

Very terse, technical commit format with

2025-10-29

COMMIT

0.00

[feat] update trainer

jingyaogong

Minimal, repetitive commit title typical

2025-10-28

COMMIT

0.00

[feat] update trainer

jingyaogong

Identical to previous; likely batch huma

2025-10-28

COMMIT

0.00

[feat] update readme

jingyaogong

Brief, standard commit to update documen

2025-10-27

COMMIT

0.00

[feat] update readme

jingyaogong

Repetitive format, no AI stylistic phras

2025-10-26

COMMIT

0.00

[feat] update eval-llm

jingyaogong

Abbreviated, informal module name like '

2025-10-26

COMMIT

0.00

[feat] pause-training

jingyaogong

Hyphenated, descriptive feature name com

2025-10-26

COMMIT

0.00

[feat] update readme

jingyaogong

Repetitive, minimal content typical of m

2025-10-23

COMMIT

0.00

[feat] update readme

jingyaogong

Identical pattern; no AI hallmarks like

2025-10-23

COMMIT

0.00

[feat] update requirements

jingyaogong

Terse, technical commit typical of depen

2025-10-23

COMMIT

0.00

[fix] graph-oom & ddp-pos_cis

jingyaogong

Extremely terse and informal, typical of

2025-10-23

COMMIT

0.00

[fix] git track

jingyaogong

Short, minimal text with informal phrasi

2025-10-21

COMMIT

0.00

[feat] update readme

jingyaogong

Brief and typical human update to docume

2025-10-21

COMMIT

0.00

[feat] minimind-2510

jingyaogong

Concise, likely a project-specific refer

2025-10-21

COMMIT

0.00

[feat] update eval

jingyaogong

Very terse; lacks any AI-style politenes

2025-10-17

COMMIT

0.00

[feat] update requirements

jingyaogong

Minimal description, common for dependen

2025-10-16

COMMIT

0.00

[fix] update model

jingyaogong

Too brief and direct to be AI-generated.

2025-10-16

COMMIT

0.00

[fix] update model

jingyaogong

Identical to previous; no AI hallmarks.

2025-10-16

PR

0.00

使用einops进一步提升代码可读性

wizardforcel

Chinese text, concise and topic-specific

2026-03-27

PR

0.00

[feat] add dapo algorithm

LittleExian

Detailed technical phrasing, human tone,

2026-03-27

PR

0.00

merge redundant forward passes for logps and aux_loss (in tr

Dxpsk

—

2026-03-24

PR

0.00

添加数据集加载逻辑、网页内容抓取与数据预处理逻辑

DiracSeas

—

2025-12-16

PR

0.00

[docs] Fix wording in RLHF section of README.md file

vanking20000918

—

2026-01-27

PR

0.00

[docs]: clarify pretraining data format in README

dyhuachi

—

2025-12-31

PR

0.00

refactor: optimize tensor wrapping in lm_dataset.py

Dear47

—

2025-12-20

PR

0.00

[fix] 修复训练脚本中 1-indexed step 与 0-indexed 逻辑混用的问题

readlnh

—

2026-03-24

PR

0.00

Fix SFT resume with torch.compile enabledfix sft resume with

DeliWang

—

2026-03-19

PR

0.00

新建test分支

upwardflow

—

2026-03-22

PR

0.00

[mod & add] fix spo algorithm, add dapo and cispo algorithm

vanking20000918

—

2026-01-30

PR

0.00

Mega

yangnianboy

—

2025-06-26

PR

0.00

Add dynamic growth pipeline, eval tooling, and overnight run

spectramaster

—

2026-02-22

PR

0.00

更新了 model / trainer 中的注释 & PyTorch 新版本 Automatic Mixed Preci

Bader-CN

—

2026-02-04

PR

0.00

Update requirements.txt

CharlieZhuang-Code

—

2024-10-28

PR

0.00

Auto tokenizer name path fix

krmst

—

2024-10-03

PR

0.00

Update requirements

krmst

—

2024-10-01

PR

0.00

[add] add gating term on po algorithm

vanking20000918

—

2026-02-03

PR

0.00

add muon optimizer

guo-sj

—

2026-02-02

PR

0.00

modified: .gitignore

FWJ321

—

2026-01-19

PR

0.00

Fix DPO loss_mask boundary (include first assistant token)

xiao-baia

—

2026-01-07

PR

0.00

[feat] Support Minimind retrieval-augmented generation (RAG)

ztzhu1

—

2025-11-16

PR

0.00

perf: merge LoRA weights into model for inference

whiteswordLI

—

2025-12-23

PR

0.00

Create RESOURCES.md for MiniMind project

VinodHatti-AI-Developer

—

2025-10-24

PR

0.00

Perf/merge lora weights

whiteswordLI

—

2025-12-23

PR

0.00

Fix: support loading DDP-saved LoRA weights for inference

whiteswordLI

—

2025-12-22

PR

0.00

[feat] add interactive notebook

Nijikadesu

—

2025-02-23

PR

0.00

[feat] Add Training Web UI

yuyu5333

—

2025-11-06

PR

0.00

fix: 调整model_lora.py里面lora作用的对象

Litmeb

—

2025-12-05

PR

0.00

Add attention gate

Peace-Howard-Wang

—

2025-12-12

PR

0.00

feat: 增加 LoRA alpha 缩放系数及命令行支持

Peace-Howard-Wang

—

2025-12-12

PR

0.00

[fix] Refactor get_lr function to include min_lr calculation

dyhuachi

—

2025-12-06

PR

0.00

feat: add merge_lora.py to support merging LoRA weights into

dyhuachi

—

2025-12-05

PR

0.00

第一次尝试

wzyandyzw

—

2025-11-21

PR

0.00

[Security] Fix HIGH vulnerability: trailofbits.python.pickle

orbisai0security

—

2025-11-19

PR

0.00

fix: attn_forwad when is_causal=True assert attn_mask is Non

yuyu5333

—

2025-11-18

PR

0.00

Train_Grpo 添加注释

chengyuZou

—

2025-10-26

PR

0.00

[feat] update model install method

Explorer-Dong

—

2025-11-05

PR

0.00

[feat] add MNN support to README.

wangzhaode

—

2025-11-10

PR

0.00

fix: Loading LoRA parameters which saved from multi-card tra

yuyu5333

—

2025-11-06

PR

0.00

Update eval_llm.py

AnHeXi

—

2025-11-03

PR

0.00

Merge pull request #1 from jingyaogong/master

AlvinScrp

—

2025-10-29

PR

0.00

Hope for Integrate swanlab希望集成SwanLab实验跟踪工具

ShaohonChen

—

2025-02-28

PR

0.00

新增注释，解释 Attention Trainer 细节

zhenyu-02

—

2025-08-15

PR

0.00

取消模型上下文限制，增加模型动态长度扩展机制，并保持前向兼容性

hujiyo

—

2025-07-09

PR

0.00

增加可选的MLA支持、修复模型内部精度一致，优化代码add mla, fix model dtype, improve

Zephor5

—

2025-02-28

PR

0.00

Improve training performance with torch.compile and torch.am

Gouryella

—

2025-05-21

PR

0.00

升腾NPU适配

adenzhou1350

—

2025-05-16

PR

0.00

完善注释及训练脚本

llIlIllIIlIIllIl

—

2025-05-08

PR

0.00

修改 serve_openai_api.py 的默认参数

screnwei

—

2025-04-30

PR

0.00

Hotfix/issues 382

screnwei

—

2025-04-29

PR

0.00

Update eval_model.py

howard0su

—

2025-04-26

PR

0.00

sft should use pretrain model

zachzwy

—

2025-04-26

PR

0.00

完善 README 中关于加载已有模型的说明

llxxbb

—

2025-04-20

PR

0.00

chore: auto detect mps for pre train

zwpaper

—

2025-04-05

PR

0.00

Add Load ckpt

LH-and-FPGA

—

2025-04-03

PR

0.00

Little typo of readme

HaiHui886

—

2025-03-09

PR

0.00

Add the interface testing interface for model API deployment

jingsongliujing

—

2025-02-21

PR

0.00

add smart gradient accumulation

powermano

—

2025-02-21

PR

0.00

Add ckp_dir and tokenizer path

xunuohope1107

—

2025-02-18

PR

0.00

修正了训练tokenizer中的chat_template中的逻辑,以及修正了tokenizer_config.json

Singularity-M

—

2024-11-07

PR

0.00

Stabilize full SFT

Sensente

—

2025-10-23

PR

0.00

接续训练

zisu09

—

2025-05-28

PR

0.00

Fix Flash Attention attn_mask and is_causal conflict in Atte

Peace-Howard-Wang

—

2025-10-18

PR

0.00

Minimind

Tawns-lab

—

2025-10-10

PR

0.00

主要增加了直接使用huggingface模型的适配

math-zhuxy

—

2025-05-25

PR

0.00

update

zyren123

—

2025-06-28

PR

0.00

Fix bug #329 top_p 参数由 int 类型调整为 float

cn-farmer

—

2025-04-09

PR

0.00

123

happly-plane

—

2025-04-17

PR

0.00

移除构建输入文本时在开头和末尾重复添加的和

cn-farmer

—

2025-04-02

PR

0.00

feat: 优化导入和代码风格

sanshi42

—

2025-02-11

PR

0.00

Update README.md

a67793581

—

2025-02-07

PR

0.00

fix weight initialization for residual block

CohleM

—

2025-02-03

PR

0.00

fix 5-dpo train

guomin

—

2025-01-31

PR

0.00

Remove unnecessary code.

yym68686

—

2024-12-03

PR

0.00

Update 5-dpo_train.py

leoz9

—

2024-11-14

PR

0.00

fix 5-dpo_train.py bugs

StudyingLover

—

2024-10-11

PR

0.00

修复wandb bug & 添加了argparse

iomgaa-ycz

—

2024-09-24

PR

0.00

添加了wandb

iomgaa-ycz

—

2024-09-23

PR

0.00

修复了data_process.py文件的bug

iomgaa-ycz

—

2024-09-23

PR

0.00

Update requirements.txt

MuWinds

—

2024-09-05

jingyaogong/minimind