Pull requests: PaddlePaddle/PaddleFormers
Pull requests list
fix separate_mtp_headloss aoa for hf save & load
#4420
opened May 9, 2026 by
Wennie396
Contributor
Fix reshape for zcc master weight save
#4419
opened May 9, 2026 by
changeyoung98
Contributor
2 tasks
[Qwen2MoE] Fix h1_normed gradient accumulation order divergence via probe hook
#4407
opened May 8, 2026 by
a31413510
Collaborator
[Qwen2MoE] Fix gradient alignment: remove fake_path and use clear_grad(set_to_zero=False)
#4405
opened May 8, 2026 by
a31413510
Collaborator
fix SFTDataset : refine num_proc and num_workers co-existence check
#4404
opened May 8, 2026 by
weiyixuanxx
Contributor
2 tasks done
fix(qwen2_moe): use active-expert-only iteration in SparseMoeBlock
#4400
opened May 7, 2026 by
a31413510
Collaborator
[Trainer | Cherry-Pick] feat(callback): support Fleet MoE class and GlobalRNGCallback
#4359
opened Apr 27, 2026 by
hushenwei2000
Contributor
[Qwen3MoE] Fix gradient alignment between PaddlePaddle and PyTorch/HF
#4358
opened Apr 25, 2026 by
a31413510
Collaborator
[release/1.1] Add high_precision_rope cfg
contributor
#4354
opened Apr 24, 2026 by
risemeup1111
2 tasks
[Deps] pin paddlecodec to >=0.1, <0.2 for Paddle 3.3 compatibility
#4349
opened Apr 24, 2026 by
SigureMo
Member
1 of 2 tasks