-
Notifications
You must be signed in to change notification settings - Fork 59
Pull requests: hw-native-sys/simpler
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix: atomic acquire/release for sim register handshake gate (a2a3 + a5)
#1081
opened Jun 17, 2026 by
ChaoZheng109
Collaborator
Loading…
feat(a2a3/runtime): speculative early-dispatch (pre-stage + doorbell)
#1079
opened Jun 17, 2026 by
poursoul
Collaborator
Loading…
2 tasks
Rename dump tensor surface to dump args
#1072
opened Jun 16, 2026 by
vegetabledoww
Contributor
Loading…
2 tasks done
Fix: fanin producer dedup lookup cost
#1066
opened Jun 16, 2026 by
sunkaixuan2018
Contributor
Loading…
docs(args): fix stale cap literals missed by #1056
#1064
opened Jun 16, 2026 by
ChaoZheng109
Collaborator
Loading…
Refactor: drop per-run AICPU init launch; per-thread run-wall capture
#1061
opened Jun 16, 2026 by
hw-native-sys-bot
Collaborator
Loading…
3 of 4 tasks
perf(l3): remove useless exchange buffer from ring allreduce kernel
#1059
opened Jun 15, 2026 by
georgebisbas
Contributor
Loading…
Fix: make a5 L2 swimlane work onboard (host-shadow alloc + payload buffer channel)
#1058
opened Jun 15, 2026 by
indigo1973
Contributor
Loading…
docs(hardware): MMIO performance reference + 2 cann-example probe tools
#1057
opened Jun 15, 2026 by
hw-native-sys-bot
Collaborator
Loading…
1 of 3 tasks
[WIP] feat(dfx): add l0_swimlane intra-core pipeline trace tool
#1053
opened Jun 15, 2026 by
indigo1973
Contributor
Loading…
[Fix] [Performance] Fix AICPU thread affinity assignment to fit in a single AICPU package (a.k.a NUMA domain)
#1046
opened Jun 12, 2026 by
noabauma
Contributor
Loading…
Fix: keep HCCL weak-link dependency
#1032
opened Jun 11, 2026 by
sunkaixuan2018
Contributor
Loading…
fix(runtime): allow zero-size view at any offset (Tensor::view bounds-check)
#1023
opened Jun 10, 2026 by
csy0225
Loading…
2 of 3 tasks
Add: L3-L2 orchestration communication design
#1015
opened Jun 8, 2026 by
ccyywwen
Contributor
Loading…
feat: add remote l3 python session runtime
#1011
opened Jun 8, 2026 by
puddingfjz
Contributor
Loading…
Add: AICore receive_time DFX field — split head_OH into NoC + dcci/ack
#1004
opened Jun 8, 2026 by
hw-native-sys-bot
Collaborator
Loading…
4 of 6 tasks
Add: per-task and scope-filter granularity to scope_stats
#976
opened Jun 3, 2026 by
doraemonmj
Contributor
•
Draft
Add: L3/L2 host-device mapped region design
#861
opened May 26, 2026 by
ccyywwen
Contributor
Loading…
Triangular Inverse Kernel (continuation of Zouzias' impl)
#830
opened May 20, 2026 by
MirkoDeVita98
Contributor
Loading…
[Performance Analysis] Adding intra-kernel timing runs
#829
opened May 20, 2026 by
SergioMartin86
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.