fix prefill_params when prefill num_reqs > 1024 #1336
Conversation
Code Review
This pull request fixes a bug in the _gen_cumsum_pad0_kernel function in lightllm/common/basemodel/triton_kernel/gen_prefill_params.py. Inside the kernel's loop, b_q_seq_len and b_kv_seq_len were loaded using offs instead of current_offs; the fix switches both loads to current_offs so that each iteration reads its own slice of the data. No review comments were provided.
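To illustrate the class of bug described above, here is a minimal pure-Python sketch of a blocked cumulative sum with a leading zero. It is not the kernel from this PR, whose source is not shown here: the names BLOCK, cumsum_pad0, and use_current_offs are hypothetical, the block size is shrunk to 4 for readability, and only offs and current_offs are taken from the summary. The sketch assumes the mask is computed from current_offs, which would explain why the problem only appears once the input spans more than one block.

```python
from itertools import accumulate

BLOCK = 4  # hypothetical stand-in for the kernel's block size


def cumsum_pad0(seq_lens, use_current_offs=True):
    """Blocked cumulative sum with a leading 0 (illustrative, not the real kernel)."""
    n = len(seq_lens)
    out = [0] * (n + 1)
    offs = list(range(BLOCK))  # base lane offsets, constant across iterations
    running = 0                # carry accumulated from earlier blocks
    for start in range(0, n, BLOCK):
        current_offs = [start + o for o in offs]  # offsets for this iteration
        # The bug: loading with `offs` re-reads the first block every time.
        load_offs = current_offs if use_current_offs else offs
        # Masked load: lanes whose current offset is out of range read 0.
        block = [seq_lens[l] if c < n else 0
                 for l, c in zip(load_offs, current_offs)]
        partial = list(accumulate(block))
        for i, c in enumerate(current_offs):
            if c < n:
                out[c + 1] = running + partial[i]
        running += partial[-1]
    return out


seq_lens = [1, 2, 3, 4, 5, 6]          # longer than one block
fixed = cumsum_pad0(seq_lens, use_current_offs=True)
buggy = cumsum_pad0(seq_lens, use_current_offs=False)
```

With an input that fits in a single block, both variants agree, because offs and current_offs are identical in the first iteration. Once the input is longer than BLOCK, the buggy variant adds values from the first block again instead of the current one, so the later prefix sums are wrong.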
No description provided.