fix prefill_params when prefill num_reqs > 1024 by shihaobai · Pull Request #1336 · ModelTC/LightLLM

shihaobai · 2026-06-08T07:07:17Z

No description provided.

gemini-code-assist

Code Review

This pull request fixes a bug in the _gen_cumsum_pad0_kernel function within lightllm/common/basemodel/triton_kernel/gen_prefill_params.py. It corrects the memory offset calculation inside the loop by replacing offs with current_offs when loading b_q_seq_len and b_kv_seq_len, ensuring the correct data is loaded for each iteration. No review comments were provided, so there is no additional feedback to address.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

fix prefill_params when prefill num_reqs > 1024

95f4ca4

gemini-code-assist Bot reviewed Jun 8, 2026

View reviewed changes

hiworldwzj merged commit 5514e24 into main Jun 8, 2026
1 check passed

hiworldwzj deleted the prefill_para_fix branch June 8, 2026 07:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix prefill_params when prefill num_reqs > 1024#1336

fix prefill_params when prefill num_reqs > 1024#1336
hiworldwzj merged 1 commit into
mainfrom
prefill_para_fix

shihaobai commented Jun 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

shihaobai commented Jun 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants