Add step tests for chunked prefill #575
Conversation
👋 Hi! Thank you for contributing to vLLM support on Spyre. Now you are good to go 🚀
Since we need to use random tokens, sometimes they end up in the cache file and codespell complains about them.
Signed-off-by: Max de Bayser <[email protected]>
The test failure seems related to state carried over from previous tests.
bot:test
…to reproduce the conditions of the bug that PR #576 fixes (fix: check only decoding requests in _satisfies_last_chunk_constraints)
Signed-off-by: Max de Bayser <[email protected]>
wallashss left a comment:
LGTM
```python
@pytest.mark.full_model
# These values are all parameterized for test sorting
@pytest.mark.parametrize("max_num_seqs", [2])
@pytest.mark.parametrize("max_model_len", [514])
```
@tjohnson31415, the max_model_len of 514 here is what would trigger the hang before your fix.
```python
else:
    self._teardown = teardown_method

self._preexisting_max_tkv = os.getenv("VLLM_DT_MAX_BATCH_TKV_LIMIT")
```
Tests that modify VLLM_DT_MAX_BATCH_TKV_LIMIT should be using monkeypatch, which would automatically restore the value after the test. Can we use that instead of having a special-case reset here?
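For illustration, a minimal sketch of the suggested approach; the test name and the value being set are hypothetical:

```python
import pytest


def test_with_tkv_limit(monkeypatch: pytest.MonkeyPatch):
    # monkeypatch.setenv records the prior value of the variable and
    # restores it (or unsets it) automatically when the test finishes.
    monkeypatch.setenv("VLLM_DT_MAX_BATCH_TKV_LIMIT", "2048")
    ...  # run the engine with the overridden limit
```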
It's a bit more involved than a monkeypatch because we have to replicate some logic from platform.py. But it's possible to do so with a fixture if you prefer.
I don't mean setting the actual value of VLLM_DT_MAX_BATCH_TKV_LIMIT with monkeypatch, but using setenv will register with monkeypatch that it needs to restore the previous value even if something in the test overrides it.
I thought we had an example of this in the code, but I can't find it now.
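A minimal sketch of that pattern, assuming the variable may or may not already be set; the test name is hypothetical:

```python
import os

import pytest


def test_something_that_touches_tkv(monkeypatch: pytest.MonkeyPatch):
    # Re-setting the variable to its current value registers it with
    # monkeypatch, so the original value is restored at teardown even
    # if code under test overwrites it later in the run.
    current = os.environ.get("VLLM_DT_MAX_BATCH_TKV_LIMIT")
    if current is not None:
        monkeypatch.setenv("VLLM_DT_MAX_BATCH_TKV_LIMIT", current)
    ...
```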
The problem is the interaction with the model cache. The setting/resetting of this variable has to be tied to the lifecycle of the LLM engine, but pytest is not aware of that since it's not a fixture. Do you remember if the example you're referring to also dealt with this problem?
Hmm, I don't recall the previous case dealing with the model cache. I'm not very familiar with how the caching works...
tjohnson31415 left a comment:
LGTM
Description
This PR adds step-by-step execution tests for the chunked prefill scheduler. It tests several scenarios.
The tests show that the interleaving of requests is working as expected.
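As a rough illustration of the step-by-step testing pattern (using an invented toy scheduler as a stand-in; the PR's actual tests drive the real vllm-spyre scheduler):

```python
class ToyChunkedScheduler:
    """Toy stand-in for illustration only, not the repo's scheduler."""

    def __init__(self, chunk_size: int):
        self.chunk_size = chunk_size
        self.pending = {}  # req_id -> remaining prompt tokens

    def add_request(self, req_id: str, prompt_len: int):
        self.pending[req_id] = prompt_len

    def step(self) -> list[str]:
        # Schedule one prompt chunk per waiting request, oldest first.
        scheduled = []
        for req_id in list(self.pending):
            self.pending[req_id] -= self.chunk_size
            scheduled.append(req_id)
            if self.pending[req_id] <= 0:
                del self.pending[req_id]
        return scheduled


def test_interleaving():
    sched = ToyChunkedScheduler(chunk_size=128)
    sched.add_request("long", prompt_len=300)   # needs 3 chunks
    sched.add_request("short", prompt_len=100)  # fits in 1 chunk

    # Assert which requests are scheduled at each engine step.
    assert sched.step() == ["long", "short"]    # both get a chunk
    assert sched.step() == ["long"]             # short is already done
    assert sched.step() == ["long"]             # last chunk of long
    assert sched.step() == []
```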