Skip to content

Conversation

@kiranandcode
Copy link
Contributor

This PR modifies test_handlers_llm.py to implement record/replay fixtures. By default, when running locally completion calls are dumped to the folder fixtures with the test id that they are being run in.

When running in CI, the script will instead deserialize the fixtures instead of calling the LLM provider endpoint.

This should allow seamless testing of LLM behaviours locally, while making sure CI runs are deterministic.

Closes #458

@kiranandcode kiranandcode requested a review from jfeser December 26, 2025 21:10
@kiranandcode kiranandcode linked an issue Dec 26, 2025 that may be closed by this pull request
@kiranandcode kiranandcode force-pushed the kg-record-replay-llm-requests branch from 56834ba to 6c45af0 Compare December 31, 2025 12:35
@jfeser jfeser merged commit 23f95ef into staging-llm Dec 31, 2025
6 checks passed
@jfeser jfeser deleted the kg-record-replay-llm-requests branch December 31, 2025 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

LLM integration tests should be made more reliable

4 participants