[Misc] Add brixbench benchmark module#2165
Conversation
There was a problem hiding this comment.
Code Review
This pull request introduces a benchmark suite for AIBrix, including a test runner, deployment logic for various platforms, and support for running vLLM benchmarks. The feedback highlights concerns regarding the use of fixed sleeps, the security and portability issues of shelling out to bash, the need for more robust file ignoring in .gitignore, and a more reliable method for extracting benchmark results from pods.
|
@whalepark can you sign the commit and address gemini comments?
|
Thanks for the detailed review. I pushed follow-up commits addressing the feedback. Summary:
Verification:
|
|
Pushed the latest updates to feature/add-brixbench. New commits: What changed:
Verification:
|
| @@ -0,0 +1,452 @@ | |||
| # AIBrix Benchmark Suite - Walkthrough Guide | |||
There was a problem hiding this comment.
change to README.md?
Feel free to add docs under project_root/docs as well
| engine: | ||
| type: vllm | ||
| manifest: testdata/deployments/aibrix/models/pd-model.yaml | ||
| benchmark: testdata/benchmarks/vllm-chat-smoke-pd.yaml |
There was a problem hiding this comment.
I checked this file, seems it's different from what I expected. looks like it still describe how to deploy a model. I am looking for the benchmark data, how we send request to gateway (rps, concurrency, arrival rate etc). do you have such configs?
kind: vllm-bench
execution: cluster
image: aibrix-container-registry-cn-beijing.cr.volces.com/autodash/vllm-bench:v0.10.2-20260118
namespace: brixbench-adhoc
podName: vllm-bench-client
modelHostPath: /data01/models
rootHostPath: /root
artifacts:
resultFilename: bench_results.json
logDir: testdata/logs
vllmArgs:
There was a problem hiding this comment.
You are right, the walkthrough was focused on deployment/scenario wiring and did not clearly explain the request workload config. The request-side benchmark settings live in the benchmark YAML referenced by each test case, under vllmArgs. (brixbench/benchmark/testdata/benchmarks/vllm-chat-smoke-pd.yaml) I will update the README to add a dedicated benchmark workload section covering request rate, concurrency, num prompts, endpoint, dataset name, and routing specific options.
There was a problem hiding this comment.
brixbench/benchmark/testdata/deployments/aibrix/custom/aibrix-dependency-custom.yaml
and
brixbench/benchmark/testdata/deployments/aibrix/custom/aibrix-core-custom.yaml
are too heavy, do not check in and let's get this removed. Instead, I highly suggest to use production recommended way to deploy it. please check whether helm has some sdk so we can use helm. If not, we can use kustomize to generate the right configuration.
There was a problem hiding this comment.
Let me remove them. Thanks for pointing out.
Signed-off-by: Misun Park <misuneeh@gmail.com>
Pull Request Description
This PR imports the initial
brixbenchbenchmark module into theaibrixrepository.Included in this import:
brixbench/brixbenchmodule insideaibrixAdditional cleanup included in this PR:
github.com/vllm-project/aibrix/brixbench.tmp/releases/<version>/Notes:
May 21 2026
Addressed the latest review feedback and pushed follow-up commits.
Key updates:
Verification:
May 27 2026
Pushed follow-up updates for the latest review comments.
Changes:
Verification: