Enable the graph mode and add the full warmup logic for Deepseek OCR model #2138

HeJunyan · 2025-11-17T08:52:35Z

The Deepseek OCR model uses fixed input tensor shape, so there is no need for padding.
Its input is different from the other MM models so a new fake data set is added to warm it up.

czhu15 · 2025-12-08T14:42:39Z

vllm/worker/hpu_model_runner.py

    '''

-    def __init__(self, is_batch_based):
+    def __init__(self, is_batch_based, sub_image_list=None):


suggest to add the description of the argument of is_batch_based and sub_image_list in the defination.
sub_image_list seems like a list of image objects. But from line 117, maybe it is a list of integers?

vllm/worker/hpu_model_runner.py

HeJunyan · 2025-12-08T15:58:02Z

I verified the PR in full warmup mode, the result is unchanged. So it is OK.

czhu15

LGTM

HeJunyan requested review from PatrykWo, afierka-intel, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe and xuechendi as code owners November 17, 2025 08:52

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch from dd04d22 to 8f1c5fa Compare November 26, 2025 10:08

Fix the issue when input of images_crop is a list

2c98efb

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch 2 times, most recently from 8c391a6 to b560b50 Compare December 8, 2025 14:15

Correct the model type of deepseek ocr in config

2e632fe

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch from b560b50 to ebf2220 Compare December 8, 2025 14:26

czhu15 reviewed Dec 8, 2025

View reviewed changes

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch from ebf2220 to 51dfa56 Compare December 8, 2025 15:55

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch 2 times, most recently from 44f67ad to e1ab76b Compare December 8, 2025 16:13

Add warmup and graph mode for deepseek ocr model

363a85a

HeJunyan force-pushed the improvement_performance_deepseek_ocr branch from e1ab76b to 363a85a Compare December 8, 2025 16:23

czhu15 approved these changes Dec 10, 2025

View reviewed changes

czhu15 merged commit d38f197 into HabanaAI:aice/v1.22.0 Dec 10, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable the graph mode and add the full warmup logic for Deepseek OCR model #2138

Enable the graph mode and add the full warmup logic for Deepseek OCR model #2138

Uh oh!

HeJunyan commented Nov 17, 2025 •

edited by github-actions bot

Loading

Uh oh!

czhu15 Dec 8, 2025

Uh oh!

HeJunyan Dec 8, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HeJunyan commented Dec 8, 2025

Uh oh!

czhu15 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Enable the graph mode and add the full warmup logic for Deepseek OCR model #2138

Enable the graph mode and add the full warmup logic for Deepseek OCR model #2138

Uh oh!

Conversation

HeJunyan commented Nov 17, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

czhu15 Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

HeJunyan Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HeJunyan commented Dec 8, 2025

Uh oh!

czhu15 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

HeJunyan commented Nov 17, 2025 •

edited by github-actions bot

Loading