
Conversation

@mwiktor-intel

Fixes #1900. The implementation uses two mkl::lapack routines, geqrf and orgqr, to recover the explicit Q matrix. Since torch and LAPACK use different storage formats (row-major vs. column-major), a hard transposition (changing the memory layout, not only the strides) was necessary. The iteration over the batch uses the internal memory layout of the processed data.
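The row-major vs. column-major point can be illustrated outside the PR's C++ code. A minimal NumPy sketch (NumPy is used here purely for illustration; the PR itself operates on torch tensors and oneMKL) showing why a stride swap alone is not enough and an actual copy into the other layout is needed:

```python
import numpy as np

# Row-major (C-order) matrix, as torch stores tensors by default.
a = np.arange(6, dtype=np.float64).reshape(2, 3)

# A plain transpose only swaps strides; the underlying buffer is unchanged,
# so it is not usable as column-major input to a LAPACK routine expecting
# a contiguous Fortran-order buffer.
at = a.T
assert np.shares_memory(at, a)          # still a view on the same memory
assert at.strides == (8, 24)            # strides swapped, data untouched

# A "hard" transposition -- an actual copy into column-major layout --
# produces the buffer LAPACK expects.
af = np.asfortranarray(a)
assert af.flags['F_CONTIGUOUS']
assert not np.shares_memory(af, a)      # new memory, new layout
```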


CuiYifeng commented Nov 26, 2025

  • QR is an MKL op rather than a SYCL op. Please move the kernel code to xpu/mkl/.
  • The existing xpu/mkl/BatchLinearAlgebra.cpp is a good place for the kernel code (refer to stock PyTorch). Correspondingly, op-level code can be added in xpu/BatchLinearAlgebra.cpp.
  • Please check the lint error.
  • Adding a new test case is good. Please also check whether there are related cases in test/xpu/skip_list_common.py; if so, please reactivate them.


Silv3S commented Nov 26, 2025

There were no tests on the skip lists because QR was silently falling back to CPU. Now that the fallback is removed, some tests may start to fail.


@CuiYifeng CuiYifeng left a comment


Please check the comments and fix the related failing cases.

Comment on lines +69 to +83
TORCH_IMPL_FUNC(linalg_qr_xpu_out)
(const Tensor& A,
 std::string_view mode,
 const Tensor& Q,
 const Tensor& R) {
#if defined(USE_ONEMKL_XPU)
  xpu::linalg_qr_kernel(A, mode, Q, R);
#else
  // Without oneMKL, fall back to the CPU implementation and copy back.
  auto A_cpu = A.to(at::kCPU);
  auto Q_cpu = at::empty_like(Q, at::kCPU);
  auto R_cpu = at::empty_like(R, at::kCPU);
  at::cpu::linalg_qr_out(Q_cpu, R_cpu, A_cpu, mode);
  Q.copy_(Q_cpu);
  R.copy_(R_cpu);
#endif // USE_ONEMKL_XPU
}

My suggestion is to register geqrf_kernel_xpu/orgqr_kernel_xpu to geqrf_stub/orgqr_stub, which would allow us to reuse the op-level code in stock PyTorch and to reuse these two kernels in the future.
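The geqrf → orgqr pipeline the comment refers to can be sketched at the Python level with SciPy's low-level LAPACK wrappers, as a stand-in for the oneMKL calls (dgeqrf/dorgqr here are SciPy's names, not the PR's kernels):

```python
import numpy as np
from scipy.linalg import lapack

rng = np.random.default_rng(0)
a = rng.standard_normal((4, 4))

# Step 1: geqrf factors A into Householder reflectors (packed below the
# diagonal of `qr`) plus the scalar factors `tau`.
qr, tau, work, info = lapack.dgeqrf(a)
assert info == 0

# R is the upper triangle of the packed result.
r = np.triu(qr)

# Step 2: orgqr expands the reflectors into an explicit ("pure") Q.
q, work, info = lapack.dorgqr(qr, tau)
assert info == 0

# Q is orthogonal and Q @ R reconstructs A.
assert np.allclose(q.T @ q, np.eye(4))
assert np.allclose(q @ r, a)
```

Registering the two kernels to the stubs would let stock PyTorch's structured op code drive exactly this two-step sequence.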

Comment on lines +9446 to +9456
- func: linalg_qr(Tensor A, str mode='reduced') -> (Tensor Q, Tensor R)
  python_module: linalg
  variants: function
  structured_delegate: linalg_qr.out

- func: linalg_qr.out(Tensor A, str mode='reduced', *, Tensor(a!) Q, Tensor(b!) R) -> (Tensor(a!) Q, Tensor(b!) R)
  python_module: linalg
  structured: True
  dispatch:
    XPU: linalg_qr_xpu_out



Development

Successfully merging this pull request may close these issues.

implement torch.linalg.qr xpu backend

3 participants