Open
Labels
bug (Something isn't working), module: distributed (For distributed feature issue)
Description
🐛 Describe the bug
Please get the wheels from https://github.com/intel/torch-xpu-ops/actions/runs/18399579512/job/52471088040 or download them with gh:
gh run download 18399579512 --repo intel/torch-xpu-ops --name Torch-XPU-Wheel-1826 --dir path --pattern "*.zip"
Then clone the test branch and run the failing test:
git clone -b distributed_2.10 https://github.com/daisyden/pytorch.git
cd pytorch
pip install -r requirements.txt
pytest -v test/distributed/test_device_mesh.py::TestDeviceMeshGetItem::test_flatten_mesh_3d
FAILED [2.6097s] ../../../../test/distributed/test_device_mesh.py::TestDeviceMeshGetItem::test_flatten_mesh_3d - RuntimeError: Process 2 exited with error code 10 and exception:
Traceback (most recent call last):
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/common_distributed.py", line 912, in run_test
    getattr(self, test_name)()
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/common_distributed.py", line 760, in wrapper
    fn()
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3280, in wrapper
    method(*args, **kwargs)
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/distributed/_tensor/common_dtensor.py", line 510, in wrapper
    raise e
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/distributed/_tensor/common_dtensor.py", line 507, in wrapper
    func(self, *args, **kwargs) # type: ignore[misc]
  File "/home/jenkins/actions-runner/_work/torch-xpu-ops/torch-xpu-ops/pytorch/test/distributed/test_device_mesh.py", line 903, in test_flatten_mesh_3d
    self.assertEqual(flattened_dp_cp_mesh.get_group().group_desc, "mesh_dp_cp")
  File "/tmp/xpu-tool/Python/3.10.19/x64/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4235, in assertEqual
    raise error_metas.pop()[0].to_error( # type: ignore[index]
AssertionError: String comparison failed: 'default_pg' != 'mesh_dp_cp'
- default_pg
+ mesh_dp_cp
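For context, the failing assertion compares the process group's description against a name built from the flattened mesh dimensions. Below is a minimal sketch of that comparison. It assumes the expected description is "mesh_" followed by the dimension names joined with underscores, which matches the string in the traceback but is not confirmed elsewhere in this report. `expected_group_desc` is a hypothetical helper for illustration, not a PyTorch API.

```python
def expected_group_desc(dim_names):
    """Hypothetical helper: build the description the test expects.

    Assumption: the description is "mesh_" plus the flattened
    dimension names joined by underscores.
    """
    return "mesh_" + "_".join(dim_names)


# The test flattens the "dp" and "cp" dimensions of the mesh.
expected = expected_group_desc(("dp", "cp"))

# Value taken from the AssertionError in the traceback.
observed = "default_pg"

# The mismatch suggests the flattened mesh returned the default
# process group rather than a dedicated group for the new dimension.
print(f"expected={expected!r} observed={observed!r} match={expected == observed}")
```

If the naming assumption holds, the observed value 'default_pg' would mean the group description was never set for the flattened mesh on this backend, or that the default group was returned in its place.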
Versions
pytorch: https://github.com/daisyden/pytorch/tree/distributed_2.10