Fix pdist numerical issue on large input size #2181

Kanya-Mo · 2025-10-15T23:45:57Z

This is to fix #2021.
The current algorithm uses the following formula to calculate the index, which is numerically unstable when n is large (and j = n-1). The sqrt needs to be computed in double precision to ensure correct results after casting to int. This change also aligns with the CUDA implementation.

torch-xpu-ops/src/ATen/native/xpu/sycl/DistanceKernels.cpp

Line 732 in 779f899

(n2_val_ - device_sqrt<accscalar_t>(n2_squared_minus_1_val_ - 2 * k)));
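To illustrate why the precision of the sqrt matters, here is a minimal host-side sketch of the same index recovery, templated on the floating type used for the intermediate arithmetic. It is not the kernel code: the function names, the use of `std::sqrt` in place of `device_sqrt`, the `isfinite` guard, and the choice of n = 50000 as a "large" size are all assumptions made for illustration. It checks only the first and last flat index of each row, since the last one (j = n-1) is where the description says the instability appears.

```cpp
#include <cmath>
#include <cstdint>

// Hypothetical helper: recover the row index i of the pair (i, j), i < j,
// from the flat index k into the condensed pdist output for n points.
// T is the floating type used for the intermediate arithmetic.
template <typename T>
int64_t row_from_flat_index(int64_t n, int64_t k) {
  const T n2 = static_cast<T>(n) - static_cast<T>(0.5);
  const T n2_squared_minus_1 = n2 * n2 - static_cast<T>(1);
  const T r = n2 - std::sqrt(n2_squared_minus_1 - static_cast<T>(2 * k));
  // Guard added for this sketch only: rounding can make the radicand
  // negative, and converting NaN to an integer is undefined behaviour.
  if (!std::isfinite(r)) {
    return -1;
  }
  return static_cast<int64_t>(r);  // truncation acts as floor for r >= 0
}

// Hypothetical helper: recover the column index j once i is known.
// Pure integer arithmetic, so it is exact whenever i is correct.
inline int64_t col_from_flat_index(int64_t n, int64_t k, int64_t i) {
  return k - n * i + i * (i + 1) / 2 + i + 1;
}

// Count how many row boundaries map to the wrong row when the
// intermediate arithmetic is done in type T.
template <typename T>
int64_t count_boundary_mismatches(int64_t n) {
  int64_t bad = 0;
  for (int64_t i = 0; i + 1 < n; ++i) {
    const int64_t first = i * n - i * (i + 1) / 2;  // k of pair (i, i+1)
    const int64_t last = first + (n - i - 1) - 1;   // k of pair (i, n-1)
    if (row_from_flat_index<T>(n, first) != i) ++bad;
    if (row_from_flat_index<T>(n, last) != i) ++bad;
  }
  return bad;
}
```

Comparing `count_boundary_mismatches<float>(50000)` with `count_boundary_mismatches<double>(50000)` shows the effect. At that size `n2 * n2` is about 2.5e9, where adjacent `float` values are 256 apart, so the radicand cannot be represented closely enough for the truncation to land on the right row. In `double` the same quantities are represented exactly.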

Use double in pdist index calculation

1687cdb

Kanya-Mo changed the title ~~(WIP) Fix pdist numerical issue on large input~~ (WIP) Fix pdist numerical issue on large input size Oct 15, 2025

jenniew mentioned this pull request Oct 16, 2025

TestTorchDeviceTypeXPU::test_pdist_norm_large_xpu AssertionError: False is not true #2021

Open

Kanya-Mo changed the title ~~(WIP) Fix pdist numerical issue on large input size~~ Fix pdist numerical issue on large input size Oct 16, 2025

Kanya-Mo requested review from chunhuanMeng and yucai-intel October 17, 2025 20:42
