[SPARK-53623][SQL] improve reading large table properties performance
### What changes were proposed in this pull request?
`CatalogColumnStat.readLargeTableProp` is an O(N) operation, where N is the number of table properties. When it is called repeatedly for a table with many properties, the total cost effectively becomes O(N^2), which can be very slow for such tables.
This PR reduces the overall complexity to O(N) by constructing the large table property only if `numParts` exists.
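The idea can be illustrated with a minimal Python sketch; this is not the Spark implementation. It assumes a layout in which a large value is split across a `<key>.numParts` entry and `<key>.part.<i>` entries; those key names and the function name `read_large_table_prop` are assumptions made for illustration. The point is that every step is a direct map lookup, so no scan over all properties is needed to decide whether a split value exists.

```python
from typing import Mapping, Optional


def read_large_table_prop(props: Mapping[str, str], key: str) -> Optional[str]:
    """Hypothetical sketch: read a possibly split property with direct lookups only."""
    # Case 1: the value fits in a single entry and is stored under the key itself.
    if key in props:
        return props[key]

    # Case 2: look up the part count directly instead of scanning every
    # property for a matching prefix (the scan is the O(N) step to avoid).
    num_parts = props.get(f"{key}.numParts")
    if num_parts is None:
        return None  # the property is simply absent

    # Reassemble the value from its numbered parts (assumed key layout).
    parts = []
    for index in range(int(num_parts)):
        part_key = f"{key}.part.{index}"
        if part_key not in props:
            raise KeyError(f"missing part {index} of {num_parts} for {key!r}")
        parts.append(props[part_key])
    return "".join(parts)
```

With this shape, each call costs time proportional to the number of parts of the requested property rather than to the total number of table properties.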
### Why are the changes needed?
To fix a performance issue that was unintentionally introduced by an earlier change.
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
Existing unit tests. A previous patch (#52355) already tested the side effect of this change.
### Was this patch authored or co-authored using generative AI tooling?
No
Closes #52374 from yeshengm/improve-read-large-prop.
Authored-by: Yesheng Ma <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>