-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-54280][SDP] Require pipeline checkpoint storage dir to be absolute path #52999
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you make the CI happy?
[info] SinkExecutionSuite:
[info] - writing to external sink - memory sink *** FAILED *** (2 seconds, 65 milliseconds)
| }, | ||
| "PIPELINE_STORAGE_ROOT_INVALID" : { | ||
| "message" : [ | ||
| "Pipeline storage root must be an absolute path with a URI scheme (e.g., file://, s3a://, hdfs://). Got: `<storage_root>`." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| "Pipeline storage root must be an absolute path with a URI scheme (e.g., file://, s3a://, hdfs://). Got: `<storage_root>`." | |
| "Pipeline storage root must be an absolute path with an URI scheme (e.g., file://, s3a://, hdfs://). Got: `<storage_root>`." |
...nes/src/test/scala/org/apache/spark/sql/pipelines/graph/PipelineUpdateContextImplSuite.scala
Outdated
Show resolved
Hide resolved
jaceklaskowski
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit-picking 🤷♂️
971c876 to
dd8d70a
Compare
|
Thank you for updating. It seems that we have only 1 failure, @sryza . |
|
I keep thinking I've handled them all and then finding new ones 🤦 . Working on this. |
|
There is one remaining failure. I locally verified that it's irrelevant and passes locally. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM. Thank you, @sryza . Merged to master/4.1.
…lute path ### What changes were proposed in this pull request? - Raises an error if the pipeline checkpoint storage dir is not an absolute path - Updated the init CLI to create and set a checkpoint storage dir as an absolute path ### Why are the changes needed? Prevent users from accidentally losing checkpoints. ### Does this PR introduce _any_ user-facing change? Yes, but to unreleased functionality. ### How was this patch tested? - New unit tests - Ran the init CLI and then ran pipeline with streaming table ### Was this patch authored or co-authored using generative AI tooling? Closes #52999 from sryza/storage-location-absolute. Authored-by: Sandy Ryza <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]> (cherry picked from commit 4020794) Signed-off-by: Dongjoon Hyun <[email protected]>
|
Thanks @dongjoon-hyun ! |
|
Hi, @sryza . This seems to be flaky and block other PRs. Please see the following. |
…t schema ### What changes were proposed in this pull request? Fixes the `EndToEndAPISuite`, which was broken by #52999. ### Why are the changes needed? For CI to pass ### Does this PR introduce _any_ user-facing change? No - test-only change. ### How was this patch tested? Ran tests. ### Was this patch authored or co-authored using generative AI tooling? Closes #53069 from sryza/end-to-end-storage-root. Authored-by: Sandy Ryza <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>
…t schema ### What changes were proposed in this pull request? Fixes the `EndToEndAPISuite`, which was broken by #52999. ### Why are the changes needed? For CI to pass ### Does this PR introduce _any_ user-facing change? No - test-only change. ### How was this patch tested? Ran tests. ### Was this patch authored or co-authored using generative AI tooling? Closes #53069 from sryza/end-to-end-storage-root. Authored-by: Sandy Ryza <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]> (cherry picked from commit 7599b2f) Signed-off-by: Dongjoon Hyun <[email protected]>
What changes were proposed in this pull request?
Why are the changes needed?
Prevent users from accidentally losing checkpoints.
Does this PR introduce any user-facing change?
Yes, but to unreleased functionality.
How was this patch tested?
Was this patch authored or co-authored using generative AI tooling?