-
Notifications
You must be signed in to change notification settings - Fork 481
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SYSTEMDS-3405] Write matrices and frames at site for federated write #1665
base: main
Are you sure you want to change the base?
Conversation
yes please ! |
public void setFilepath(String filepath) { | ||
_filepath = filepath; | ||
} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might want to keep FederatedData
constant and create a copy with a different filepath instead. I am open for discussion.
src/main/java/org/apache/sysds/runtime/instructions/fed/FEDInstructionUtils.java
Outdated
Show resolved
Hide resolved
Instead of having a separate format `format=federated`, we write federated if the object is federated. We also do not create a JSON file with the federated addresses and ranges, instead we add this information to the MTD file. Note that we now also aligned the specification of addresses and ranges with the usage in our `federated()` function, such that the syntax is similar.
c1a130f
to
4d1cb50
Compare
ef6ca80
to
bca9bd4
Compare
There is a minor problem I don't know how to best fix. I need to select a path at the sites for the workers to write their partition, I am currently choosing to create a (most likely) unique filename and write into the LOCAL_TEMP_DIR (defined by the configuration). This usually works, but the federated python testcases have it set to The problems mostly impact our testcases, but I also don't really have a favorite from a user side.
The other aspects are finished and this PR is ready for review. |
There is a minor problem I don't know how to best fix. I need to select a path at the sites for the workers to write their partition, I am currently choosing to create a (most likely) unique filename and write into the LOCAL_TEMP_DIR (defined by the configuration). This usually works, but the federated python testcases have it set to The problems mostly impact our testcases, but I also don't really have a favorite from a user side.
The other aspects are finished and this PR is ready for review. |
hmm, good question what the best option is. I find the last option with a new configuration tempting. |
This PR also change the write(M) (if M is federated) to not collect the matrix and write it locally. |
Hi @kev-inn , this PR seems to be ready. about your question:
I guess @Baunsgaard is in favor of your suggestion:
shall we wrap this up? Regards |
I will need some time to refresh my memory and get it in a merge ready state next month. |
sure thing, @kev-inn - thanks |
Perform federated write at site of workers and locally write a MTD file containing the addresses. Frames work too, but testcases are still missing.