Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use Slurm jobs for reprocessing #270

Merged
merged 8 commits into from
Jul 1, 2024
Merged

Use Slurm jobs for reprocessing #270

merged 8 commits into from
Jul 1, 2024

Conversation

takluyver
Copy link
Member

In #256 we made the listener submit jobs to the Solaris cluster to process data, rather than running subprocesses wherever it's running. This extends that to extraction run by amore-proto reprocess as well, so restrictions on CPU & memory usage apply consistently, however the processing was launched.

Reprocessing jobs now run in parallel by default (superseding #250), limited by how quickly the solaris cluster starts jobs. A new --watch flag switches back to processing one run at a time and showing live output directly in the terminal, but still uses Slurm with the srun command.

I see this as a step towards launching reprocessing from the GUI, and having the status of processing jobs visible in the GUI. But I'm leaving those features for a later PR.

I've also refactored a bit. The extract_data module is now only the code that runs inside a Slurm job, while extraction_control is the code to submit Slurm jobs.

Copy link
Contributor

@hcraT hcraT left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is good (I have a few minor comments).

@hcraT
Copy link
Contributor

hcraT commented Jul 1, 2024

LGTM!

@takluyver
Copy link
Member Author

Thanks!

@takluyver takluyver merged commit 122a49f into master Jul 1, 2024
4 checks passed
@takluyver takluyver deleted the reprocess-slurm branch July 1, 2024 16:04
@takluyver
Copy link
Member Author

I am deploying this to beta. I'm in the midst of processing a whole lot of runs for some old proposals, this should be a big help.

@takluyver
Copy link
Member Author

This is pretty nice so far - I can process a whole bunch of runs in parallel.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants