-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Exporter fails due to "AllocGRES is deprecated" fatal error #40
Comments
Interesting. At the moment I do not have a chance to test it, since we are still running with version 18.08.8. I can try to reproduce it on VMs and eventually look for a workaround but I guess the only solution (not particularly nice) would be with an if/else based on the output of |
I faced the same problem because I am running SLURM 20.11.3. The problem is at this line: https://github.com/vpenso/prometheus-slurm-exporter/blob/master/gpus.go#L41 |
This can be a thorny issue: we rely on the output provided by the Slurm command lines utilities. Whenever the developers from SchedMD change the format or drop some options, this exporter is not able to cope with. As it seem also from issue #38 , at the moment it can be guaranteed that this exporter will run on Slurm version 18.x.x. On higher version there may be problems. |
It's not that simple, the format of the output changed as well. More info is already in #38. |
The latest prometheus-slurm-exporter runs for a few seconds before terminating with a fatal error:
I'm running slurm-20.11.3-1 and a rebuild picked up the new gpus.go module. Digging into it a bit, it appears the Allocgres option to sacct is treated as fatal, which causes the Execute() routine to terminate:
The text was updated successfully, but these errors were encountered: