How to have per GPU logging? #17474
Unanswered
RuABraun
asked this question in
DDP / multi-GPU / multi-node
You can configure per-GPU logging by setting up a separate logger for each GPU process, for example in a small helper function that creates and returns that logger.
Call the helper at the beginning of each GPU process and use the returned logger instance to log messages.
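A minimal sketch of such a helper, assuming the launcher exports the process rank in the `RANK` or `LOCAL_RANK` environment variable (torchrun does this; other launchers may differ). The names `get_rank` and `setup_rank_logger` and the `logs/rank_<N>.log` file layout are hypothetical, chosen only for illustration:

```python
import logging
import os
from pathlib import Path


def get_rank() -> int:
    """Best-effort rank lookup.

    Assumption: the launcher exports RANK or LOCAL_RANK.
    Falls back to 0 when neither variable is set.
    """
    for var in ("RANK", "LOCAL_RANK"):
        value = os.environ.get(var)
        if value is not None:
            return int(value)
    return 0


def setup_rank_logger(log_dir: str = "logs", name: str = "train") -> logging.Logger:
    """Return a logger that writes to a file specific to this process's rank."""
    rank = get_rank()
    Path(log_dir).mkdir(parents=True, exist_ok=True)

    # One logger name per rank, so handlers are never shared between ranks.
    logger = logging.getLogger(f"{name}.rank{rank}")
    logger.setLevel(logging.DEBUG)
    logger.propagate = False  # keep messages out of the root logger

    # Attach the file handler only once, even if this is called repeatedly.
    if not logger.handlers:
        handler = logging.FileHandler(Path(log_dir) / f"rank_{rank}.log")
        handler.setFormatter(
            logging.Formatter(f"%(asctime)s [rank {rank}] %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

To capture a failure on a nonzero rank, one option is to wrap the training entry point in `try`/`except`, call `logger.exception("training failed")` in the handler, and re-raise. The traceback then lands in that rank's own file instead of being hidden behind a timeout on rank 0.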
Hi guys
Is there a way to have logs per GPU? I want to understand why training fails, and at the moment the log just shows a timeout error. I know from previous experience that this likely means a GPU process with a nonzero rank hit an error, and I want to confirm that (and see what the error is).