TPU training and WandB implementation causes "AssertionError: can only test a child process" #6639
Unanswered
adamDhalla
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
-
@adamDhalla is this still an issue for you? I think wandb support for TPUs should be better in the latest version |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi all,
I'm not raising this issue as a bug since it's already been raised here, and was seemingly solved.
I have pretty much the exact problem - I'm trying to implement weights and biases with a TPU and it keeps showing "AssertionError: can only test a child process". The person who raised the old error also replicated the issue in the boring model here.
The thing is he never explains how he solves the issue - he explains the issue itself, but doesn't say how he solves it. I'm wondering where I should be initializing the wandb object:
Right now, I'm initializing both of those lines above the trainer. The trainer requires wandb_logger be passed through, so I can't do the wandb_logger = WandbLogger... line any higher up.
The whole problem seems to come about as a problem with distributed processing and WandB.
Thanks,
Adam
Beta Was this translation helpful? Give feedback.
All reactions