I have a script like this
trainer = Trainer(distributed_backend="ddp", gpus=2, ...) model = Model(...) trainer.fit(model) trainer.test(model)
and when I launch it, it hangs after fit, never reaching test, or it errors with a message
“Address already in use”
What is the problem?
Question/Problem from here