[distributeddataparallel](https://pytorch.org/docs/stable/nn.html#distributeddataparallel) claims better performance than [dataparallel](https://pytorch.org/docs/stable/nn.html#dataparallel)