* Add UnrollPerformer as the module wrapped by DistributedDataParallel
* Enable DDP for the on-policy RLTrainer