IMPALA¶
- class maze.train.trainers.impala.impala_trainer.IMPALA(algorithm_config: ImpalaAlgorithmConfig, rollout_generator: DistributedActors, evaluator: RolloutEvaluator | None, model: TorchActorCritic, model_selection: BestModelSelection | None)¶
Multi step advantage actor critic.