A2C¶
-
class
maze.train.trainers.a2c.a2c_trainer.
A2C
(algorithm_config: Union[maze.train.trainers.a2c.a2c_algorithm_config.A2CAlgorithmConfig, maze.train.trainers.ppo.ppo_algorithm_config.PPOAlgorithmConfig, maze.train.trainers.impala.impala_algorithm_config.ImpalaAlgorithmConfig], rollout_generator: Union[maze.core.rollout.rollout_generator.RolloutGenerator, maze.train.parallelization.distributed_actors.distributed_actors.DistributedActors], evaluator: Optional[maze.train.trainers.common.evaluators.rollout_evaluator.RolloutEvaluator], model: maze.core.agent.torch_actor_critic.TorchActorCritic, model_selection: Optional[maze.train.trainers.common.model_selection.best_model_selection.BestModelSelection])¶ Advantage Actor Critic. Suitable for multi-step and multi-agent scenarios.