A2CAlgorithmConfig¶
- class maze.train.trainers.a2c.a2c_algorithm_config.A2CAlgorithmConfig(n_epochs: int, epoch_length: int, patience: int, critic_burn_in_epochs: int, n_rollout_steps: int, lr: float, gamma: float, gae_lambda: float, policy_loss_coef: float, value_loss_coef: float, entropy_coef: float, max_grad_norm: float, device: str, rollout_evaluator: RolloutEvaluator, n_training_seeds: int)¶
Algorithm parameters for multi-step A2C model.
- n_training_seeds: int¶
Number of seeds to be generated for seeding the environment except when passing a list of explicit seeds.
- rollout_evaluator: RolloutEvaluator¶
Rollout evaluator.