ESAlgorithmConfig
- class maze.train.trainers.es.es_algorithm_config.ESAlgorithmConfig(n_epochs: int, n_rollouts_per_update: int, n_timesteps_per_update: int, max_steps: int, optimizer: Any, l2_penalty: float, noise_stddev: float, policy_wrapper: Policy | None)
Parameters for the evolution strategies (ES) training algorithm. Note: set n_epochs to 0 to train indefinitely.
- max_steps: int
Maximum number of steps per episode rollout. Set to 0 to disable the limit.
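The two "0 means no limit" conventions described above (n_epochs and max_steps) can be illustrated with a minimal sketch. It does not import Maze: ESConfigSketch is a hypothetical stand-in that only mirrors the documented field names, the helpers epoch_iterator and step_limit_reached are made up for illustration, and all field values are placeholders rather than recommended settings. How the real trainer consumes these fields may differ.

```python
import itertools
from dataclasses import dataclass
from typing import Any, Iterator, Optional


@dataclass
class ESConfigSketch:
    """Hypothetical stand-in mirroring the documented fields; not the Maze class."""
    n_epochs: int                # 0 = train indefinitely (per the class note)
    n_rollouts_per_update: int
    n_timesteps_per_update: int
    max_steps: int               # 0 = no per-episode step limit
    optimizer: Any
    l2_penalty: float
    noise_stddev: float
    policy_wrapper: Optional[Any] = None


def epoch_iterator(n_epochs: int) -> Iterator[int]:
    """Yield epoch indices; an unbounded counter when n_epochs is 0."""
    return itertools.count() if n_epochs == 0 else iter(range(n_epochs))


def step_limit_reached(step_count: int, max_steps: int) -> bool:
    """Return True once a rollout should be cut off; never True when max_steps is 0."""
    return max_steps > 0 and step_count >= max_steps


# Placeholder values, chosen only to make the sketch runnable.
config = ESConfigSketch(
    n_epochs=0,
    n_rollouts_per_update=10,
    n_timesteps_per_update=0,
    max_steps=200,
    optimizer=None,              # placeholder; a real optimizer config would go here
    l2_penalty=0.005,
    noise_stddev=0.02,
)

# Take the first three epochs of an otherwise endless loop.
first_epochs = list(itertools.islice(epoch_iterator(config.n_epochs), 3))
```

With n_epochs=0 the iterator never terminates on its own, so a training loop built this way would need an external stop condition such as a manual interrupt or a time budget.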