SAC
- class maze.train.trainers.sac.sac_trainer.SAC(algorithm_config: SACAlgorithmConfig, learner_model: TorchActorCritic, distributed_actors: BaseDistributedWorkersWithBuffer, model_selection: BestModelSelection | None, evaluator: RolloutEvaluator | None)
Multi-step soft actor-critic (SAC) trainer.
- Parameters:
algorithm_config – Algorithm options.
learner_model – Structured Torch actor-critic model to train.
distributed_actors – Distributed actors for collection of training rollouts.
model_selection – Optional model selection class that receives model evaluation results.
evaluator – Optional rollout evaluator to use.
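
To make the "multi-step" part of the description concrete, the sketch below computes one common form of an n-step soft Bellman target in plain Python. It is an illustration only and does not reproduce Maze's implementation: the function name `n_step_soft_target`, its arguments, and the default values for `gamma` and `alpha` are all hypothetical, and the exact target that SAC uses in this class may differ (for example in how entropy terms at intermediate steps are handled).

```python
from typing import Sequence


def n_step_soft_target(
    rewards: Sequence[float],
    q1_next: float,
    q2_next: float,
    log_prob_next: float,
    gamma: float = 0.99,
    alpha: float = 0.2,
    done: bool = False,
) -> float:
    """Illustrative n-step soft Bellman target (hypothetical helper, not Maze's API)."""
    # Discounted sum of the n observed rewards r_t ... r_{t+n-1}.
    n_step_return = sum(gamma ** k * r for k, r in enumerate(rewards))
    if done:
        # The episode ended within the n steps, so there is nothing to bootstrap from.
        return n_step_return
    # Soft state value at s_{t+n}: the smaller of two critic estimates,
    # minus the entropy term alpha * log pi(a' | s_{t+n}).
    soft_value = min(q1_next, q2_next) - alpha * log_prob_next
    # Bootstrap with the soft value, discounted by gamma^n.
    return n_step_return + gamma ** len(rewards) * soft_value


# Two-step example with a made-up transition.
target = n_step_soft_target(
    rewards=[1.0, 1.0], q1_next=2.0, q2_next=3.0,
    log_prob_next=-1.0, gamma=0.5, alpha=0.1,
)
```

With a single reward in `rewards`, this reduces to the usual one-step SAC target; longer reward sequences propagate reward information faster at the cost of relying on data that is further off-policy.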