SACDevRunner¶
-
class
maze.train.trainers.sac.sac_runners.
SACDevRunner
(state_dict_dump_file: str, dump_interval: Optional[int], spaces_config_dump_file: str, normalization_samples: int, eval_concurrency: int, initial_demonstration_trajectories: omegaconf.DictConfig)¶ Runner for single-threaded training, based on SequentialVectorEnv.
-
create_distributed_eval_env
(env_factory: Callable[], maze.core.env.maze_env.MazeEnv], eval_concurrency: int, logging_prefix: str) → maze.train.parallelization.vector_env.sequential_vector_env.SequentialVectorEnv¶ (overrides
SACRunner
)create single-threaded env distribution
-
create_distributed_rollout_workers
(env_factory: Callable[], maze.core.env.maze_env.MazeEnv], worker_policy: maze.core.agent.torch_policy.TorchPolicy, n_rollout_steps: int, n_workers: int, batch_size: int, rollouts_per_iteration: int, split_rollouts_into_transitions: bool, env_instance_seeds: List[int], replay_buffer: maze.train.trainers.common.replay_buffer.replay_buffer.BaseReplayBuffer) → maze.train.parallelization.distributed_actors.dummy_distributed_workers_with_buffer.DummyDistributedWorkersWithBuffer¶ (overrides
SACRunner
)Create dummy (sequentially-executed) actors.
-