SACDevRunner

class maze.train.trainers.sac.sac_runners.SACDevRunner(state_dict_dump_file: str, dump_interval: Optional[int], spaces_config_dump_file: str, normalization_samples: int, eval_concurrency: int, initial_demonstration_trajectories: omegaconf.DictConfig)

Runner for single-threaded training, based on SequentialVectorEnv.

create_distributed_eval_env(env_factory: Callable[], maze.core.env.maze_env.MazeEnv], eval_concurrency: int, logging_prefix: str)maze.train.parallelization.vector_env.sequential_vector_env.SequentialVectorEnv

(overrides SACRunner)

create single-threaded env distribution

create_distributed_rollout_workers(env_factory: Callable[], maze.core.env.maze_env.MazeEnv], worker_policy: maze.core.agent.torch_policy.TorchPolicy, n_rollout_steps: int, n_workers: int, batch_size: int, rollouts_per_iteration: int, split_rollouts_into_transitions: bool, env_instance_seeds: List[int], replay_buffer: maze.train.trainers.common.replay_buffer.replay_buffer.BaseReplayBuffer)maze.train.parallelization.distributed_actors.dummy_distributed_workers_with_buffer.DummyDistributedWorkersWithBuffer

(overrides SACRunner)

Create dummy (sequentially-executed) actors.