SequentialVectorEnv¶

class maze.train.parallelization.vector_env.sequential_vector_env.SequentialVectorEnv(env_factories: List[Callable[], maze.core.env.maze_env.MazeEnv]], logging_prefix: Optional[str] = None)¶

Creates a simple wrapper for multiple environments, calling each environment in sequence on the current Python process. This is useful for computationally simple environment such as cartpole-v1, as the overhead of multiprocess or multi-thread outweighs the environment computation time. This can also be used for RL methods that require a vectorized environment, but that you want a single environments to train with.

Parameters: env_factories – A list of functions that will create the environments

close() → None ¶: VectorEnv implementation

get_actor_rewards() → Optional[numpy.ndarray]¶

(overrides StructuredVectorEnv)

Stack actor rewards from encapsulated environments.

reset() → Dict[str, numpy.ndarray]¶: VectorEnv implementation

seed(seeds: List[Any]) → None ¶

(overrides VectorEnv)

VectorEnv implementation

step(actions: Dict[str, Union[int, numpy.ndarray]]) → Tuple[Dict[str, numpy.ndarray], numpy.ndarray, numpy.ndarray, Iterable[Dict[Any, Any]]]¶

Step the environments with the given actions.

Parameters: actions – the list of actions for the respective envs.
Returns: observations, rewards, dones, information-dicts all in env-aggregated form.