DummyCartPolePolicy
class maze.core.agent.dummy_cartpole_policy.DummyCartPolePolicy

Dummy structured policy for the CartPole env.
Useful mainly for showcasing the config scheme and for testing.

compute_action(observation: Dict[str, numpy.ndarray], maze_state: Optional[Any] = None, env: Optional[maze.core.env.base_env.BaseEnv] = None, actor_id: maze.core.env.structured_env.ActorID = None, deterministic: bool = False) → Dict[str, Union[int, numpy.ndarray]]

(overrides Policy) Sample an action.
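
The call shape of compute_action can be illustrated with a minimal, dependency-free sketch. This is not the Maze implementation: the class name `SketchCartPolePolicy`, the dictionary keys `"observation"` and `"action"`, the observation layout, and the lean-following heuristic are all assumptions made for illustration, and plain Python lists stand in for the numpy arrays used in Maze.

```python
from typing import Any, Dict, Optional, Sequence


class SketchCartPolePolicy:
    """Hypothetical stand-in mirroring the compute_action signature."""

    def compute_action(
        self,
        observation: Dict[str, Sequence[float]],
        maze_state: Optional[Any] = None,
        env: Optional[Any] = None,
        actor_id: Optional[Any] = None,
        deterministic: bool = False,
    ) -> Dict[str, int]:
        # Assumed observation layout:
        # [cart position, cart velocity, pole angle, pole angular velocity].
        pole_angle = observation["observation"][2]
        # Assumed heuristic: push the cart toward the side the pole leans to.
        # Assumed action encoding: 0 = push left, 1 = push right.
        return {"action": 1 if pole_angle > 0 else 0}


policy = SketchCartPolePolicy()
action = policy.compute_action({"observation": [0.0, 0.0, 0.05, 0.0]})
print(action)  # {'action': 1}
```

In this sketch, maze_state, env, actor_id and deterministic are accepted only to match the documented signature; the heuristic ignores them.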

compute_top_action_candidates(observation: Dict[str, numpy.ndarray], num_candidates: Optional[int], maze_state: Optional[Any], env: Optional[maze.core.env.base_env.BaseEnv], actor_id: maze.core.env.structured_env.ActorID = None) → Tuple[Sequence[Dict[str, Union[int, numpy.ndarray]]], Sequence[float]]

(overrides Policy) Implementation of the Policy interface.
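
The return type above pairs a sequence of candidate actions with a sequence of scores of the same length. The sketch below shows one way such a pair could be built for a discrete action space. It does not describe how DummyCartPolePolicy behaves: the helper `top_action_candidates`, its `action_scores` input, the `"action"` key, and the reading of `num_candidates=None` as "return all candidates" are hypothetical.

```python
from typing import Dict, Optional, Sequence, Tuple


def top_action_candidates(
    action_scores: Dict[int, float],
    num_candidates: Optional[int],
) -> Tuple[Sequence[Dict[str, int]], Sequence[float]]:
    """Hypothetical helper: rank discrete actions by score, best first."""
    ranked = sorted(action_scores.items(), key=lambda item: item[1], reverse=True)
    # Assumption: None means "return every candidate".
    if num_candidates is not None:
        ranked = ranked[:num_candidates]
    # Two parallel sequences: action dicts and their scores.
    actions = [{"action": action_id} for action_id, _ in ranked]
    scores = [score for _, score in ranked]
    return actions, scores


candidates, scores = top_action_candidates({0: 0.3, 1: 0.7}, num_candidates=1)
print(candidates, scores)  # [{'action': 1}] [0.7]
```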