DummyCartPolePolicy¶
- class maze.core.agent.dummy_cartpole_policy.DummyCartPolePolicy¶
Dummy structured policy for the CartPole env.
Useful mainly for showcase of the config scheme and for testing.
- compute_action(observation: Dict[str, numpy.ndarray], maze_state: Any | None = None, env: BaseEnv | None = None, actor_id: ActorID | None = None, deterministic: bool = False) Dict[str, int | numpy.ndarray]¶
(overrides
Policy)Sample an action.
- compute_top_action_candidates(observation: Dict[str, numpy.ndarray], num_candidates: int | None, maze_state: Any | None, env: BaseEnv | None, actor_id: ActorID | None = None) Tuple[Sequence[Dict[str, int | numpy.ndarray]], Sequence[float]]¶
(overrides
Policy)implementation of
Policyinterface