RewardWrapper

class maze.core.wrappers.wrapper.RewardWrapper(*args, **kwds)

A Wrapper with typing support that modifies the reward before it is passed to the agent.

get_observation_and_action_dicts(maze_state: Optional[Any], maze_action: Optional[Any], first_step_in_episode: bool) → Tuple[Optional[Dict[Union[int, str], Any]], Optional[Dict[Union[int, str], Any]]]

(overrides Wrapper)

Keep both actions and observations the same.

abstract reward(reward: Any) → Any

Reward mapping method. Subclasses implement it to transform the reward before it reaches the agent.

step(action) → Tuple[Any, Any, bool, Dict[Any, Any]]

Intercept BaseEnv.step and map the returned reward.
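
The pattern described above (an abstract reward mapping applied inside an intercepted step) can be sketched without the Maze library. The following is a minimal standalone illustration, not the Maze implementation: ToyEnv, SketchRewardWrapper and ClippedReward are hypothetical names, and the assumption that only the reward element of the step tuple is changed is inferred from the method descriptions.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Tuple


class ToyEnv:
    """Hypothetical stand-in environment: returns the action as the reward."""

    def step(self, action: float) -> Tuple[Any, Any, bool, Dict[Any, Any]]:
        observation = {"last_action": action}
        return observation, float(action), False, {}


class SketchRewardWrapper(ABC):
    """Standalone illustration of the pattern, not the Maze class itself."""

    def __init__(self, env: Any) -> None:
        self.env = env

    @abstractmethod
    def reward(self, reward: Any) -> Any:
        """Map the environment's reward to the reward the agent sees."""

    def step(self, action: Any) -> Tuple[Any, Any, bool, Dict[Any, Any]]:
        # Call the wrapped env, then pass only the reward through the mapping.
        observation, reward, done, info = self.env.step(action)
        return observation, self.reward(reward), done, info


class ClippedReward(SketchRewardWrapper):
    """Hypothetical subclass that clips rewards to the range [-1, 1]."""

    def reward(self, reward: float) -> float:
        return max(-1.0, min(1.0, reward))


# Usage: the observation, done flag and info dict pass through unchanged.
env = ClippedReward(ToyEnv())
obs, rew, done, info = env.step(3.5)
```

In this sketch the raw reward of 3.5 is clipped to 1.0, while the rest of the step tuple is returned as the wrapped environment produced it. A real subclass of RewardWrapper would only need to provide reward(), since step is already implemented by the wrapper.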