RewardClippingWrapper

class maze.core.wrappers.reward_clipping_wrapper.RewardClippingWrapper(*args, **kwds)

Clips original step reward to range [min, max].

Parameters
  • env – The underlying environment.

  • min_val – Minimum allowed reward value.

  • max_val – Maximum allowed reward value.

clone_from(env: maze.core.wrappers.reward_clipping_wrapper.RewardClippingWrapper)None

(overrides SimulatedEnvMixin)

implementation of SimulatedEnvMixin.

reward(reward: float)float

(overrides RewardWrapper)

Clips the original reward.

param reward

The original reward.

return

The clipped reward.