SquashedGaussianProbabilityDistribution¶
- class maze.distributions.squashed_gaussian.SquashedGaussianProbabilityDistribution(logits: torch.Tensor, action_space: gymnasium.spaces.Box, temperature: float)¶
Tanh-squashed diagonal Gaussian (Normal) Torch probability distribution.
- Parameters:
logits – the logits for both mean and standard deviation.
action_space – the underlying gym.spaces action space.
- deterministic_sample()¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface
- entropy() torch.Tensor¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface
- kl(other: TorchProbabilityDistribution) torch.Tensor¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface
- log_prob(actions: torch.Tensor) torch.Tensor¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface
- classmethod required_logits_shape(action_space: gymnasium.spaces.Space) Sequence[int]¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface
- sample()¶
(overrides
TorchProbabilityDistribution)implementation of
TorchProbabilityDistributioninterface