BetaProbabilityDistribution

class maze.distributions.beta.BetaProbabilityDistribution(logits: torch.Tensor, action_space: gymnasium.spaces.Box, temperature: float)

Beta Torch probability distribution.

Parameters:
  • logits – the logits for both mean and standard deviation.

  • action_space – the underlying gymnasium.spaces.Box action space.
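The general idea of a Beta distribution over a bounded Box action can be sketched in plain Python. This is an illustrative sketch under stated assumptions, not Maze's implementation: the helper names (softplus, concentrations, rescale, sample_action, mean_action), the softplus-plus-one mapping from logits to concentration parameters, and the use of the mean as a deterministic action are all hypothetical choices made for the example.

```python
import math
import random
from typing import Tuple


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def concentrations(logit_a: float, logit_b: float) -> Tuple[float, float]:
    # Assumption: softplus(.) + 1 keeps both parameters above 1,
    # which makes the Beta density unimodal on (0, 1).
    return softplus(logit_a) + 1.0, softplus(logit_b) + 1.0


def rescale(x: float, low: float, high: float) -> float:
    """Map a value from the unit interval to [low, high]."""
    return low + (high - low) * x


def sample_action(alpha: float, beta: float, low: float, high: float,
                  rng: random.Random) -> float:
    """Draw from Beta(alpha, beta) and rescale to the Box bounds."""
    return rescale(rng.betavariate(alpha, beta), low, high)


def mean_action(alpha: float, beta: float, low: float, high: float) -> float:
    """One possible deterministic action: the rescaled distribution mean."""
    return rescale(alpha / (alpha + beta), low, high)


if __name__ == "__main__":
    rng = random.Random(0)
    a, b = concentrations(0.0, 0.0)
    print(sample_action(a, b, -1.0, 1.0, rng))
    print(mean_action(a, b, -1.0, 1.0))
```

Because the Beta distribution has bounded support, every rescaled sample stays inside the Box bounds without clipping, which is the usual motivation for preferring it over a Gaussian on bounded action spaces.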

deterministic_sample() → torch.Tensor

(overrides TorchProbabilityDistribution)

Returns a deterministic action instead of a random draw. Implements the TorchProbabilityDistribution interface.

entropy() → torch.Tensor

(overrides TorchProbabilityDistribution)

Returns the entropy of the distribution. Implements the TorchProbabilityDistribution interface.

kl(other: TorchProbabilityDistribution) → torch.Tensor

(overrides TorchProbabilityDistribution)

Computes the KL divergence between this distribution and other. Implements the TorchProbabilityDistribution interface.

log_prob(actions: torch.Tensor) → torch.Tensor

(overrides TorchProbabilityDistribution)

Returns the log-probability of the given actions. Implements the TorchProbabilityDistribution interface.
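For intuition, the log-density of a Beta distribution can be written with log-gamma functions. The sketch below uses only the Python standard library; the function names beta_log_prob and box_log_prob are hypothetical, and whether Maze applies the change-of-variables term for the Box bounds is not stated here, so treat that part as an assumption.

```python
import math


def beta_log_prob(x: float, alpha: float, beta: float) -> float:
    """Log-density of Beta(alpha, beta) at x, for x strictly inside (0, 1)."""
    # log B(alpha, beta) = lgamma(alpha) + lgamma(beta) - lgamma(alpha + beta)
    log_norm = math.lgamma(alpha) + math.lgamma(beta) - math.lgamma(alpha + beta)
    return (alpha - 1.0) * math.log(x) + (beta - 1.0) * math.log(1.0 - x) - log_norm


def box_log_prob(action: float, low: float, high: float,
                 alpha: float, beta: float) -> float:
    """Log-density of an action in (low, high) under a rescaled Beta.

    Assumption: the action is mapped linearly to the unit interval, so the
    density picks up a -log(high - low) change-of-variables term.
    """
    x = (action - low) / (high - low)
    return beta_log_prob(x, alpha, beta) - math.log(high - low)
```

As a sanity check, Beta(1, 1) is the uniform distribution on the unit interval, so beta_log_prob returns 0 for any x in (0, 1) with alpha = beta = 1. Actions exactly on a bound map to x = 0 or x = 1, where math.log raises an error, so a real implementation would need to clamp such inputs.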

classmethod required_logits_shape(action_space: gymnasium.spaces.Space) → Sequence[int]

(overrides TorchProbabilityDistribution)

Returns the logits shape required to parameterize this distribution for the given action space. Implements the TorchProbabilityDistribution interface.

sample() → torch.Tensor

(overrides TorchProbabilityDistribution)

Draws a random sample from the distribution. Implements the TorchProbabilityDistribution interface.