BernoulliProbabilityDistribution

class maze.distributions.bernoulli.BernoulliProbabilityDistribution(*args, **kwds)

Bernoulli Torch probability distribution for multi-binary action spaces.

Parameters
  • logits – the action selection logits.

  • action_space – The gym action space.

  • temperature – The distribution temperature parameter.

deterministic_sample()torch.Tensor

(overrides TorchProbabilityDistribution)

implementation of ProbabilityDistribution interface

entropy()torch.Tensor

(overrides TorchProbabilityDistribution)

implementation of ProbabilityDistribution interface

kl(other: maze.distributions.torch_dist.TorchProbabilityDistribution)torch.Tensor

(overrides TorchProbabilityDistribution)

implementation of ProbabilityDistribution interface

log_prob(actions: torch.Tensor)torch.Tensor

(overrides TorchProbabilityDistribution)

implementation of ProbabilityDistribution interface

classmethod required_logits_shape(action_space: gym.spaces.MultiBinary) → Sequence[int]

(overrides TorchProbabilityDistribution)

implementation of TorchProbabilityDistribution interface