DictProbabilityDistribution
-
class maze.distributions.dict.DictProbabilityDistribution(distribution_dict: Dict[str, maze.distributions.distribution.ProbabilityDistribution])
Dictionary probability distribution.
- Each method either returns its property per key (one entry for each sub-distribution) or aggregates it across the sub-distributions using a reduce_fun such as mean or sum.
- Parameters
distribution_dict – Dictionary mapping each key to its sub-probability distribution.
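The difference between per-key and aggregated properties can be illustrated with a dependency-free sketch. This is not Maze's implementation: `ToyCategorical` and `ToyDictDistribution` are hypothetical stand-ins, plain floats replace `torch.Tensor`, and `statistics.mean` plays the role of a `reduce_fun` such as `torch.mean`.

```python
import math
from statistics import mean
from typing import Callable, Dict, List


class ToyCategorical:
    """Hypothetical stand-in for a sub-distribution (not a Maze class)."""

    def __init__(self, probs: List[float]) -> None:
        self.probs = probs

    def entropy(self) -> float:
        # Shannon entropy in nats; zero-probability outcomes contribute nothing.
        return -sum(p * math.log(p) for p in self.probs if p > 0.0)


class ToyDictDistribution:
    """Hypothetical stand-in mirroring the dictionary-of-distributions idea."""

    def __init__(self, distribution_dict: Dict[str, ToyCategorical]) -> None:
        self.distribution_dict = distribution_dict

    def entropy_per_key(self) -> Dict[str, float]:
        # Per-key property: one value for each sub-distribution.
        return {key: dist.entropy() for key, dist in self.distribution_dict.items()}

    def entropy(self, reduce_fun: Callable[[List[float]], float] = mean) -> float:
        # Aggregated property: reduce the per-key values to a single number.
        return reduce_fun(list(self.entropy_per_key().values()))


dist = ToyDictDistribution({
    "move": ToyCategorical([0.5, 0.5]),
    "jump": ToyCategorical([1.0, 0.0]),
})
per_key = dist.entropy_per_key()            # one entropy per key
mean_entropy = dist.entropy()               # averaged across keys
sum_entropy = dist.entropy(reduce_fun=sum)  # summed across keys
```

With two keys, the mean is half of the sum. Which reduction is appropriate depends on whether the aggregated value should scale with the number of sub-distributions.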
-
deterministic_sample() → Dict[str, torch.Tensor]
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
-
entropy(reduce_fun: callable = torch.mean) → torch.Tensor
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
-
kl(other: maze.distributions.dict.DictProbabilityDistribution, reduce_fun: callable = torch.mean) → torch.Tensor
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
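As a rough illustration of how a dictionary-level KL divergence can be reduced across keys, the sketch below computes the per-key KL between categorical distributions and then applies a `reduce_fun`. It assumes that both dictionaries share the same keys. `categorical_kl` and `dict_kl` are hypothetical helpers, with plain Python lists standing in for tensors, and are not Maze's code.

```python
import math
from statistics import mean
from typing import Callable, Dict, List


def categorical_kl(p: List[float], q: List[float]) -> float:
    """KL(p || q) in nats for two categorical distributions over the same outcomes."""
    # Terms with p_i == 0 contribute nothing; q_i must be > 0 wherever p_i > 0.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def dict_kl(
    p_dict: Dict[str, List[float]],
    q_dict: Dict[str, List[float]],
    reduce_fun: Callable[[List[float]], float] = mean,
) -> float:
    # Assumption: both dictionaries hold exactly the same keys.
    per_key = [categorical_kl(p_dict[key], q_dict[key]) for key in p_dict]
    return reduce_fun(per_key)


p = {"move": [0.5, 0.5], "jump": [0.9, 0.1]}
q = {"move": [0.5, 0.5], "jump": [0.5, 0.5]}
kl_mean = dict_kl(p, q)                  # averaged across keys
kl_sum = dict_kl(p, q, reduce_fun=sum)   # summed across keys
kl_self = dict_kl(p, p)                  # a distribution compared with itself
```

Only the `"jump"` key differs between `p` and `q`, so it alone contributes to the divergence; comparing `p` with itself gives zero.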
-
log_prob(actions: Dict[str, torch.Tensor]) → Dict[str, torch.Tensor]
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
-
neg_log_prob(actions: Dict[str, torch.Tensor]) → Dict[str, torch.Tensor]
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
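Unlike entropy and kl, the signatures of log_prob and neg_log_prob return one value per key rather than a reduced tensor. A minimal sketch of that per-key behaviour follows. It assumes categorical sub-distributions and that the negative log-probability is simply the negated log-probability (an assumption based on the method name, not confirmed by this page). The helper functions are hypothetical, and integers and floats stand in for tensors.

```python
import math
from typing import Dict, List


def log_prob(probs_dict: Dict[str, List[float]], actions: Dict[str, int]) -> Dict[str, float]:
    # One log-probability per key, looked up at that key's chosen action index.
    return {key: math.log(probs_dict[key][action]) for key, action in actions.items()}


def neg_log_prob(probs_dict: Dict[str, List[float]], actions: Dict[str, int]) -> Dict[str, float]:
    # Assumption: the negative log-probability is the negated per-key log-probability.
    return {key: -value for key, value in log_prob(probs_dict, actions).items()}


probs = {"move": [0.25, 0.75], "jump": [0.5, 0.5]}
actions = {"move": 1, "jump": 0}
lp = log_prob(probs, actions)       # keys match the keys of `actions`
nlp = neg_log_prob(probs, actions)  # same keys, opposite sign
```

Keeping the result keyed by action name lets the caller decide how to combine the entries, for example by summing them into a joint log-probability.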
-
sample() → Dict[str, torch.Tensor]
(overrides ProbabilityDistribution) Implementation of the TorchProbabilityDistribution interface.
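Both sample and deterministic_sample return one value per key. The sketch below shows what that could look like for categorical sub-distributions. It assumes that a deterministic sample means picking the most probable action for each key, which this page does not state. The function names mirror the methods above but are hypothetical free functions, not Maze's API, and integers stand in for tensors.

```python
import random
from typing import Dict, List


def sample(probs_dict: Dict[str, List[float]], rng: random.Random) -> Dict[str, int]:
    # Draw one action index per key according to that key's probabilities.
    return {
        key: rng.choices(range(len(probs)), weights=probs, k=1)[0]
        for key, probs in probs_dict.items()
    }


def deterministic_sample(probs_dict: Dict[str, List[float]]) -> Dict[str, int]:
    # Assumption: "deterministic" means taking the most probable action per key.
    return {
        key: max(range(len(probs)), key=probs.__getitem__)
        for key, probs in probs_dict.items()
    }


probs = {"move": [0.2, 0.8], "jump": [0.6, 0.4]}
rng = random.Random(0)  # seeded only to make the sketch repeatable
drawn = sample(probs, rng)
greedy = deterministic_sample(probs)
```

Under this assumption, `greedy` is the same on every call, while `drawn` varies with the state of the random generator.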