ProbabilisticPolicyComposer¶
-
class
maze.perception.models.policies.probabilistic_policy_composer.
ProbabilisticPolicyComposer
(action_spaces_dict: Dict[Union[str, int], gym.spaces.Dict], observation_spaces_dict: Dict[Union[str, int], gym.spaces.Dict], agent_counts_dict: Dict[Union[str, int], int], distribution_mapper: maze.distributions.distribution_mapper.DistributionMapper, networks: Union[List[Union[None, Mapping[str, Any], Any]], Mapping[Union[str, Type], Union[None, Mapping[str, Any], Any]]], substeps_with_separate_agent_nets: List[Union[str, int]])¶ Composes networks for probabilistic policies.
- Parameters
action_spaces_dict – Dict of sub-step id to action space.
observation_spaces_dict – Dict of sub-step id to observation space.
distribution_mapper – The distribution mapper.
networks – Policy networks as defined in the config (either list or dictionary of object params and type).
-
property
policy
¶ implementation of
BasePolicyComposer