BasePolicyComposer

class maze.perception.models.policies.base_policy_composer.BasePolicyComposer(action_spaces_dict: Dict[str | int, gymnasium.spaces.Dict], observation_spaces_dict: Dict[str | int, gymnasium.spaces.Dict], agent_counts_dict: Dict[str | int, int], distribution_mapper: DistributionMapper)

Interface for policy (actor) network composers.

Parameters:
  • action_spaces_dict – Dict of sub-step id to action space.

  • observation_spaces_dict – Dict of sub-step id to observation space.

  • agent_counts_dict – Dict of sub-step id to the number of agents for that sub-step.

  • distribution_mapper – The distribution mapper.

abstract property policy: TorchPolicy

The policy object built by this composer.
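
The interface shape described above can be illustrated with a minimal, self-contained sketch. It does not import Maze: PolicyComposerSketch is a hypothetical stand-in that only mirrors the documented constructor arguments and the abstract policy property, and DummyComposer is an illustrative subclass that returns placeholder strings rather than a real TorchPolicy. The spaces and the distribution mapper are likewise replaced by placeholder values.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, Union

# Sub-step ids may be strings or integers, as in the documented signature.
StepKey = Union[str, int]


class PolicyComposerSketch(ABC):
    """Hypothetical stand-in mirroring the documented interface (not the Maze class)."""

    def __init__(
        self,
        action_spaces_dict: Dict[StepKey, Any],
        observation_spaces_dict: Dict[StepKey, Any],
        agent_counts_dict: Dict[StepKey, int],
        distribution_mapper: Any,
    ) -> None:
        # Store the per-sub-step configuration for use by concrete composers.
        self._action_spaces_dict = action_spaces_dict
        self._observation_spaces_dict = observation_spaces_dict
        self._agent_counts_dict = agent_counts_dict
        self._distribution_mapper = distribution_mapper

    @property
    @abstractmethod
    def policy(self) -> Any:
        """The policy object; concrete composers must provide it."""


class DummyComposer(PolicyComposerSketch):
    """Illustrative subclass: one placeholder entry per sub-step instead of real networks."""

    @property
    def policy(self) -> Dict[StepKey, str]:
        return {
            step: f"placeholder-policy-for-step-{step}"
            for step in self._action_spaces_dict
        }


# Placeholder strings stand in for gymnasium.spaces.Dict instances (assumption).
composer = DummyComposer(
    action_spaces_dict={0: "action-space-0", 1: "action-space-1"},
    observation_spaces_dict={0: "obs-space-0", 1: "obs-space-1"},
    agent_counts_dict={0: 1, 1: 1},
    distribution_mapper=None,
)
print(sorted(composer.policy))
```

Because policy is declared abstract, instantiating the base sketch directly raises a TypeError; only subclasses that implement the property can be constructed.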