StateCriticInput

class maze.core.agent.state_critic_input_output.StateCriticInput

State Critic output defined as it’s own type, since it has to be explicitly build to be compatible with shared embedding networks.

property actor_ids: List[ActorID]

List of actor IDs for the individual sub-steps.

append(item: StateCriticStepInput) None

Append an CriticStepInput object to the Critic input internal list.

Parameters:

item – The item to add.

classmethod build(policy_output: PolicyOutput, record: StructuredSpacesRecord) StateCriticInput

Build the critic input from the policy outputs and the spaces record (policy input).

This method is responsible for building a List that hold the appropriate input for each critic w.r.t. the substep and the shared-embedding-keys.

Parameters:
  • policy_output – The full policy output.

  • record – The structured spaces record used to compute the policy output.

Returns:

A Critic input.

property substep_inputs: List[StateCriticStepInput]

List of CriticStepInputs for the individual sub-steps.

property tensor_dict: List[Dict[str, torch.Tensor]]

List of tensor dicts for the individual sub-steps.