StateCriticOutput

class maze.core.agent.state_critic_input_output.StateCriticOutput

Critic output holds the output of a critic for one full flat env step.

Individual CriticStepOutputs are stored in a list which can be (if needed) referenced to the corresponding ActorID.

property actor_ids: List[ActorID]

List of actor IDs for the individual sub-steps.

append(value: StateCriticStepOutput) None

Set self[key] to value.

property detached_values: List[torch.Tensor]

List of detached values for the individual sub-steps

reshape(shape: Sequence[int]) None

Reshape all the elements of the critic output to the given shape

property values: List[torch.Tensor]

List of values for the individual sub-steps