StateCriticOutput
- class maze.core.agent.state_critic_input_output.StateCriticOutput
Holds the output of a state critic for one full flat env step.
The individual StateCriticStepOutput objects are stored in a list; each one can, if needed, be matched to its corresponding ActorID.
- append(value: StateCriticStepOutput) → None
Append the given step output to the list of stored sub-step outputs.
- property detached_values: List[torch.Tensor]
List of detached values (tensors cut off from the computation graph) for the individual sub-steps.
- reshape(shape: Sequence[int]) → None
Reshape all elements of the critic output to the given shape.
- property values: List[torch.Tensor]
List of values for the individual sub-steps.
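The container pattern described above can be illustrated with a minimal, library-free sketch. This is not the Maze implementation: SketchStepOutput, SketchCriticOutput, ActorKey and the actor_ids helper are hypothetical stand-ins, plain float lists take the place of torch.Tensor, and a (step_key, agent_id) tuple takes the place of an ActorID. It only shows how per-sub-step outputs might be appended and then read back as parallel lists.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical stand-in for Maze's ActorID: (step_key, agent_id).
ActorKey = Tuple[str, int]


@dataclass
class SketchStepOutput:
    """Stand-in for StateCriticStepOutput: critic result for one sub-step."""
    values: List[float]           # stands in for a torch.Tensor
    detached_values: List[float]  # stands in for the detached tensor
    actor_id: ActorKey


class SketchCriticOutput:
    """Stand-in for StateCriticOutput: one entry per sub-step of a flat env step."""

    def __init__(self) -> None:
        self._steps: List[SketchStepOutput] = []

    def append(self, value: SketchStepOutput) -> None:
        # Add the output of the next sub-step to the end of the list.
        self._steps.append(value)

    @property
    def values(self) -> List[List[float]]:
        # One entry per sub-step, in insertion order.
        return [step.values for step in self._steps]

    @property
    def detached_values(self) -> List[List[float]]:
        return [step.detached_values for step in self._steps]

    @property
    def actor_ids(self) -> List[ActorKey]:
        # Hypothetical helper: lets index i of `values` be matched to its actor.
        return [step.actor_id for step in self._steps]


# Collect the critic outputs of two sub-steps of one flat env step.
out = SketchCriticOutput()
out.append(SketchStepOutput([0.5], [0.5], ("step_0", 0)))
out.append(SketchStepOutput([1.25], [1.25], ("step_1", 0)))
```

Because every property iterates the same underlying list, index i of values, detached_values and actor_ids always refers to the same sub-step in this sketch.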