StateCriticOutput

class maze.core.agent.state_critic_input_output.StateCriticOutput

Critic output holds the output of a critic for one full flat env step.

The individual StateCriticStepOutput objects are stored in a list, and each one can be matched to its corresponding ActorID if needed.

property actor_ids

List of actor IDs for the individual sub-steps.

append(value: maze.core.agent.state_critic_input_output.StateCriticStepOutput) → None

Append the critic output of a single sub-step to the list.

property detached_values

List of detached values for the individual sub-steps.

reshape(shape: Sequence[int]) → None

Reshape all the elements of the critic output to the given shape.

property values

List of values for the individual sub-steps.
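
The container pattern described on this page can be illustrated with a minimal sketch. It does not import Maze: `SketchStepOutput` and `SketchCriticOutput` are hypothetical stand-ins written for this example, values are plain floats rather than tensors, and the `(step_key, agent_id)` tuple used for the actor ID is an assumption. The sketch only shows how per-sub-step outputs are appended and how the index-aligned `actor_ids` and `values` lists can be paired up again.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SketchStepOutput:
    """Hypothetical stand-in for StateCriticStepOutput (not the Maze class)."""
    actor_id: Tuple[str, int]  # assumed (step_key, agent_id) pair
    values: List[float]        # plain floats instead of tensors


class SketchCriticOutput:
    """Hypothetical stand-in for StateCriticOutput: one entry per sub-step."""

    def __init__(self) -> None:
        self._step_outputs: List[SketchStepOutput] = []

    def append(self, value: SketchStepOutput) -> None:
        # Add the output of one sub-step to the end of the internal list.
        self._step_outputs.append(value)

    @property
    def actor_ids(self) -> List[Tuple[str, int]]:
        # Actor IDs in the order the sub-steps were appended.
        return [step.actor_id for step in self._step_outputs]

    @property
    def values(self) -> List[List[float]]:
        # Values in the same order as actor_ids.
        return [step.values for step in self._step_outputs]


output = SketchCriticOutput()
output.append(SketchStepOutput(actor_id=("select", 0), values=[0.5]))
output.append(SketchStepOutput(actor_id=("place", 0), values=[1.25]))

# Both properties are index-aligned, so zip recovers the actor/value pairing.
values_by_actor = dict(zip(output.actor_ids, output.values))
```

Keeping the step outputs in a single list and deriving `actor_ids` and `values` from it on access means the two lists cannot drift out of order, which is what makes the lookup by actor ID safe.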