StateCritic

class maze.core.agent.state_critic.StateCritic

Structured state critic class designed to work with structured environments. (see StructuredEnv).

It encapsulates state critic and queries them for values according to the provided policy ID.

abstract predict_value(observation: Dict[str, numpy.ndarray], critic_id: Union[int, str])maze.core.agent.state_critic_input_output.StateCriticStepOutput

Query a critic that corresponds to the given ID for the state value.

Parameters
  • observation – Current observation of the environment

  • critic_id – The critic id to query

Returns

The value for this observation

abstract predict_values(critic_input: maze.core.agent.state_critic_input_output.StateCriticInput)maze.core.agent.state_critic_input_output.StateCriticOutput

Query a critic that corresponds to the given ID for the state value.

Parameters

critic_input – The critic input for predicting the values of all sub-steps of the env.

Returns

Critic output holding the values, detached values and actor_id