ActionCandidates

class maze.core.agent_deployment.maze_action_candidates.ActionCandidates(candidates_and_probabilities: Tuple[Sequence[Any], Sequence[float]])

Action object for encapsulation of multiple action objects along with their respective probabilities. Useful when getting multiple candidate actions from a policy.

Parameters

candidates_and_probabilities – a tuple of sequences, where the first sequence corresponds to the possible actions, the other sequence to the associated probabilities