ImitationEvents

class maze.train.trainers.imitation.imitation_events.ImitationEvents

Event interface defining statistics emitted by the imitation learning trainers.

box_mean_abs_deviation(step_id: Union[str, int], subspace_name: str, value: int)

Mean absolute deviation for box (continuous) subspaces.
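As a rough illustration of the statistic, the sketch below averages the absolute differences between predicted and expert actions of a continuous subspace. The function name `mean_abs_deviation` and the flat-list inputs are assumptions made for this example; they are not part of the Maze API, and the trainer may compute the value differently (for instance on batched tensors).

```python
from typing import Sequence


def mean_abs_deviation(predicted: Sequence[float], target: Sequence[float]) -> float:
    """Mean absolute difference between predicted and target action values.

    Hypothetical helper for illustration only; not part of the Maze API.
    """
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    if len(predicted) == 0:
        raise ValueError("at least one value is required")
    # Average the element-wise absolute errors.
    return sum(abs(p - t) for p, t in zip(predicted, target)) / len(predicted)


# Example: absolute errors are 0.5, 0.0 and 1.0.
example_deviation = mean_abs_deviation([0.5, -1.0, 2.0], [0.0, -1.0, 1.0])
```

A value of 0 would mean the policy reproduces the expert's continuous actions exactly; larger values indicate larger average errors in the units of the action space.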

discrete_accuracy(step_id: Union[str, int], subspace_name: str, value: int)

Accuracy for discrete (categorical) subspaces.
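One common way to obtain such an accuracy is to take the highest-scoring class per sample and compare it with the expert's action. The sketch below assumes the policy outputs one list of logits per sample; the helper name `discrete_accuracy_from_logits` is hypothetical and not taken from the Maze code base.

```python
from typing import Sequence


def discrete_accuracy_from_logits(
    logits: Sequence[Sequence[float]], targets: Sequence[int]
) -> float:
    """Fraction of samples whose argmax class equals the target class.

    Hypothetical helper for illustration only; not part of the Maze API.
    """
    if len(logits) != len(targets) or len(targets) == 0:
        raise ValueError("logits and targets must be non-empty and equally long")
    correct = 0
    for row, target in zip(logits, targets):
        # Index of the largest logit is the predicted class.
        predicted = max(range(len(row)), key=row.__getitem__)
        correct += int(predicted == target)
    return correct / len(targets)


# Example: predictions are 0, 1, 0 against targets 0, 1, 1.
example_accuracy = discrete_accuracy_from_logits(
    [[2.0, 0.1], [0.3, 0.9], [1.5, 1.0]], [0, 1, 1]
)
```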

multi_binary_accuracy(step_id: Union[str, int], subspace_name: str, value: int)

Accuracy for multi-binary subspaces.
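For multi-binary subspaces, each action consists of several independent on/off flags, so accuracy can be measured per flag. The sketch below assumes one logit per flag and a decision threshold at logit 0 (a sigmoid probability of 0.5); both the threshold and the helper name `multi_binary_accuracy_from_logits` are assumptions of this example rather than documented Maze behavior.

```python
from typing import Sequence


def multi_binary_accuracy_from_logits(
    logits: Sequence[Sequence[float]], targets: Sequence[Sequence[int]]
) -> float:
    """Fraction of individual binary flags that are predicted correctly.

    Hypothetical helper for illustration only; not part of the Maze API.
    """
    correct = 0
    total = 0
    for row_logits, row_targets in zip(logits, targets):
        for logit, target in zip(row_logits, row_targets):
            # A positive logit corresponds to a sigmoid probability above 0.5.
            predicted = 1 if logit > 0.0 else 0
            correct += int(predicted == target)
            total += 1
    if total == 0:
        raise ValueError("at least one flag is required")
    return correct / total


# Example: 4 of the 6 flags match the targets.
example_mb_accuracy = multi_binary_accuracy_from_logits(
    [[1.2, -0.4, 0.3], [-2.0, 0.5, -0.1]],
    [[1, 0, 0], [0, 1, 1]],
)
```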

policy_entropy(step_id: Union[str, int], value: float)

Entropy of the step policies.
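To make the quantity concrete, the sketch below computes the Shannon entropy of a single categorical action distribution from its logits. It is a minimal illustration under the assumption of a discrete action head; the helper name `categorical_entropy` is hypothetical, and how the trainer aggregates entropy over subspaces and samples is not specified here.

```python
import math
from typing import Sequence


def categorical_entropy(logits: Sequence[float]) -> float:
    """Shannon entropy (in nats) of softmax(logits).

    Hypothetical helper for illustration only; not part of the Maze API.
    """
    # Subtract the maximum logit for a numerically stable softmax.
    max_logit = max(logits)
    exps = [math.exp(x - max_logit) for x in logits]
    normalizer = sum(exps)
    probs = [e / normalizer for e in exps]
    # Skip zero probabilities, whose contribution is 0 in the limit.
    return -sum(p * math.log(p) for p in probs if p > 0.0)


# A uniform distribution over four actions has the maximum entropy, log(4).
uniform_entropy = categorical_entropy([0.0, 0.0, 0.0, 0.0])
# A sharply peaked distribution has entropy close to zero.
peaked_entropy = categorical_entropy([100.0, 0.0])
```

High entropy indicates a policy that spreads probability over many actions; entropy near zero indicates a nearly deterministic policy.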

policy_grad_norm(step_id: Union[str, int], value: float)

Gradient norm of the step policies.
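A gradient norm is typically reported as a single global L2 norm over all parameter gradients. The sketch below shows that computation on plain Python lists standing in for flattened gradient tensors; the helper name `global_l2_norm` is hypothetical, and whether the trainer measures the norm before or after any gradient clipping is not stated here.

```python
import math
from typing import Iterable, Sequence


def global_l2_norm(tensors: Iterable[Sequence[float]]) -> float:
    """L2 norm over the concatenation of all given flattened tensors.

    Hypothetical helper for illustration only; not part of the Maze API.
    """
    # Sum the squares of every element across all tensors, then take the root.
    return math.sqrt(sum(v * v for tensor in tensors for v in tensor))


# Example: two one-element gradients 3.0 and 4.0 give a global norm of 5.0.
example_grad_norm = global_l2_norm([[3.0], [4.0]])
```

Tracking this value over training can help spot exploding or vanishing gradients.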

policy_l2_norm(step_id: Union[str, int], value: float)

L2 norm of the step policies' parameters.

policy_loss(step_id: Union[str, int], value: float)

Optimization loss of the step policy.