SpacesRecord¶

class maze.core.trajectory_recording.records.spaces_record.SpacesRecord(actor_id: maze.core.env.structured_env.ActorID, observation: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None, action: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None, reward: Optional[Union[float, numpy.ndarray, torch.Tensor]] = None, done: Optional[Union[bool, numpy.ndarray, torch.Tensor]] = None, info: Optional[Dict] = None, next_observation: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None, logits: Optional[Dict[str, numpy.ndarray]] = None, discounted_return: Optional[Union[float, numpy.ndarray]] = None, batch_shape: Optional[List[int]] = None)¶

Record of spaces (i.e., raw action, observation, and associated data) from a single sub-step.

action: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None¶: Action recorded during the step.

actor_id: maze.core.env.structured_env.ActorID¶: ID of the actor for this step.

property agent_id¶: Sub-step key (i.e., the second part of the Actor ID) for this step.

batch_shape: Optional[List[int]] = None¶: If the record is batched, this is the shape of the batch.

discounted_return: Optional[Union[float, numpy.ndarray]] = None¶: Discounted return for this step.

done: Optional[Union[bool, numpy.ndarray, torch.Tensor]] = None¶: Done flag recorded during the step.

info: Optional[Dict] = None¶: Info dictionary recorded during the step.

logits: Optional[Dict[str, numpy.ndarray]] = None¶: Action logits recorded during the step.

next_observation: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None¶: Observation obtained after this step (i.e., results of the action taken in this step).

observation: Optional[Dict[str, Union[numpy.ndarray, torch.Tensor]]] = None¶: Observation recorded during the step.

reward: Optional[Union[float, numpy.ndarray, torch.Tensor]] = None¶: Reward recorded during the step.

classmethod stack(records: List[SpacesRecord]) → maze.core.trajectory_recording.records.spaces_record.SpacesRecord ¶

Stack multiple records into a single spaces record. Useful for processing multiple records in a batch.

All the records should be in numpy and have the same structure of the spaces (i.e. come from the same environment etc.).

Parameters: records – Records to stack.
Returns: Single stacked record, containing all the given records, and having the corresponding batch shape.

property substep_key¶: Sub-step key (i.e., the first part of the Actor ID) for this step.

to_numpy() → maze.core.trajectory_recording.records.spaces_record.SpacesRecord ¶: Convert the record to numpy.

to_torch(device: str) → maze.core.trajectory_recording.records.spaces_record.SpacesRecord ¶

Convert the record to Torch.

Parameters: device – Device to move the tensors to.

Read the Docs v: stable

Versions: latest; stable

Downloads: html; epub

On Read the Docs: Project Home; Builds