UniformReplayBuffer
- class maze.train.trainers.common.replay_buffer.uniform_replay_buffer.UniformReplayBuffer(buffer_size: int, seed: int)
Replay buffer for off-policy learning that samples its contents uniformly at random.
- Parameters:
buffer_size – The maximum buffer size.
seed – The random seed used for initializing the uniform random sampling in the buffer.
- add_rollout(rollout: SpacesTrajectoryRecord | List[StructuredSpacesRecord]) → None
(overrides BaseReplayBuffer) Implementation of BaseReplayBuffer.
- add_transition(transition: StructuredSpacesRecord | SpacesTrajectoryRecord) → None
(overrides BaseReplayBuffer) Implementation of BaseReplayBuffer.
- sample_batch(n_samples: int, learner_device: str) → List[StructuredSpacesRecord | SpacesTrajectoryRecord]
(overrides BaseReplayBuffer) Implementation of BaseReplayBuffer.
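The interface above can be illustrated with a minimal, self-contained sketch of a bounded, seeded uniform replay buffer. This is not the Maze implementation: the class name `SketchUniformReplayBuffer` is hypothetical, transitions are plain Python objects rather than `StructuredSpacesRecord` instances, the `learner_device` argument is omitted, and both the oldest-first eviction once the buffer is full and sampling with replacement are assumptions made for illustration.

```python
import random
from typing import Any, List


class SketchUniformReplayBuffer:
    """Illustrative stand-in for a uniform replay buffer (hypothetical, not the Maze class)."""

    def __init__(self, buffer_size: int, seed: int) -> None:
        self.buffer_size = buffer_size
        self._storage: List[Any] = []
        # Slot that is overwritten next once the buffer is full
        # (assumption: the oldest entry is evicted first).
        self._next_index = 0
        # A dedicated, seeded generator keeps sampling reproducible.
        self._rng = random.Random(seed)

    def __len__(self) -> int:
        return len(self._storage)

    def add_transition(self, transition: Any) -> None:
        """Store a single transition, overwriting the oldest one if full."""
        if len(self._storage) < self.buffer_size:
            self._storage.append(transition)
        else:
            self._storage[self._next_index] = transition
        self._next_index = (self._next_index + 1) % self.buffer_size

    def add_rollout(self, rollout: List[Any]) -> None:
        """Store every transition of a rollout, in order."""
        for transition in rollout:
            self.add_transition(transition)

    def sample_batch(self, n_samples: int) -> List[Any]:
        """Draw n_samples stored transitions uniformly (assumption: with replacement)."""
        if not self._storage:
            raise ValueError("Cannot sample from an empty buffer.")
        return [self._rng.choice(self._storage) for _ in range(n_samples)]


# Usage: integers stand in for transition records.
buffer = SketchUniformReplayBuffer(buffer_size=3, seed=1234)
buffer.add_rollout([0, 1, 2, 3, 4])       # only the three newest entries remain
batch = buffer.sample_batch(n_samples=4)  # may contain repeats
```

In this sketch, two buffers built with the same seed and filled with the same data return the same batch, which is the kind of reproducibility a `seed` parameter is typically meant to provide.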