maze.train.trainers.es.optimizers.sgd.SGD
Stochastic gradient descent with momentum
setup (overrides Optimizer)
Prepare the optimizer for training.
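
The idea of stochastic gradient descent with momentum, and of a setup step that prepares the optimizer before training, can be illustrated with a minimal stand-alone sketch. This is not the Maze implementation: the class name MomentumSGDSketch, the parameters step_size, momentum and num_params, and the method compute_step are hypothetical, and the exponential-moving-average form of the momentum update is an assumption, since this page does not state which variant the library uses.

```python
from typing import List


class MomentumSGDSketch:
    """Hypothetical illustration of SGD with momentum; not the Maze SGD class."""

    def __init__(self, step_size: float, momentum: float = 0.9) -> None:
        self.step_size = step_size
        self.momentum = momentum
        self.velocity: List[float] = []

    def setup(self, num_params: int) -> None:
        """Prepare the optimizer for training by zeroing the velocity buffer."""
        self.velocity = [0.0] * num_params

    def compute_step(self, gradient: List[float]) -> List[float]:
        """Return the parameter update for one gradient estimate."""
        # Assumed momentum form: exponential moving average of the gradients.
        self.velocity = [
            self.momentum * v + (1.0 - self.momentum) * g
            for v, g in zip(self.velocity, gradient)
        ]
        # Step against the smoothed gradient to minimise the objective.
        return [-self.step_size * v for v in self.velocity]


# Minimise f(x) = x^2 (gradient 2x), starting from x = 1.0.
optimizer = MomentumSGDSketch(step_size=0.1, momentum=0.9)
optimizer.setup(num_params=1)
params = [1.0]
for _ in range(500):
    grad = [2.0 * p for p in params]
    step = optimizer.compute_step(grad)
    params = [p + s for p, s in zip(params, step)]
```

In this sketch the velocity buffer smooths noisy gradient estimates across iterations, which is the usual motivation for adding momentum to plain gradient descent. Calling setup again resets the buffer, so the same optimizer object could be reused for a fresh run.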