Policy and Value Networks
Concepts and Structure
Best Practices and Tutorials
Logging and Monitoring
Scaling the Training Process
Compute the bias value for a sigmoid activation function
such as in multi-binary action spaces (Bernoulli distributions).
probability – The desired selection probability.
The respective bias value.