Policy Gradient
- class PolicyGradient(model, lr)
Bases: parl.core.paddle.algorithm.Algorithm
- __init__(model, lr)
Policy gradient algorithm
- Parameters
model (parl.Model) – model defining the forward network of the policy.
lr (float) – learning rate used for the policy update.
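To make the roles of model and lr concrete, here is a minimal sketch of a REINFORCE-style update in plain Python. It does not use PARL or Paddle and is not PARL's implementation. The classes LinearSoftmaxPolicy and TinyPolicyGradient, and the predict and learn method names, are hypothetical stand-ins assumed for illustration. Only the (model, lr) constructor signature comes from the documentation above.

```python
import math


class LinearSoftmaxPolicy:
    """Hypothetical stand-in for a policy model: logits = weights @ obs."""

    def __init__(self, obs_dim, n_actions):
        # Zero weights give a uniform initial policy.
        self.weights = [[0.0] * obs_dim for _ in range(n_actions)]

    def forward(self, obs):
        logits = [sum(w * x for w, x in zip(row, obs)) for row in self.weights]
        peak = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(z - peak) for z in logits]
        total = sum(exps)
        return [e / total for e in exps]


class TinyPolicyGradient:
    """Illustrative REINFORCE-style update (assumed, not PARL's code)."""

    def __init__(self, model, lr):
        self.model = model  # defines the forward pass of the policy
        self.lr = lr        # step size for gradient descent on the loss

    def predict(self, obs):
        # Action probabilities under the current policy.
        return self.model.forward(obs)

    def learn(self, obs, action, reward):
        probs = self.model.forward(obs)
        loss = -math.log(probs[action]) * reward
        # d(loss)/d(logit_k) = (pi_k - 1[k == action]) * reward
        for k, row in enumerate(self.model.weights):
            grad_logit = (probs[k] - (1.0 if k == action else 0.0)) * reward
            for j, x in enumerate(obs):
                row[j] -= self.lr * grad_logit * x
        return loss


model = LinearSoftmaxPolicy(obs_dim=2, n_actions=2)
alg = TinyPolicyGradient(model, lr=0.1)
obs = [1.0, 0.5]
before = alg.predict(obs)[0]
loss = alg.learn(obs, action=0, reward=1.0)
after = alg.predict(obs)[0]
```

In this sketch, a positive reward for action 0 raises that action's probability above its initial value of 0.5, and a larger lr would move it further in a single step.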