Policy Gradient
- class PolicyGradient(model, lr)
Bases: parl.core.paddle.algorithm.Algorithm
- __init__(model, lr)
Policy gradient algorithm
- Parameters
model (parl.Model) – model defining forward network of policy.
lr (float) – learning rate.
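The entry above does not spell out the objective, so the following is only a minimal sketch of what a policy gradient update usually minimizes. It assumes the standard REINFORCE loss, the mean of -log pi(a|s) * R over a batch, and a plain gradient descent step scaled by lr. The names policy_gradient_loss and sgd_step are hypothetical helpers written for illustration and are not part of the PARL API; the optimizer PARL actually uses is not stated here.

```python
import math


def policy_gradient_loss(action_probs, actions, rewards):
    """Return the mean of -log pi(a|s) * R over a batch.

    action_probs: list of per-step probability lists (the policy output).
    actions: list of chosen action indices, one per step.
    rewards: list of returns, one per step.
    Hypothetical helper for illustration, not a PARL function.
    """
    if not (len(action_probs) == len(actions) == len(rewards)):
        raise ValueError("all inputs must have the same length")
    if not actions:
        raise ValueError("batch must not be empty")
    total = 0.0
    for probs, action, reward in zip(action_probs, actions, rewards):
        # Negative log-likelihood of the taken action, weighted by its return.
        total += -math.log(probs[action]) * reward
    return total / len(actions)


def sgd_step(params, grads, lr):
    """Apply one plain gradient descent step: p <- p - lr * g (assumed optimizer)."""
    return [p - lr * g for p, g in zip(params, grads)]


# Two steps from a two-action policy.
loss = policy_gradient_loss(
    action_probs=[[0.5, 0.5], [0.25, 0.75]],
    actions=[0, 1],
    rewards=[1.0, 2.0],
)
print(loss)
```

Actions with a high return and a low probability contribute most to the loss, so minimizing it raises the probability of well-rewarded actions; lr controls how far each step moves the parameters.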