PARL
Index
_ | A | D | G | I | L | M | O | P | Q | R | S | T | V
_
__init__() (A2C method)
    (Agent method)
    (Algorithm method)
    (DDPG method)
    (DDQN method)
    (DQN method)
    (IMPALA method)
    (OAC method)
    (PolicyGradient method)
    (PPO method)
    (QMIX method)
    (SAC method)
    (TD3 method)
A
A2C (class in parl.algorithms.paddle.a2c)
Agent (class in parl.core.paddle.agent)
Algorithm (class in parl.core.paddle.algorithm)
D
DDPG (class in parl.algorithms.paddle.ddpg)
DDQN (class in parl.algorithms.paddle.ddqn)
DQN (class in parl.algorithms.paddle.dqn)
G
get_weights() (Algorithm method)
    (Model method)
I
IMPALA (class in parl.algorithms.fluid.impala.impala)
L
learn() (A2C method)
    (Agent method)
    (Algorithm method)
    (DDPG method)
    (DDQN method)
    (DQN method)
    (IMPALA method)
    (OAC method)
    (PolicyGradient method)
    (QMIX method)
    (SAC method)
    (TD3 method)
M
Model (class in parl.core.paddle.model)
module
    parl.algorithms.fluid.impala.impala
    parl.algorithms.fluid.ppo
    parl.algorithms.paddle.a2c
    parl.algorithms.paddle.ddpg
    parl.algorithms.paddle.ddqn
    parl.algorithms.paddle.dqn
    parl.algorithms.paddle.oac
    parl.algorithms.paddle.policy_gradient
    parl.algorithms.paddle.qmix
    parl.algorithms.paddle.sac
    parl.algorithms.paddle.td3
O
OAC (class in parl.algorithms.paddle.oac)
P
parl.algorithms.fluid.impala.impala (module)
parl.algorithms.fluid.ppo (module)
parl.algorithms.paddle.a2c (module)
parl.algorithms.paddle.ddpg (module)
parl.algorithms.paddle.ddqn (module)
parl.algorithms.paddle.dqn (module)
parl.algorithms.paddle.oac (module)
parl.algorithms.paddle.policy_gradient (module)
parl.algorithms.paddle.qmix (module)
parl.algorithms.paddle.sac (module)
parl.algorithms.paddle.td3 (module)
policy_learn() (PPO method)
PolicyGradient (class in parl.algorithms.paddle.policy_gradient)
PPO (class in parl.algorithms.fluid.ppo)
predict() (A2C method)
    (Agent method)
    (Algorithm method)
    (DDPG method)
    (DDQN method)
    (DQN method)
    (IMPALA method)
    (OAC method)
    (PolicyGradient method)
    (PPO method)
    (SAC method)
    (TD3 method)
prob_and_value() (A2C method)
Q
QMIX (class in parl.algorithms.paddle.qmix)
R
restore() (Agent method)
S
SAC (class in parl.algorithms.paddle.sac)
sample() (Agent method)
    (Algorithm method)
    (IMPALA method)
    (OAC method)
    (PPO method)
    (SAC method)
save() (Agent method)
set_weights() (Algorithm method)
    (Model method)
sync_old_policy() (PPO method)
sync_weights_to() (Model method)
T
TD3 (class in parl.algorithms.paddle.td3)
V
value() (A2C method)
value_learn() (PPO method)
value_predict() (PPO method)