View source on GitHub
Module importing all agents.
Modules
behavioral_cloning module: A Behavioral Cloning agent.
categorical_dqn module: A Categorical DQN (C51) agent.
cql module: A CQL-SAC agent.
data_converter module: Agent Converter API and converters.
ddpg module: A Deep Deterministic Policy Gradient (DDPG) agent and its networks.
dqn module: A DQN (Deep Q Network) agent.
ppo module: PPO Agents.
reinforce module: A REINFORCE agent.
sac module: A Soft Actor Critic agent.
td3 module: Twin Delayed Deep Deterministic policy gradient (TD3) agent.
tf_agent module: TensorFlow RL Agent API.
Classes
class BehavioralCloningAgent: A Behavioral Cloning agent.
class CategoricalDqnAgent: A Categorical DQN Agent based on the DQN Agent.
class CqlSacAgent: A CQL-SAC Agent based on the SAC Agent.
class DdpgAgent: A DDPG Agent.
class DqnAgent: A DQN Agent.
class PPOAgent: A PPO Agent.
class PPOClipAgent: A PPO Agent implementing the clipped probability ratios.
class PPOKLPenaltyAgent: A PPO Agent implementing the KL penalty loss.
class ReinforceAgent: A REINFORCE Agent.
class SacAgent: A SAC Agent.
class TFAgent: Abstract base class for TF-based RL and Bandits agents.
class Td3Agent: A TD3 Agent.
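The classes listed above are exposed at the package level, so they can be looked up by name on the package itself. The following is a minimal sketch, assuming the package is importable as `tf_agents.agents`. The helper `load_agent_classes` and the list `AGENT_CLASS_NAMES` are hypothetical names introduced here for illustration and are not part of the library.

```python
import importlib

# Class names copied from the "Classes" list above.
AGENT_CLASS_NAMES = [
    "BehavioralCloningAgent",
    "CategoricalDqnAgent",
    "CqlSacAgent",
    "DdpgAgent",
    "DqnAgent",
    "PPOAgent",
    "PPOClipAgent",
    "PPOKLPenaltyAgent",
    "ReinforceAgent",
    "SacAgent",
    "TFAgent",
    "Td3Agent",
]


def load_agent_classes(package="tf_agents.agents"):
    """Return {class name: class} for the listed agents.

    Hypothetical helper: returns an empty dict when the package
    (assumed here to be `tf_agents.agents`) cannot be imported.
    """
    try:
        module = importlib.import_module(package)
    except ImportError:
        # Package (or one of its dependencies) is not installed.
        return {}
    # Keep only the names that the imported package actually exposes.
    return {
        name: getattr(module, name)
        for name in AGENT_CLASS_NAMES
        if hasattr(module, name)
    }
```

Calling `load_agent_classes()` in an environment where the package is installed should return a mapping such as `"DqnAgent"` to the `DqnAgent` class. Where it is not installed, the result is an empty dict rather than an exception.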