View source on GitHub
|
Trajectories module.
Modules
policy_step module: Policy Step.
time_step module: TimeStep representing a step in the environment.
trajectory module: Trajectory containing time_step transition information.
Classes
class PolicyInfo: PolicyInfo(log_probability,)
class PolicyStep: Returned with every call to policy.action() and policy.distribution().
class StepType: Defines the status of a TimeStep within a sequence.
class TimeStep: Returned with every call to step and reset on an environment.
class Trajectory: A tuple that represents a trajectory.
class Transition: A tuple that represents a transition.
Functions
boundary(...): Create a Trajectory transitioning between StepTypes LAST and FIRST.
first(...): Create a Trajectory transitioning between StepTypes FIRST and MID.
from_transition(...): Returns a Trajectory given transitions.
last(...): Create a Trajectory transitioning between StepTypes MID and LAST.
mid(...): Create a Trajectory transitioning between StepTypes MID and MID.
restart(...): Returns a TimeStep with step_type set equal to StepType.FIRST.
single_step(...): Create a Trajectory transitioning between StepTypes FIRST and LAST.
termination(...): Returns a TimeStep with step_type set to StepType.LAST.
time_step_spec(...): Returns a TimeStep spec given the observation_spec.
to_n_step_transition(...): Create an n-step transition from a trajectory with T=N + 1 frames.
to_transition(...): Create a transition from a trajectory or two adjacent trajectories.
to_transition_spec(...): Create a transition spec from a trajectory spec.
transition(...): Returns a TimeStep with step_type set equal to StepType.MID.
truncation(...): Returns a TimeStep with step_type set to StepType.LAST.
View source on GitHub