Pendulum Class Reference

Implementation of Pendulum task.

Classes

class  Action
 Implementation of action of Pendulum.

class  State
 Implementation of state of Pendulum.


Public Member Functions

 Pendulum (const size_t maxSteps=200, const double maxAngularVelocity=8, const double maxTorque=2.0, const double dt=0.05, const double doneReward=0.0)
 Construct a Pendulum instance using the given values.

double AngleNormalize (double theta) const
 This function calculates the normalized angle for a particular theta.

State InitialSample ()
 Initial theta is randomly generated within [-pi, pi].

bool IsTerminal (const State &) const
 This function checks if the pendulum has reached a terminal state.

size_t MaxSteps () const
 Get the maximum number of steps allowed.

size_t & MaxSteps ()
 Set the maximum number of steps allowed.

double Sample (const State &state, const Action &action, State &nextState)
 Dynamics of Pendulum.

double Sample (const State &state, const Action &action)
 Dynamics of Pendulum.

size_t StepsPerformed () const
 Get the number of steps performed.


Detailed Description

Implementation of Pendulum task.

The inverted pendulum swing-up problem is a classic problem in the control literature. In this version of the problem, the pendulum starts in a random position, and the goal is to swing it up so that it stays upright.

Definition at line 31 of file pendulum.hpp.

Constructor & Destructor Documentation

◆ Pendulum()

Pendulum ( const size_t  maxSteps = 200,
const double  maxAngularVelocity = 8,
const double  maxTorque = 2.0,
const double  dt = 0.05,
const double  doneReward = 0.0 
)
inline

Construct a Pendulum instance using the given values.

Parameters
maxSteps            The number of steps after which the episode terminates. If the value is 0, there is no limit (default: 200 steps).
maxAngularVelocity  The maximum angular velocity.
maxTorque           The maximum torque.
dt                  The differential value (time step).
doneReward          The reward received by the agent on success.

Definition at line 118 of file pendulum.hpp.

Member Function Documentation

◆ AngleNormalize()

double AngleNormalize ( double  theta) const
inline

This function calculates the normalized angle for a particular theta.

Parameters
theta  The un-normalized angle.

Definition at line 212 of file pendulum.hpp.

References M_PI.

Referenced by Pendulum::Sample().
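
As an illustration of what angle normalization typically involves, here is a minimal stand-alone sketch. It is not the mlpack member function: the name NormalizeAngle and the choice of the half-open target interval [-pi, pi) are assumptions made for this example, since this page does not state the exact range or formula.

```cpp
#include <cmath>

// Hypothetical stand-alone helper (not part of mlpack).
// Maps any finite angle in radians onto the interval [-pi, pi).
double NormalizeAngle(const double theta)
{
  const double pi = std::acos(-1.0);

  // std::fmod keeps the sign of its first argument, so a negative
  // remainder is shifted up by one full turn before re-centering.
  double wrapped = std::fmod(theta + pi, 2.0 * pi);
  if (wrapped < 0.0)
    wrapped += 2.0 * pi;

  return wrapped - pi;
}
```

With this sketch, an angle of one full turn plus 0.5 radians comes back as approximately 0.5, and the mirrored negative input comes back as approximately -0.5.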

◆ InitialSample()

State InitialSample ( )
inline

Initial theta is randomly generated within [-pi, pi].

Initial angular velocity is randomly generated within [-1, 1].

Returns
Initial state for each episode.

Definition at line 197 of file pendulum.hpp.

References Pendulum::State::AngularVelocity(), M_PI, mlpack::math::Random(), Pendulum::State::SetState(), and Pendulum::State::Theta().
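
The sampling ranges described above can be sketched without mlpack using the standard library. The struct PendulumState and the function SampleInitialState are hypothetical names for this example, and a uniform distribution is an assumption (the page only says "randomly generated"). Note that std::uniform_real_distribution draws from a half-open interval, so the upper bounds are excluded.

```cpp
#include <cmath>
#include <random>

// Hypothetical state record for this sketch (not the mlpack State class).
struct PendulumState
{
  double theta;            // Angle in radians.
  double angularVelocity;  // Angular velocity.
};

// Draw an initial state: theta from [-pi, pi), velocity from [-1, 1).
PendulumState SampleInitialState(std::mt19937& rng)
{
  const double pi = std::acos(-1.0);
  std::uniform_real_distribution<double> thetaDist(-pi, pi);
  std::uniform_real_distribution<double> velocityDist(-1.0, 1.0);

  PendulumState state;
  state.theta = thetaDist(rng);
  state.angularVelocity = velocityDist(rng);
  return state;
}
```

Passing the generator by reference lets the caller control seeding, which keeps episodes reproducible during testing.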

◆ IsTerminal()

bool IsTerminal ( const State & ) const
inline

This function checks if the pendulum has reached a terminal state.

Parameters
state  The state to check.
Returns
true if state is a terminal state, otherwise false.

Definition at line 227 of file pendulum.hpp.

References Log::Info.
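
The constructor documentation says that an episode terminates after maxSteps steps and that a value of 0 means no limit. A minimal sketch of such a check follows. The free function StepLimitReached is a hypothetical name, and treating the step limit as the only terminal condition is an assumption, since this page does not list the conditions that IsTerminal() tests.

```cpp
#include <cstddef>

// Hypothetical helper (not part of mlpack): report whether an episode
// has used up its step budget. A budget of 0 is read as "no limit".
bool StepLimitReached(const std::size_t stepsPerformed,
                      const std::size_t maxSteps)
{
  return maxSteps != 0 && stepsPerformed >= maxSteps;
}
```

With the default budget of 200, step 199 is not terminal and step 200 is. With a budget of 0 the function never reports termination.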

◆ MaxSteps() [1/2]

size_t MaxSteps ( ) const
inline

Get the maximum number of steps allowed.

Definition at line 242 of file pendulum.hpp.

◆ MaxSteps() [2/2]

size_t& MaxSteps ( )
inline

Set the maximum number of steps allowed.

Definition at line 244 of file pendulum.hpp.
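
The two MaxSteps() overloads follow a common C++ accessor pattern: a const overload that returns the value, and a non-const overload that returns a reference so the caller can assign through it. The toy class StepLimit below is a hypothetical stand-in used only to show the pattern in isolation.

```cpp
#include <cstddef>

// Hypothetical toy class (not the mlpack Pendulum class).
class StepLimit
{
 public:
  explicit StepLimit(const std::size_t maxSteps) : maxSteps(maxSteps) { }

  // Read access: selected when the object is const.
  std::size_t MaxSteps() const { return maxSteps; }

  // Write access: returns a reference, so `obj.MaxSteps() = n` assigns.
  std::size_t& MaxSteps() { return maxSteps; }

 private:
  std::size_t maxSteps;
};
```

Writing `limit.MaxSteps() = 500;` on a non-const object updates the stored value, and a subsequent read through a const reference returns 500.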

◆ Sample() [1/2]

double Sample ( const State & state,
const Action & action,
State & nextState 
)
inline

Dynamics of Pendulum.

Get reward and next state based on current state and current action.

Parameters
state      The current state.
action     The current action.
nextState  The next state (output parameter).
Returns
The reward for taking the given action in the current state.

Definition at line 140 of file pendulum.hpp.

References Pendulum::Action::action, Pendulum::AngleNormalize(), Pendulum::State::AngularVelocity(), mlpack::math::ClampRange(), M_PI, Pendulum::State::SetState(), and Pendulum::State::Theta().

Referenced by Pendulum::Sample().
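
This page does not give the equations behind Sample(), so the following is only a sketch of one plausible step function. It assumes the widely used Gym-style pendulum formulation: gravity 10, unit mass and length, a quadratic cost on angle, velocity and torque, and clamping of torque and angular velocity. The names PendulumState, Clamp and StepPendulum and all constants are assumptions for this example. The defaults for maxAngularVelocity, maxTorque and dt mirror the constructor defaults listed above.

```cpp
#include <algorithm>
#include <cmath>

// Hypothetical state record for this sketch (not the mlpack State class).
struct PendulumState
{
  double theta;            // Angle in radians.
  double angularVelocity;  // Angular velocity.
};

// Restrict value to the closed interval [lo, hi].
double Clamp(const double value, const double lo, const double hi)
{
  return std::max(lo, std::min(value, hi));
}

// One Euler step of the assumed dynamics. Writes the successor state to
// `next` and returns the reward (the negated quadratic cost).
double StepPendulum(const PendulumState& state,
                    const double torque,
                    PendulumState& next,
                    const double maxAngularVelocity = 8.0,
                    const double maxTorque = 2.0,
                    const double dt = 0.05)
{
  // Assumed physical constants; not taken from this page.
  const double gravity = 10.0, mass = 1.0, length = 1.0;
  const double pi = std::acos(-1.0);

  const double u = Clamp(torque, -maxTorque, maxTorque);

  // Wrap the angle into [-pi, pi) before computing the cost.
  double wrapped = std::fmod(state.theta + pi, 2.0 * pi);
  if (wrapped < 0.0)
    wrapped += 2.0 * pi;
  wrapped -= pi;

  const double cost = wrapped * wrapped
      + 0.1 * state.angularVelocity * state.angularVelocity
      + 0.001 * u * u;

  // Angular acceleration from gravity and the applied torque.
  const double acceleration =
      -3.0 * gravity / (2.0 * length) * std::sin(state.theta + pi)
      + 3.0 / (mass * length * length) * u;

  next.angularVelocity = Clamp(state.angularVelocity + acceleration * dt,
                               -maxAngularVelocity, maxAngularVelocity);
  next.theta = state.theta + next.angularVelocity * dt;

  return -cost;
}
```

Under these assumptions, an upright pendulum at rest with zero torque incurs no cost, and any requested torque beyond maxTorque is applied as maxTorque.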

◆ Sample() [2/2]

double Sample ( const State & state,
const Action & action 
)
inline

Dynamics of Pendulum.

Get reward based on current state and current action.

Parameters
state   The current state.
action  The current action.
Returns
The reward.

Definition at line 185 of file pendulum.hpp.

References Pendulum::Sample().
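
Because this overload references Pendulum::Sample(), it presumably forwards to the three-argument form and discards the next state. That delegation pattern can be sketched as follows. ToyState and Step are hypothetical names, and the dynamics are a placeholder rather than the pendulum equations.

```cpp
// Hypothetical toy state (not the mlpack State class).
struct ToyState
{
  double value;
};

// Three-argument form: writes the successor state and returns a reward.
// Placeholder dynamics: add the action, penalize the squared result.
double Step(const ToyState& state, const double action, ToyState& next)
{
  next.value = state.value + action;
  return -(next.value * next.value);
}

// Two-argument form: same reward, successor state thrown away.
double Step(const ToyState& state, const double action)
{
  ToyState discarded{};
  return Step(state, action, discarded);
}
```

Keeping the logic in one overload means both forms always agree on the reward.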

◆ StepsPerformed()

size_t StepsPerformed ( ) const
inline

Get the number of steps performed.

Definition at line 239 of file pendulum.hpp.


The documentation for this class was generated from the following file:
  • /home/jenkins-mlpack/mlpack.org/_src/mlpack-git/src/mlpack/methods/reinforcement_learning/environment/pendulum.hpp