BRNN< OutputLayerType, MergeLayerType, MergeOutputType, InitializationRuleType, CustomLayers > Class Template Reference

Implementation of a standard bidirectional recurrent neural network container. More...

Public Types

using NetworkType = BRNN< OutputLayerType, MergeLayerType, MergeOutputType, InitializationRuleType, CustomLayers... >
 Convenience typedef for the internal model construction. More...

 

Public Member Functions

 BRNN (const size_t rho, const bool single=false, OutputLayerType outputLayer=OutputLayerType(), MergeLayerType mergeLayer=MergeLayerType(), MergeOutputType mergeOutput=MergeOutputType(), InitializationRuleType initializeRule=InitializationRuleType())
 Create the BRNN object. More...

 
template<class LayerType , class... Args>
void Add (Args... args)
 
void Add (LayerTypes< CustomLayers... > layer)
 
double Evaluate (const arma::mat &parameters, const size_t begin, const size_t batchSize, const bool deterministic)
 Evaluate the bidirectional recurrent neural network with the given parameters. More...

 
double Evaluate (const arma::mat &parameters, const size_t begin, const size_t batchSize)
 Evaluate the bidirectional recurrent neural network with the given parameters. More...

 
template
<
typename
GradType
>
double EvaluateWithGradient (const arma::mat &parameters, const size_t begin, GradType &gradient, const size_t batchSize)
 Evaluate the bidirectional recurrent neural network with the given parameters. More...

 
void Gradient (const arma::mat &parameters, const size_t begin, arma::mat &gradient, const size_t batchSize)
 Evaluate the gradient of the bidirectional recurrent neural network with the given parameters, and with respect to only one point in the dataset. More...

 
size_t NumFunctions () const
 Return the number of separable functions. (number of predictor points). More...

 
const arma::mat & Parameters () const
 Return the initial point for the optimization. More...

 
arma::mat & Parameters ()
 Modify the initial point for the optimization. More...

 
void Predict (arma::cube predictors, arma::cube &results, const size_t batchSize=256)
 Predict the responses to a given set of predictors. More...

 
const arma::cube & Predictors () const
 Get the matrix of data points (predictors). More...

 
arma::cube & Predictors ()
 Modify the matrix of data points (predictors). More...

 
void Reset ()
 Reset the state of the network. More...

 
void ResetParameters ()
 Reset the module information (weights/parameters). More...

 
const arma::cube & Responses () const
 Get the matrix of responses to the input data points. More...

 
arma::cube & Responses ()
 Modify the matrix of responses to the input data points. More...

 
const size_t & Rho () const
 Return the maximum length of backpropagation through time. More...

 
size_t & Rho ()
 Modify the maximum length of backpropagation through time. More...

 
template
<
typename
Archive
>
void serialize (Archive &ar, const unsigned int)
 Serialize the model. More...

 
void Shuffle ()
 Shuffle the order of function visitation. More...

 
template
<
typename
OptimizerType
>
double Train (arma::cube predictors, arma::cube responses, OptimizerType &optimizer)
 Train the bidirectional recurrent neural network on the given input data using the given optimizer. More...

 
template
<
typename
OptimizerType
=
ens::StandardSGD
>
double Train (arma::cube predictors, arma::cube responses)
 Train the bidirectional recurrent neural network on the given input data. More...

 

Detailed Description


template<typename OutputLayerType = NegativeLogLikelihood<>, typename MergeLayerType = Concat<>, typename MergeOutputType = LogSoftMax<>, typename InitializationRuleType = RandomInitialization, typename... CustomLayers>
class mlpack::ann::BRNN< OutputLayerType, MergeLayerType, MergeOutputType, InitializationRuleType, CustomLayers >

Implementation of a standard bidirectional recurrent neural network container.

Template Parameters
OutputLayerTypeThe output layer type used to evaluate the network.
InitializationRuleTypeRule used to initialize the weight matrix.

Definition at line 47 of file brnn.hpp.

Member Typedef Documentation

◆ NetworkType

using NetworkType = BRNN<OutputLayerType, MergeLayerType, MergeOutputType, InitializationRuleType, CustomLayers...>

Convenience typedef for the internal model construction.

Definition at line 55 of file brnn.hpp.

Constructor & Destructor Documentation

◆ BRNN()

BRNN ( const size_t  rho,
const bool  single = false,
OutputLayerType  outputLayer = OutputLayerType(),
MergeLayerType  mergeLayer = MergeLayerType(),
MergeOutputType  mergeOutput = MergeOutputType(),
InitializationRuleType  initializeRule = InitializationRuleType() 
)

Create the BRNN object.

Optionally, specify which initialize rule and performance function should be used.

If you want to pass in a parameter and discard the original parameter object, be sure to use std::move to avoid unnecessary copy.

Parameters
rhoMaximum number of steps to backpropagate through time (BPTT).
singlePredict only the last element of the input sequence.
outputLayerOutput layer used to evaluate the network.
initializeRuleOptional instantiated InitializationRule object for initializing the network parameter.

Member Function Documentation

◆ Add() [1/2]

void Add ( Args...  args)

◆ Add() [2/2]

void Add ( LayerTypes< CustomLayers... >  layer)

◆ Evaluate() [1/2]

double Evaluate ( const arma::mat &  parameters,
const size_t  begin,
const size_t  batchSize,
const bool  deterministic 
)

Evaluate the bidirectional recurrent neural network with the given parameters.

This function is usually called by the optimizer to train the model.

Parameters
parametersMatrix model parameters.
beginIndex of the starting point to use for objective function evaluation.
batchSizeNumber of points to be passed at a time to use for objective function evaluation.
deterministicWhether or not to train or test the model. Note some layer act differently in training or testing mode.

◆ Evaluate() [2/2]

double Evaluate ( const arma::mat &  parameters,
const size_t  begin,
const size_t  batchSize 
)

Evaluate the bidirectional recurrent neural network with the given parameters.

This function is usually called by the optimizer to train the model. This just calls the other overload of Evaluate() with deterministic = true.

Parameters
parametersMatrix model parameters.
beginIndex of the starting point to use for objective function evaluation.
batchSizeNumber of points to be passed at a time to use for objective function evaluation.

◆ EvaluateWithGradient()

double EvaluateWithGradient ( const arma::mat &  parameters,
const size_t  begin,
GradType &  gradient,
const size_t  batchSize 
)

Evaluate the bidirectional recurrent neural network with the given parameters.

This function is usually called by the optimizer to train the model. This just calls the other overload of Evaluate() with deterministic = true.

Parameters
parametersMatrix model parameters.
beginIndex of the starting point to use for objective function evaluation.
gradientMatrix to output gradient into.
batchSizeNumber of points to be passed at a time to use for objective function evaluation.

◆ Gradient()

void Gradient ( const arma::mat &  parameters,
const size_t  begin,
arma::mat &  gradient,
const size_t  batchSize 
)

Evaluate the gradient of the bidirectional recurrent neural network with the given parameters, and with respect to only one point in the dataset.

This is useful for optimizers such as SGD, which require a separable objective function.

Parameters
parametersMatrix of the model parameters to be optimized.
beginIndex of the starting point to use for objective function gradient evaluation.
gradientMatrix to output gradient into.
batchSizeNumber of points to be processed as a batch for objective function gradient evaluation.

◆ NumFunctions()

size_t NumFunctions ( ) const
inline

Return the number of separable functions. (number of predictor points).

Definition at line 249 of file brnn.hpp.

◆ Parameters() [1/2]

const arma::mat& Parameters ( ) const
inline

Return the initial point for the optimization.

Definition at line 252 of file brnn.hpp.

◆ Parameters() [2/2]

arma::mat& Parameters ( )
inline

Modify the initial point for the optimization.

Definition at line 254 of file brnn.hpp.

◆ Predict()

void Predict ( arma::cube  predictors,
arma::cube &  results,
const size_t  batchSize = 256 
)

Predict the responses to a given set of predictors.

The responses will reflect the output of the given output layer as returned by the output layer function.

If you want to pass in a parameter and discard the original parameter object, be sure to use std::move to avoid unnecessary copy.

The format of the data should be as follows:

  • each slice should correspond to a time step
  • each column should correspond to a data point
  • each row should correspond to a dimension So, e.g., predictors(i, j, k) is the i'th dimension of the j'th data point at time slice k. The responses will be in the same format.
Parameters
predictorsInput predictors.
resultsMatrix to put output predictions of responses into.
batchSizeNumber of points to predict at once.

◆ Predictors() [1/2]

const arma::cube& Predictors ( ) const
inline

Get the matrix of data points (predictors).

Definition at line 267 of file brnn.hpp.

◆ Predictors() [2/2]

◆ Reset()

void Reset ( )

Reset the state of the network.

This ensures that all internally-held gradients are set to 0, all memory cells are reset, and the parameters matrix is the right size.

Referenced by BRNN< OutputLayerType, MergeLayerType, MergeOutputType, InitializationRuleType, CustomLayers >::Predictors().

◆ ResetParameters()

void ResetParameters ( )

◆ Responses() [1/2]

const arma::cube& Responses ( ) const
inline

Get the matrix of responses to the input data points.

Definition at line 262 of file brnn.hpp.

◆ Responses() [2/2]

arma::cube& Responses ( )
inline

Modify the matrix of responses to the input data points.

Definition at line 264 of file brnn.hpp.

◆ Rho() [1/2]

const size_t& Rho ( ) const
inline

Return the maximum length of backpropagation through time.

Definition at line 257 of file brnn.hpp.

◆ Rho() [2/2]

size_t& Rho ( )
inline

Modify the maximum length of backpropagation through time.

Definition at line 259 of file brnn.hpp.

◆ serialize()

void serialize ( Archive &  ar,
const unsigned  int 
)

◆ Shuffle()

void Shuffle ( )

Shuffle the order of function visitation.

This may be called by the optimizer.

◆ Train() [1/2]

double Train ( arma::cube  predictors,
arma::cube  responses,
OptimizerType &  optimizer 
)

Train the bidirectional recurrent neural network on the given input data using the given optimizer.

This will use the existing model parameters as a starting point for the optimization. If this is not what you want, then you should access the parameters vector directly with Parameters() and modify it as desired.

If you want to pass in a parameter and discard the original parameter object, be sure to use std::move to avoid unnecessary copy.

The format of the data should be as follows:

  • each slice should correspond to a time step
  • each column should correspond to a data point
  • each row should correspond to a dimension So, e.g., predictors(i, j, k) is the i'th dimension of the j'th data point at time slice k.
Template Parameters
OptimizerTypeType of optimizer to use to train the model.
Parameters
predictorsInput training variables.
responsesOutputs results from input training variables.
optimizerInstantiated optimizer used to train the model.

◆ Train() [2/2]

double Train ( arma::cube  predictors,
arma::cube  responses 
)

Train the bidirectional recurrent neural network on the given input data.

By default, the SGD optimization algorithm is used, but others can be specified (such as ens::RMSprop).

This will use the existing model parameters as a starting point for the optimization. If this is not what you want, then you should access the parameters vector directly with Parameters() and modify it as desired.

If you want to pass in a parameter and discard the original parameter object, be sure to use std::move to avoid unnecessary copy.

The format of the data should be as follows:

  • each slice should correspond to a time step
  • each column should correspond to a data point
  • each row should correspond to a dimension So, e.g., predictors(i, j, k) is the i'th dimension of the j'th data point at time slice k.
Template Parameters
OptimizerTypeType of optimizer to use to train the model.
Parameters
predictorsInput training variables.
responsesOutputs results from input training variables.

The documentation for this class was generated from the following file:
  • /home/jenkins-mlpack/mlpack.org/_src/mlpack-3.2.1/src/mlpack/methods/ann/brnn.hpp