nssTrainingSGDM

SGDM training options object for neural state-space systems

Since R2022b


Description

Use an nssTrainingSGDM object to specify stochastic gradient descent with momentum (SGDM) options for training an idNeuralStateSpace network using nlssest.

Creation

Create an nssTrainingSGDM object using nssTrainingOptions and specifying "sgdm" as the input argument.

Properties


`UpdateMethod` — Solver used to update network parameters
`"SGDM"` (default)

Solver used to update network parameters, returned as a string. This property is read-only.

Use nssTrainingOptions("adam"), nssTrainingOptions("rmsprop"), or nssTrainingOptions("lbfgs") to return an options set object for the Adam, RMSProp, or L-BFGS solver, respectively. For more information on these algorithms, see the Algorithms section of trainingOptions (Deep Learning Toolbox).

`Momentum` — Contribution of previous step
`0.95` (default) | nonnegative scalar less than `1`

Contribution of the parameter update step of the previous iteration to the current iteration of stochastic gradient descent with momentum, specified as a nonnegative scalar less than 1.

A value of 0 means no contribution from the previous step, whereas values approaching 1 mean a progressively larger contribution from the previous step. The default value works well for most tasks.

For more information, see Stochastic Gradient Descent with Momentum (Deep Learning Toolbox).
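
The effect of Momentum can be illustrated with a minimal sketch of the standard momentum update on a toy quadratic loss. This is not the toolbox implementation: the loss, the gradient function gradFcn, and the variable names are hypothetical and chosen only for illustration.

```matlab
% Toy illustration of gradient descent with momentum (not the toolbox code).
% Assumed loss: V(theta) = 0.5*theta^2, so the gradient is simply theta.
gradFcn   = @(theta) theta;   % hypothetical gradient function
learnRate = 0.01;             % same value as the LearnRate default
momentum  = 0.95;             % same value as the Momentum default

theta    = 1;                 % initial parameter value
prevStep = 0;                 % no previous step before the first iteration

for k = 1:500
    % Current step = gradient step + momentum times the previous step
    step     = -learnRate*gradFcn(theta) + momentum*prevStep;
    theta    = theta + step;
    prevStep = step;
end

disp(theta)   % close to the minimizer at 0
```

Setting momentum to 0 in this sketch reduces the loop to plain gradient descent, which matches the "no contribution from the previous step" case described above.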

`LossFcn` — Type of function used to calculate loss
`"MeanAbsoluteError"` (default) | `"MeanSquaredError"`

Type of function used to calculate loss, specified as one of the following:

"MeanAbsoluteError" — uses the mean value of the absolute error.
"MeanSquaredError" — uses the mean value of the squared error.

`PlotLossFcn` — Option to plot the value of the loss function during training
`true` (default) | `false`

Option to plot the value of the loss function during training, specified as one of the following:

true — plots the value of the loss function during training.
false — does not plot the value of the loss function during training.

`Lambda` — Loss function regularization constant
`0` (default) | nonnegative scalar

Constant coefficient applied to the regularization term added to the loss function, specified as a nonnegative scalar. With the default value of 0, the regularization term vanishes.

The loss function with the regularization term is given by:

$\hat{V}_{N}(\theta) = \frac{1}{N} \sum_{t=1}^{N} \varepsilon^{2}(t, \theta) + \frac{1}{N} \lambda \|\theta\|^{2}$

where t is the time variable, N is the size of the batch, ε is the sum of the reconstruction loss and autoencoder loss, θ is a concatenated vector of weights and biases of the neural network, and λ is the regularization constant that you can tune.

For more information, see Regularized Estimates of Model Parameters.
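
As a worked instance of the formula above, the following sketch evaluates the regularized loss for a few made-up numbers. The error values, the parameter vector, and the value of lambda are assumptions for illustration only; in practice the software computes these quantities internally during training.

```matlab
% Evaluate the regularized loss for made-up values (illustration only).
epsilon = [0.5; -1.0; 0.25; 0.75];   % hypothetical per-sample errors, N = 4
theta   = [0.2; -0.4; 0.1];          % hypothetical weights and biases
lambda  = 0.01;                      % hypothetical regularization constant

N        = numel(epsilon);
dataTerm = sum(epsilon.^2)/N;        % (1/N) * sum of squared errors
regTerm  = lambda*norm(theta)^2/N;   % (1/N) * lambda * ||theta||^2
V        = dataTerm + regTerm;       % regularized loss
```

Here the data term is 0.46875 and the regularization term is 0.000525, so a larger lambda shifts the balance toward keeping the weights and biases small.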

`LearnRate` — Learning rate
`0.01` (default) | positive scalar

Learning rate used for training, specified as a positive scalar. If the learning rate is too small, then training can take a long time. If the learning rate is too large, then training might reach a suboptimal result or diverge.

`LearnRateSchedule` — Learning rate schedule
`"none"` (default) | `"piecewise"`

Learning rate schedule, specified as "none" or "piecewise".

Learning Rate Schedule	Description
`"none"`	No learning rate schedule. This schedule keeps the learning rate constant.
`"piecewise"`	Piecewise learning rate schedule. Every LearnRateDropPeriod epochs, this schedule multiplies the learning rate by LearnRateDropFactor. With the default values, the learning rate drops by a factor of 10 every 10 epochs.

`LearnRateDropPeriod` — Number of epochs for dropping the learning rate
`10` (default) | positive integer

Number of epochs for dropping the learning rate, specified as a positive integer. This option is valid only when the LearnRateSchedule training option is "piecewise".

The software multiplies the global learning rate by the drop factor every time the specified number of epochs passes. Specify the drop factor using the LearnRateDropFactor training option.

`LearnRateDropFactor` — Factor for dropping the learning rate
`0.1` (default) | scalar from `0` to `1`

Factor for dropping the learning rate, specified as a scalar from 0 to 1. This option is valid only when the LearnRateSchedule training option is "piecewise".

LearnRateDropFactor is a multiplicative factor to apply to the learning rate every time a certain number of epochs passes. Specify the number of epochs using the LearnRateDropPeriod training option.
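
The interaction between LearnRate, LearnRateDropPeriod, and LearnRateDropFactor under the "piecewise" schedule can be sketched as follows. This sketch assumes that the first drop takes effect once the first LearnRateDropPeriod epochs have completed; the variable names are hypothetical and the code is not taken from the toolbox.

```matlab
% Piecewise learning rate schedule, sketched with the default option values.
initialLearnRate = 0.01;   % LearnRate
dropPeriod       = 10;     % LearnRateDropPeriod
dropFactor       = 0.1;    % LearnRateDropFactor
maxEpochs        = 30;

epochs    = 1:maxEpochs;
% Assumption: the rate drops after every dropPeriod completed epochs.
numDrops  = floor((epochs - 1)/dropPeriod);
learnRate = initialLearnRate*dropFactor.^numDrops;

% Epochs 1-10 use 0.01, epochs 11-20 use 0.001, and epochs 21-30 use 0.0001.
```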

`MaxEpochs` — Maximum number of epochs
`100` (default) | positive integer

Maximum number of epochs to use for training, specified as a positive integer. An epoch is a full pass of the training algorithm over the entire training set.

`Beta` — Coefficient applied to tune the reconstruction loss of an autoencoder
`0` (default) | nonnegative scalar

Coefficient applied to tune the reconstruction loss of an autoencoder, specified as a nonnegative scalar.

Reconstruction loss measures the difference between the original input (x) and its reconstruction (x_r) after encoding and decoding. The loss is calculated as the L2 norm of (x - x_r) divided by the batch size (N).

`WindowSize` — Size of data frames
`Inf` (default) | positive integer

Number of samples in each frame or batch when segmenting data for model training, specified as a positive integer.

`NumWindowFraction` — Fraction of total number of frames or batches
`1` (default) | positive scalar less than or equal to `1`

Fraction of the total number of frames or batches used in each iteration within a training epoch, specified as a positive scalar less than or equal to one.

If NumWindowFraction = 1, the algorithm uses all the available data samples for estimation in each training epoch. This approach is called full-batch learning.

If NumWindowFraction < 1, at the start of each training epoch, the algorithm randomly shuffles all the batches. Then the algorithm divides these batches into consecutive groups where each group contains a fraction of the total number of batches as specified by NumWindowFraction. During the training epoch, the algorithm iterates over these groups, using a different subset of data samples in each iteration. This approach is called mini-batch or stochastic learning. For mini-batch learning, loss in an epoch is approximated by taking the average of losses in all iterations within the epoch.

For more information on full-batch and mini-batch learning, see Training Neural State-Space Models.
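
The shuffle-and-group procedure described above can be sketched as follows. The number of batches, the rounding of the group size, and the handling of a smaller final group are assumptions made for illustration; the toolbox may treat these details differently.

```matlab
% Sketch of how one epoch might split shuffled batches into groups.
numBatches        = 10;    % hypothetical number of frames or batches
numWindowFraction = 0.3;   % NumWindowFraction

% Assumed rounding rule for the number of batches per iteration
groupSize = max(1, round(numWindowFraction*numBatches));
order     = randperm(numBatches);           % shuffle at the start of the epoch
numIters  = ceil(numBatches/groupSize);     % iterations in this epoch

groups = cell(1, numIters);
for k = 1:numIters
    first     = (k - 1)*groupSize + 1;
    last      = min(k*groupSize, numBatches);   % final group may be smaller
    groups{k} = order(first:last);              % batches used in iteration k
end
```

With these values the epoch has four iterations, and every batch is used exactly once per epoch. The epoch loss would then be approximated by averaging the losses of those four iterations.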

`Overlap` — Size of overlap
`"auto"` (default) | integer

Number of samples in the overlap between successive frames when segmenting data for model training, specified as an integer. A negative integer indicates that certain data samples are skipped when creating the data frames.

The default value, "auto", implies that the size of the overlap is 0.
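
To make the roles of WindowSize and Overlap concrete, the following sketch computes the first sample index of each frame for a positive and a negative overlap. The helper frameStarts is hypothetical, and the sketch assumes that an incomplete trailing frame is discarded, which the toolbox may handle differently.

```matlab
% Hypothetical helper: first sample index of each frame.
% Successive frames start (windowSize - overlap) samples apart.
frameStarts = @(numSamples, windowSize, overlap) ...
    1:(windowSize - overlap):(numSamples - windowSize + 1);

startsOverlap = frameStarts(20, 6, 2);    % frames 1-6, 5-10, 9-14, 13-18
startsSkip    = frameStarts(20, 6, -2);   % frames 1-6 and 9-14

% With overlap = 2, successive frames share two samples.
% With overlap = -2, samples 7 and 8 fall between frames and are skipped.
```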

`ODESolverOptions` — ODE solver options for continuous-time systems
`nssDLODE45` (default)

ODE solver options to integrate continuous-time neural state-space systems, specified as an nssDLODE45 object.

Use dot notation to access properties such as the following:

Solver — Solver type, set as "dlode45". This is a read-only property.
InitialStepSize — Initial step size, specified as a positive scalar. If you do not specify an initial step size, then the solver bases the initial step size on the slope of the solution at the initial time point.
MaxStepSize — Maximum step size, specified as a positive scalar. It is an upper bound on the size of any step taken by the solver. The default is one tenth of the difference between final and initial time.
AbsoluteTolerance — Absolute tolerance, specified as a positive scalar. It is the largest allowable absolute error. Intuitively, when the solution approaches 0, AbsoluteTolerance is the threshold below which you do not worry about the accuracy of the solution since it is effectively 0.
RelativeTolerance — Relative tolerance, specified as a positive scalar. This tolerance measures the error relative to the magnitude of each solution component. Intuitively, it controls the number of significant digits in a solution (except when it is smaller than the absolute tolerance).

For more information, see odeset.
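
As a brief illustration of the dot notation mentioned above, the following fragment adjusts a few solver properties on an options object. The numeric values are arbitrary placeholders rather than recommendations, and the fragment requires System Identification Toolbox.

```matlab
% Adjust ODE solver options through dot notation (illustrative values only).
opts = nssTrainingOptions("sgdm");
opts.ODESolverOptions.RelativeTolerance = 1e-4;   % placeholder value
opts.ODESolverOptions.AbsoluteTolerance = 1e-6;   % placeholder value
opts.ODESolverOptions.MaxStepSize       = 0.1;    % placeholder value
```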

`InputInterSample` — Input interpolation method
`'foh'` (default) | `'zoh'` | `'spline'` | `'cubic'` | `'makima'` | `'pchip'`

Input interpolation method, specified as one of the following:

'zoh' — uses zero-order hold interpolation method.
'foh' — uses first-order hold interpolation method.
'cubic' — uses cubic interpolation method.
'makima' — uses modified Akima interpolation method.
'pchip' — uses shape-preserving piecewise cubic interpolation method.
'spline' — uses spline interpolation method.

This is the interpolation method used to interpolate the input when integrating continuous-time neural state-space systems. For more information, see interpolation methods in interp1.
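
The difference between zero-order hold and first-order hold can be seen in a small interp1 sketch. The mapping used here is an assumption made for illustration: 'zoh' is emulated with the interp1 method 'previous' and 'foh' with 'linear'. The sample data are made up.

```matlab
% Compare hold behaviors between two input samples (illustration only).
t  = [0 1 2];      % sample times
u  = [0 10 15];    % hypothetical input samples
tq = 0.5;          % query time between the first two samples

uZoh = interp1(t, u, tq, 'previous');   % holds the last sample: 0
uFoh = interp1(t, u, tq, 'linear');     % straight line between samples: 5
```

A zero-order hold keeps the input constant between samples, whereas a first-order hold ramps linearly, which is often a closer match for smoothly varying inputs.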

Examples


Create SGDM Option Set to Train a Neural State-Space System


Use nssTrainingOptions to return an options set object to train an idNeuralStateSpace system.

sgdmOpts = nssTrainingOptions("sgdm")

sgdmOpts = 
  nssTrainingSGDM with properties:

           UpdateMethod: "SGDM"
              LearnRate: 0.0100
               Momentum: 0.9500
              MaxEpochs: 100
      LearnRateSchedule: "none"
    LearnRateDropFactor: 0.1000
    LearnRateDropPeriod: 10
                 Lambda: 0
                   Beta: 0
                LossFcn: "MeanAbsoluteError"
            PlotLossFcn: 1
       ODESolverOptions: [1×1 idoptions.nssDLODE45]
       InputInterSample: 'foh'
             WindowSize: Inf
      NumWindowFraction: 1
                Overlap: "auto"

Use dot notation to access the object properties.

sgdmOpts.LearnRate = 0.01;

You can use sgdmOpts as an input argument to nlssest to specify the training options for the state or the non-trivial output network of an idNeuralStateSpace object.

Version History

Introduced in R2022b


R2026a: Train neural state-space models using mini-batch learning

Starting in R2026a, you can use mini-batch learning, also known as stochastic learning, to train neural state-space models.

To enable mini-batch learning, specify an Adam, SGDM, or RMSProp training options object using nssTrainingOptions. Then, set the WindowSize property to be less than the number of data samples and the NumWindowFraction property to be less than one. You then estimate the neural state-space model using the nlssest command with the specified training options.
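
A minimal configuration fragment for these steps might look like the following. The numeric values are placeholders: WindowSize must be chosen relative to the length of your own data set, and the fragment requires System Identification Toolbox.

```matlab
% Configure SGDM options for mini-batch learning (placeholder values).
opts = nssTrainingOptions("sgdm");
opts.WindowSize        = 50;     % assumed to be less than the number of data samples
opts.NumWindowFraction = 0.25;   % a value less than 1 selects mini-batch learning
```

You would then pass opts to nlssest together with your data and the idNeuralStateSpace object.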

R2026a: `MiniBatchSize` property is removed

The MiniBatchSize property of the nssTrainingADAM, nssTrainingSGDM, and nssTrainingRMSProp objects has been removed. You can use the WindowSize property and the new NumWindowFraction property instead of MiniBatchSize.

For example, if you previously had 1000 batches of data and set MiniBatchSize=100, then starting in R2026a, set NumWindowFraction=0.1 for the same number of batches.
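
The conversion in this example amounts to dividing the former mini-batch size by the total number of batches. The following sketch uses hypothetical variable names and the numbers from the example.

```matlab
% Convert a former MiniBatchSize setting into a NumWindowFraction value.
numBatches       = 1000;   % total number of batches
oldMiniBatchSize = 100;    % former MiniBatchSize setting

numWindowFraction = oldMiniBatchSize/numBatches;   % 0.1
itersPerEpoch     = numBatches/oldMiniBatchSize;   % 10 iterations per epoch
```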
