base_models¶

You can defined your model by implementing the abstract methods, BaseModel.train_data_sources(), BaseModel.predict(), and BaseModel.loss(). See Defining a model for instructions. Currently, there are three base models that you can inherit from,

BaseModel
BaseSPSS
BaseVAE

BaseModel¶

class morgana.base_models.BaseModel[source]¶

Bases: torch.nn.modules.module.Module

Creates an abstract model class with utility functions.

Any additional kwargs specified in __init__ should be passed to the command line argument model_kwargs.

normalisers¶

Normalisers specified within the morgana.data._DataSource in self.train_data_sources.

Type: None or dict[str, morgana.data._FeatureNormaliser]

mode¶

Stage of training, set in morgana.experiment_builder.ExperimentBuilder.*_epoch, for use with self.metrics.

Type: {‘’, ‘train’, ‘valid’, ‘test’}

metrics¶

Handler for tracking metrics in an online fashion (over multiple batches).

Type: morgana.metrics.Handler

step¶

Step in training, calculated using epoch number, batch number, and number of batches per epoch. This is updated automatically by morgana.experiment_builder.ExperimentBuilder. Useful for logging to self.tensorboard.

Type: int

tensorboard¶

Type: tensorboardX.SummaryWriter

train_data_sources(self)[source]¶

Specifies the data that will be loaded and used in training.

Only specifies what data will be loaded, but not where from.

Returns: The data sources used by morgana.experiment_builder.ExperimentBuilder for the training data, can be any data structure containing morgana.data._DataSource instances.
Return type: features

valid_data_sources(self)[source]¶

Specifies the data that will be loaded and used in validation.

Only specifies what data will be loaded, but not where from.

Returns: The data sources used by morgana.experiment_builder.ExperimentBuilder for the validation data, can be any data structure containing morgana.data._DataSource instances.
Return type: features

test_data_sources(self)[source]¶

Specifies the data that will be loaded and used in testing.

Only specifies what data will be loaded, but not where from.

Returns: The data sources used by morgana.experiment_builder.ExperimentBuilder for the testing data, can be any data structure containing morgana.data._DataSource instances.
Return type: features

forward(self, features)[source]¶

Defines the computation graph, including calculation of loss.

Parameters

features (dict[str, torch.Tensor]) – The ground truth features produced by self.*_data_sources.

Returns

loss (float) – Loss of the model, as defined by self.loss.
output_features – Predictions made by the model, can be any data structure containing torch.Tensor instances.

predict(self, features)[source]¶

Defines the computation graph.

Parameters: features (dict[str, torch.Tensor]) – The ground truth features produced by self.*_data_sources.
Returns: Predictions made by the model, can be any data structure containing torch.Tensor instances.
Return type: output_features

loss(self, features, output_features)[source]¶

Defines which predictions should be scored against which ground truth features.

Typically this method should use _loss() to calculate the sequence loss for the target-prediction pairs.

Parameters

features (dict[str, torch.Tensor]) – The ground truth features produced by self.*_data_sources.
output_features (torch.Tensor or list[torch.Tensor] or dict[str, torch.Tensor]) – Predictions output by user-defined predict().

Returns

Overall loss between (user-defined) pairs of values in features and output_features.

Return type

float

_loss(self, targets, predictions, seq_lens=None, loss_weights=None)[source]¶

Defines the sequence loss for multiple target-prediction pairs.

If targets and predictions are iterables they must be in the same order, i.e. when zipped corresponding elements will be used as a target-prediction pair for calculating the loss.

The loss value between two frames of the target and prediction is given by loss_fn(). Currently this must be the same for all target-prediction pairs.

Parameters

targets (list[torch.Tensor] or torch.Tensor, shape (batch_size, seq_len, feat_dim)) – Ground truth tensor(s).
predictions (list[torch.Tensor] or torch.Tensor, shape (batch_size, seq_len, feat_dim)) – Prediction tensor(s).
seq_lens (None or list[torch.Tensor] or torch.Tensor, shape (batch_size,)) – Sequence length features. If one tensor is given it will be used for all target-prediction pairs, otherwise the length of the list given must match the length of targets and predictions.
loss_weights (None or list[float], shape (num_pairs)) – The weight for each target-prediction pair’s loss. If None then returns the average of all pair’s losses.

Returns

Overall (average or weight) loss.

Return type

float

Raises

ValueError – If targets, predictions, seq_len, or loss_weights are lists with non-matching lengths.

loss_fn(self, target, prediction)[source]¶

Defines the frame-wise loss calculation between ground truth and predictions.

Parameters

target (torch.Tensor, shape (batch_size, seq_len, feat_dim)) – Ground truth feature.
prediction (torch.Tensor, shape (batch_size, seq_len, feat_dim)) – Predicted feature.

Returns

Loss between feature and prediction.

Return type

torch.Tensor, shape (batch_size, seq_len, feat_dim)

save_parameters(self, experiment_dir, epoch)[source]¶

Saves the model’s state_dict to a .pt file.

Parameters

experiment_dir (str) – The experiment directory, within which the checkpoints directory will be created.
epoch (int) – The epoch number, used to create the checkpoint file name, epoch_{}.pt

load_parameters(self, checkpoint_path, strict=True, device=None)[source]¶

Loads a state_dict from a .pt file.

Parameters

checkpoint_path (str) – The file path of the .pt file containing the state_dict to be loaded
strict (bool) – Whether to strictly enforce that the keys in the loaded state_dict match this model’s structure.
device (str or torch.device or dict or callable) – Specifies how to remap storage locations, passed to torch.load().

Returns

state_dict – Parameters and persistent buffers that define the model.

Return type

dict

analysis_for_train_batch(self, features, output_features, out_dir, **kwargs)[source]¶

Hook used by morgana.experiment_builder.ExperimentBuilder after training batches for some epochs.