fl4health.model_bases.masked_layers.masked_normalization_layers module¶
- class MaskedBatchNorm1d(num_features, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True, device=None, dtype=None)[source]¶
Bases: _MaskedBatchNorm
Applies (masked) Batch Normalization over a 2D or 3D input. Input shape should be (N, C) or (N, C, L), where N is the batch size, C is the number of features/channels, and L is the sequence length.
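A brief usage sketch for illustration (the import path follows the module name at the top of this page; shapes follow the description above):

```python
import torch
from fl4health.model_bases.masked_layers.masked_normalization_layers import MaskedBatchNorm1d

# 16 features/channels; remaining arguments keep the defaults shown in the signature above.
layer = MaskedBatchNorm1d(num_features=16)

x_2d = torch.randn(32, 16)       # (N, C)
x_3d = torch.randn(32, 16, 100)  # (N, C, L)

out_2d = layer(x_2d)             # output has the same shape as the input: (32, 16)
out_3d = layer(x_3d)             # output has the same shape as the input: (32, 16, 100)
```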
- class MaskedBatchNorm2d(num_features, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True, device=None, dtype=None)[source]¶
Bases: _MaskedBatchNorm
Applies (masked) Batch Normalization over a 4D input (a mini-batch of 2D inputs with additional channel dimension).
- class MaskedBatchNorm3d(num_features, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True, device=None, dtype=None)[source]¶
Bases: _MaskedBatchNorm
Applies (masked) Batch Normalization over a 5D input (a mini-batch of 3D inputs with additional channel dimension).
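For the 4D and 5D cases, a shape sketch assuming the standard nn.BatchNorm2d / nn.BatchNorm3d input conventions of (N, C, H, W) and (N, C, D, H, W):

```python
import torch
from fl4health.model_bases.masked_layers.masked_normalization_layers import (
    MaskedBatchNorm2d,
    MaskedBatchNorm3d,
)

bn2d = MaskedBatchNorm2d(num_features=3)
bn3d = MaskedBatchNorm3d(num_features=3)

out_2d = bn2d(torch.randn(8, 3, 28, 28))      # (N, C, H, W) -> (8, 3, 28, 28)
out_3d = bn3d(torch.randn(8, 3, 16, 28, 28))  # (N, C, D, H, W) -> (8, 3, 16, 28, 28)
```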
- class MaskedLayerNorm(normalized_shape, eps=1e-05, elementwise_affine=True, bias=True, device=None, dtype=None)[source]¶
Bases: LayerNorm
- __init__(normalized_shape, eps=1e-05, elementwise_affine=True, bias=True, device=None, dtype=None)[source]¶
Implementation of the masked Layer Normalization module. When elementwise_affine is True, nn.LayerNorm has a learnable weight and (optional) bias. For MaskedLayerNorm, the weight and bias do not receive gradients during backpropagation. Instead, two score tensors - one for the weight and another for the bias - are maintained. In the forward pass, the score tensors are transformed by the Sigmoid function into probability scores, which are then used to produce binary masks via Bernoulli sampling. Finally, the binary masks are applied to the weight and the bias. During training, gradients with respect to the score tensors are computed and used to update the score tensors.
When elementwise_affine is False, nn.LayerNorm does not have weight or bias. Under this condition, both score tensors are None and MaskedLayerNorm acts in the same way as nn.LayerNorm.
Note: the scores are not assumed to be bounded between 0 and 1.
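A minimal sketch of the forward-pass masking described above (assumed, not the library's exact implementation; in particular, the estimator used to backpropagate through the Bernoulli sample is not shown):

```python
import torch

# A learnable score tensor is mapped to probabilities with a sigmoid, a binary
# mask is drawn via Bernoulli sampling, and the mask is applied elementwise to
# the frozen weight. The real module presumably uses a straight-through-style
# estimator so that gradients reach the scores, which this sketch omits.
weight = torch.ones(64)                             # frozen nn.LayerNorm-style weight
weight_score = torch.randn(64, requires_grad=True)  # learnable score tensor

keep_prob = torch.sigmoid(weight_score)  # scores -> probabilities in (0, 1)
mask = torch.bernoulli(keep_prob)        # per-element binary mask
masked_weight = weight * mask            # weight actually used in normalization
```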
- Parameters:
  - normalized_shape (TorchShape) – input shape from an expected input. If a single integer is used, it is treated as a singleton list, and this module will normalize over the last dimension, which is expected to be of that specific size.
  - eps (float) – a value added to the denominator for numerical stability. Default: 1e-5.
  - elementwise_affine (bool) – a boolean value that, when set to True, gives this module learnable per-element affine parameters initialized to ones (for weights) and zeros (for biases). Default: True.
  - bias (bool) – if set to False, the layer will not learn an additive bias (only relevant if elementwise_affine is True). Default: True.
- weight¶
The weights of the module. The values are initialized to 1.
- bias¶
The bias of the module. The values are initialized to 0.
- weight_score¶
learnable scores for the weights. Has the same shape as weight. When applied to the default initial values of self.weight (i.e., all ones), this is equivalent to randomly dropping out certain features.
- bias_score¶
learnable scores for the bias. Has the same shape as bias. When applied to the default initial values of self.bias (i.e., all zeros), it does not have any actual effect. Thus, bias_score only influences training when MaskedLayerNorm is created from some pretrained nn.LayerNorm module whose bias is not all zeros.
- forward(input)[source]¶
Define the computation performed at every call.
Should be overridden by all subclasses.
Return type: Tensor
Note
Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
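A short sketch of the preferred call pattern (calling the module instance rather than forward directly):

```python
import torch
from fl4health.model_bases.masked_layers.masked_normalization_layers import MaskedLayerNorm

layer = MaskedLayerNorm(normalized_shape=64)
x = torch.randn(32, 64)

out = layer(x)              # preferred: runs any registered hooks
# out = layer.forward(x)    # discouraged: silently skips registered hooks
```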