olm.nn.norms.rms_norm¶

Classes¶

`RMSNorm`(args, *kwargs)	RMSNorm (Root Mean Square Layer Normalization) layer.

class olm.nn.norms.rms_norm.NormBase(*args: Any, **kwargs: Any)¶

Bases: Module, ABC

Abstract base class for normalization layers (e.g., LayerNorm, RMSNorm).

Standardizes the interface for normalization, ensuring all implementations handle model dimension, device, and dtype consistently.

d_model¶

The dimension of the input features to normalize.

Type: int

device¶

The device the module is on.

Type: torch.device, optional

dtype¶

The data type of the module parameters.

Type: torch.dtype

abstractmethod forward(x: torch.Tensor) → torch.Tensor¶

Apply normalization to the input tensor.

class olm.nn.norms.rms_norm.RMSNorm(*args: Any, **kwargs: Any)¶

Bases: NormBase

RMSNorm (Root Mean Square Layer Normalization) layer.

Implements RMSNorm as described in “Root Mean Square Layer Normalization” (https://arxiv.org/abs/1910.07467). A simplified version of LayerNorm that scales invariance properties.

Parameters:
d_model (int) – The dimension of the model to normalize.
eps (float , optional) – Small constant for numerical stability. Defaults to 1e-5.
device (torch.device , optional) – Target device.
dtype (torch.dtype , optional) – Target data type.

weight¶

Learnable scale parameter.

Type: nn.Parameter

forward(x: torch.Tensor) → torch.Tensor¶

Forward pass of RMSNorm.

Parameters: x (torch.Tensor) – Input tensor of shape (batch_size, sequence_length, d_model).
Returns: Normalized output tensor of the same shape.
Return type: torch.Tensor