`olm.models.google.gemma2`

Source: src/olm/models/google/gemma2.py:1

Classes

`Gemma2Block(embed_dim: int, intermediate_size: int, num_heads: int, num_kv_heads: int, max_seq_len: int, dropout: float, rope_theta: float, head_dim: int, sliding_window: int | None = 4096, attn_logit_softcap: float | None = 50.0, query_pre_attn_scalar: float | None = 256.0)`

Bases: olm.nn.structure.block.Block

Source: src/olm/models/google/gemma2.py:39

A single Transformer block for Gemma 2.

Implements the "Sandwich" Normalization pattern: Norm -> Attn -> Norm -> Residual -> Norm -> MLP -> Norm -> Residual
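The ordering of the pattern above can be illustrated with a minimal, dependency-free sketch. This is not the code in `gemma2.py`: the names `rms_norm` and `sandwich_block` are hypothetical, the normalization omits the learned scale, and the attention and MLP sublayers are passed in as plain callables so that only the placement of the norms and residual additions is shown.

```python
import math
from typing import Callable, List

Vector = List[float]


def rms_norm(x: Vector, eps: float = 1e-6) -> Vector:
    """Unweighted RMS normalization (learned scale omitted for brevity)."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [v / rms for v in x]


def sandwich_block(
    x: Vector,
    attn: Callable[[Vector], Vector],
    mlp: Callable[[Vector], Vector],
) -> Vector:
    """Hypothetical sketch of one sandwich-normalized block on a single vector."""
    # Norm -> Attn -> Norm -> Residual
    h = rms_norm(attn(rms_norm(x)))
    x = [a + b for a, b in zip(x, h)]
    # Norm -> MLP -> Norm -> Residual
    h = rms_norm(mlp(rms_norm(x)))
    return [a + b for a, b in zip(x, h)]
```

Each sublayer is normalized on both its input and its output, so the tensor added back onto the residual stream has a controlled scale regardless of what the sublayer produces.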

Methods

`forward(self, x)`

Source: src/olm/models/google/gemma2.py:86

`Gemma2Embedding(vocab_size: int, embedding_dim: int)`

Bases: olm.nn.embeddings.token_embed.Embedding

Source: src/olm/models/google/gemma2.py:15

Gemma 2 token embedding with hidden-size scaling.
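As a rough illustration of what hidden-size scaling usually means, the sketch below multiplies each looked-up embedding row by the square root of the embedding dimension. The exact factor is an assumption based on the published Gemma reference implementations, not something this page states, and `scaled_embedding_lookup` is a hypothetical helper that operates on plain lists rather than tensors.

```python
import math
from typing import List


def scaled_embedding_lookup(
    table: List[List[float]], token_ids: List[int]
) -> List[List[float]]:
    """Look up rows of `table` and scale them by sqrt(embedding_dim).

    Assumption: the scale factor is sqrt(embedding_dim).
    """
    embedding_dim = len(table[0])
    scale = math.sqrt(embedding_dim)
    return [[v * scale for v in table[t]] for t in token_ids]
```

For example, with a 4-dimensional table the scale factor is 2.0, so every component of each selected row is doubled.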

Methods

`forward(self, x)`

Source: src/olm/models/google/gemma2.py:22

`Gemma2FinalLogitSoftcap(softcap: float | None = 30.0)`

Bases: Module

Source: src/olm/models/google/gemma2.py:26

Gemma 2 final logit soft-capping.
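Soft-capping in Gemma 2 is commonly written as `softcap * tanh(logits / softcap)`, which keeps small logits nearly unchanged while bounding large ones to the range [-softcap, softcap]. The sketch below shows that formula on a plain list of floats. The function name is hypothetical, and treating `softcap=None` as a pass-through is an assumption inferred from the `float | None` type in the signature.

```python
import math
from typing import List, Optional


def final_logit_softcap(
    logits: List[float], softcap: Optional[float] = 30.0
) -> List[float]:
    """Apply tanh soft-capping; return the logits unchanged if softcap is None."""
    if softcap is None:
        # Assumption: None disables capping.
        return list(logits)
    return [softcap * math.tanh(v / softcap) for v in logits]
```

Unlike a hard clamp, the tanh form is smooth everywhere, so gradients do not vanish abruptly at the cap.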

Methods

`forward(self, logits)`

Source: src/olm/models/google/gemma2.py:33

`Gemma2Model(vocab_size: int, embed_dim: int, intermediate_size: int, num_layers: int, num_heads: int, num_kv_heads: int, head_dim: int, max_seq_len: int, rope_theta: float = 10000.0, dropout: float = 0.0, sliding_window: int | None = 4096, attn_logit_softcap: float | None = 50.0, final_logit_softcap: float | None = 30.0, query_pre_attn_scalar: float | None = 256.0, tie_weights: bool = True)`

Bases: olm.nn.structure.block.Block

Source: src/olm/models/google/gemma2.py:108

Base class for Gemma 2 models.

Structure

Scaled token embedding -> [Gemma2Block] x N -> RMSNorm -> OutputHead (weights tied when `tie_weights=True`, the default) -> optional final logit softcap.

Forward

Accepts token IDs shaped [batch, seq_len] and returns logits shaped [batch, seq_len, vocab_size].

Methods

`forward(self, x: torch.Tensor) -> torch.Tensor` (inherited from `Block`)

Source: src/olm/nn/structure/block.py:26

Apply each block to the input in sequence.

Parameters

x: Input tensor.

Returns

Output tensor after all blocks have been applied.

`Gemma2_27B()`

Bases: olm.models.google.gemma2.Gemma2Model

Source: src/olm/models/google/gemma2.py:175

Gemma 2 27B Model.

Methods

`forward(self, x: torch.Tensor) -> torch.Tensor` (inherited from `Block`)

Source: src/olm/nn/structure/block.py:26

Apply each block to the input in sequence.

Parameters

x: Input tensor.

Returns

Output tensor after all blocks have been applied.

`Gemma2_2B()`

Bases: olm.models.google.gemma2.Gemma2Model

Source: src/olm/models/google/gemma2.py:209

Gemma 2 2B Model.

Methods

`forward(self, x: torch.Tensor) -> torch.Tensor` (inherited from `Block`)

Source: src/olm/nn/structure/block.py:26

Apply each block to the input in sequence.

Parameters

x: Input tensor.

Returns

Output tensor after all blocks have been applied.

`Gemma2_9B()`

Bases: olm.models.google.gemma2.Gemma2Model

Source: src/olm/models/google/gemma2.py:192

Gemma 2 9B Model.

Methods

`forward(self, x: torch.Tensor) -> torch.Tensor` (inherited from `Block`)

Source: src/olm/nn/structure/block.py:26

Apply each block to the input in sequence.

Parameters

x: Input tensor.

Returns

Output tensor after all blocks have been applied.
