easydel.modules.exaone.modeling_exaone
- class easydel.modules.exaone.modeling_exaone.ExaoneAttention(*args: Any, **kwargs: Any)
Bases:
Module
Wrapper around ExaoneAttentionInner that wires it into decoder layers.
- class easydel.modules.exaone.modeling_exaone.ExaoneAttentionInner(*args: Any, **kwargs: Any)
Bases:
UnifiedAttention
Exaone attention with partial RoPE.
- projection_mapping: ClassVar[dict[str, str]] = {'key_projection': 'k_proj', 'output_projection': 'out_proj', 'qkv_projection': 'qkv_proj', 'query_projection': 'q_proj', 'value_projection': 'v_proj'}
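The following is a minimal sketch of what "partial RoPE" generally means: only the first rotary_dim entries of each head vector are rotated, and the rest pass through unchanged. It is not taken from the EasyDeL source. The function names, the default base of 10000, and the half-split pairing of dimensions are assumptions made for illustration; the actual rotary fraction and pairing used by ExaoneAttentionInner are not stated here.

```python
import math


def rope_frequencies(rotary_dim, base=10000.0):
    """Inverse frequencies, one per rotation pair (hypothetical helper)."""
    return [base ** (-2.0 * i / rotary_dim) for i in range(rotary_dim // 2)]


def apply_partial_rope(vec, position, rotary_dim, base=10000.0):
    """Rotate the first `rotary_dim` entries of `vec`; leave the tail as-is.

    Assumption: dimension i is paired with dimension i + rotary_dim // 2
    (the "half-split" convention). Other implementations pair (2i, 2i + 1).
    """
    if rotary_dim % 2 != 0 or rotary_dim > len(vec):
        raise ValueError("rotary_dim must be even and at most len(vec)")
    half = rotary_dim // 2
    out = list(vec)  # copy, so the non-rotary tail is carried over untouched
    for i, freq in enumerate(rope_frequencies(rotary_dim, base)):
        angle = position * freq
        cos, sin = math.cos(angle), math.sin(angle)
        x1, x2 = vec[i], vec[i + half]
        out[i] = x1 * cos - x2 * sin
        out[i + half] = x1 * sin + x2 * cos
    return out


if __name__ == "__main__":
    head = [1.0, 0.0, 0.5, -0.5, 2.0, 3.0]
    # Rotate only the first 4 of 6 dimensions for the token at position 3.
    print(apply_partial_rope(head, position=3, rotary_dim=4))
```

Because each pair is rotated by a plain 2D rotation, the norm of the rotated slice is preserved, and the untouched tail carries position-independent information.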
- class easydel.modules.exaone.modeling_exaone.ExaoneDecoderLayer(*args: Any, **kwargs: Any)
Bases:
Module
Single Exaone decoder block combining attention and a gated MLP.
- class easydel.modules.exaone.modeling_exaone.ExaoneForCausalLM(*args: Any, **kwargs: Any)
Bases:
BaseCausalLMModule[ExaoneModel, ExaoneConfig]
Exaone model with a language modeling head for causal language modeling tasks.
- class easydel.modules.exaone.modeling_exaone.ExaoneForSequenceClassification(*args: Any, **kwargs: Any)
Bases:
BaseSequenceClassificationModule[ExaoneModel, ExaoneConfig]
Exaone model with a sequence classification head.
- class easydel.modules.exaone.modeling_exaone.ExaoneGatedMLP(*args: Any, **kwargs: Any)
Bases:
Module
Gated feed-forward block used inside Exaone decoder layers.
- class easydel.modules.exaone.modeling_exaone.ExaoneModel(*args: Any, **kwargs: Any)
Bases:
EasyDeLBaseModule
Decoder-only Exaone transformer composed of an embedding layer, stacked decoder blocks, and a final norm.
- property frequencies
Retrieves or computes the frequency components (e.g., for RoPE) from the configuration.
Uses self.config.get_basic_frequencies() and caches the result.
- Returns
The frequency components, potentially cached.
- Return type
jnp.ndarray
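The compute-once-then-cache behaviour described for frequencies can be sketched as below. This is an illustration of the pattern only, not EasyDeL code: ToyConfig, ToyModel, the calls counter, and the body of get_basic_frequencies are hypothetical stand-ins, and plain Python lists are used in place of jnp.ndarray so the sketch runs without JAX. Only the method name get_basic_frequencies comes from the description above.

```python
from functools import cached_property


class ToyConfig:
    """Hypothetical stand-in for a model config that can build RoPE tables."""

    def __init__(self, head_dim, max_positions, base=10000.0):
        self.head_dim = head_dim
        self.max_positions = max_positions
        self.base = base
        self.calls = 0  # counts how often the table is actually computed

    def get_basic_frequencies(self):
        # Assumed body: a [position][pair] table of rotation angles.
        self.calls += 1
        inv_freq = [
            self.base ** (-2.0 * i / self.head_dim)
            for i in range(self.head_dim // 2)
        ]
        return [[pos * f for f in inv_freq] for pos in range(self.max_positions)]


class ToyModel:
    """Hypothetical model exposing a cached `frequencies` property."""

    def __init__(self, config):
        self.config = config

    @cached_property
    def frequencies(self):
        # Computed on first access, then stored on the instance.
        return self.config.get_basic_frequencies()


if __name__ == "__main__":
    model = ToyModel(ToyConfig(head_dim=8, max_positions=16))
    table = model.frequencies      # triggers the computation
    same_table = model.frequencies  # served from the cache
    print(table is same_table, model.config.calls)
```

Caching matters here because the table depends only on the configuration, so recomputing it on every forward pass would be wasted work.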