easydel.layers.attention_operator.modules.__init__
- class easydel.layers.attention_operator.modules.__init__.FlashAttn(metadata: AttentionMetadata)
Bases: AttentionImpl
- forward_cpu(*args, **kwargs) -> AttentionOutput
- forward_cuda(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, causal: bool = False, **ignore) -> AttentionOutput
- forward_gpu(*args, **kwargs) -> AttentionOutput
- forward_native(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, causal: bool = False, **ignore) -> AttentionOutput
- forward_rocm(*args, **kwargs) -> AttentionOutput
- forward_tpu(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, causal: bool = False, **ignore) -> AttentionOutput
- get_impl_metadata() -> AttentionMetadata
- class easydel.layers.attention_operator.modules.__init__.RingAttn(metadata: AttentionMetadata)
Bases: AttentionImpl
- forward_cpu(*args, **kwargs) -> AttentionOutput
- forward_cuda(*args, **kwargs) -> AttentionOutput
- forward_gpu(*args, **kwargs) -> AttentionOutput
- forward_native(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, deterministic: bool = False, dropout_rng: Optional[PRNGKey] = None, causal: bool = True, **ignore) -> AttentionOutput
- forward_rocm(*args, **kwargs) -> AttentionOutput
- forward_tpu(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, deterministic: bool = False, dropout_rng: Optional[PRNGKey] = None, causal: bool = True, **ignore) -> AttentionOutput
- get_impl_metadata() -> AttentionMetadata
- class easydel.layers.attention_operator.modules.__init__.ScaledDotProductAttn(metadata: AttentionMetadata)
Bases: AttentionImpl
- forward_cpu(*args, **kwargs) -> AttentionOutput
- forward_cuda(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, causal: bool = False, **ignore) -> AttentionOutput
- forward_gpu(*args, **kwargs) -> AttentionOutput
- forward_native(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, causal: bool = False) -> AttentionOutput
- forward_rocm(*args, **kwargs) -> AttentionOutput
- forward_tpu(*args, **kwargs) -> AttentionOutput
- get_impl_metadata() -> AttentionMetadata
- class easydel.layers.attention_operator.modules.__init__.SplashAttn(metadata: AttentionMetadata)
Bases: AttentionImpl
- forward_cpu(*args, **kwargs) -> AttentionOutput
- forward_cuda(*args, **kwargs) -> AttentionOutput
- forward_gpu(*args, **kwargs) -> AttentionOutput
- forward_native(*args, **kwargs) -> AttentionOutput
- forward_rocm(*args, **kwargs) -> AttentionOutput
- forward_tpu(q: Array, k: Array, v: Array, mask: Optional[Array] = None, causal: bool = True, **ignore) -> AttentionOutput
- get_impl_metadata() -> AttentionMetadata
- class easydel.layers.attention_operator.modules.__init__.VanillaAttn(metadata: AttentionMetadata)
Bases: AttentionImpl
- forward_cpu(*args, **kwargs) -> AttentionOutput
- forward_cuda(*args, **kwargs) -> AttentionOutput
- forward_gpu(*args, **kwargs) -> AttentionOutput
- forward_native(q: Array, k: Array, v: Array, mask: Optional[Array] = None, bias: Optional[Array] = None, init_bias: Optional[Callable[[], Array]] = None, deterministic: bool = False, dropout_rng: Optional[PRNGKey] = None, **ignore) -> AttentionOutput
- forward_rocm(*args, **kwargs) -> AttentionOutput
- forward_tpu(*args, **kwargs) -> AttentionOutput
- get_impl_metadata() -> AttentionMetadata
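
The forward_native signatures listed on this page share the arguments q, k, v, mask, bias and causal. As a rough illustration of what those arguments conventionally mean in softmax attention, the sketch below is a standalone JAX reference. It is not EasyDeL's implementation and does not call any EasyDeL API: the function name reference_attention, the [batch, seq, heads, head_dim] layout, the boolean "True means attend" mask convention, and the equal query/key length assumed for the causal mask are all assumptions made for this example.

```python
import jax
import jax.numpy as jnp


def reference_attention(q, k, v, mask=None, bias=None, causal=False):
    """Plain softmax attention (hypothetical helper, not an EasyDeL function).

    Assumed layout: q, k, v are [batch, seq, heads, head_dim].
    Assumed mask convention: boolean, True = attend, broadcastable to
    [batch, heads, q_len, k_len]. bias is additive with the same broadcast shape.
    """
    scale = q.shape[-1] ** -0.5
    # scores[b, h, i, j]: scaled dot product of query position i and key position j
    scores = jnp.einsum("bqhd,bkhd->bhqk", q, k) * scale
    if bias is not None:
        scores = scores + bias
    q_len, k_len = scores.shape[-2], scores.shape[-1]
    keep = jnp.ones((q_len, k_len), dtype=bool)
    if causal:
        # Lower-triangular mask: position i may only attend to positions j <= i
        keep = jnp.tril(keep)
    if mask is not None:
        keep = jnp.logical_and(keep, mask)
    # Masked-out scores get the most negative finite value, so softmax sends them to ~0
    scores = jnp.where(keep, scores, jnp.finfo(scores.dtype).min)
    weights = jax.nn.softmax(scores, axis=-1)
    return jnp.einsum("bhqk,bkhd->bqhd", weights, v)


# Small usage example with random inputs
kq, kk, kv = jax.random.split(jax.random.PRNGKey(0), 3)
q = jax.random.normal(kq, (2, 8, 4, 16))
k = jax.random.normal(kk, (2, 8, 4, 16))
v = jax.random.normal(kv, (2, 8, 4, 16))
out = reference_attention(q, k, v, causal=True)
print(out.shape)  # (2, 8, 4, 16)
```

With causal=True the first query position can only attend to the first key, so out[:, 0] equals v[:, 0]; this is a convenient sanity check when comparing a backend-specific kernel against a reference such as this one.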