easydel.layers.quantization.linear_8bit
- class easydel.layers.quantization.linear_8bit.Linear8bit(*args: Any, **kwargs: Any)
Bases: QauntModule
An 8-bit quantized version of the linear transformation applied over the last dimension of the input.
Uses eformer’s Array8B implicit array for efficient 8-bit quantization.
- classmethod from_linear(linear: Linear, rngs: flax.nnx.rnglib.Rngs | None = None, **kwargs) Linear8bit
Create a Linear8bit module from a regular Linear module.
- to_linear(rngs: flax.nnx.rnglib.Rngs | None = None) Linear
Convert this Linear8bit module back to a regular Linear module.
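Example (illustrative sketch): the round trip described by from_linear and to_linear can be pictured with the NumPy code below. It does not use EasyDeL or eformer, and it is not the Array8B implementation. The classes DenseSketch and Dense8bitSketch are hypothetical stand-ins, and the symmetric per-output-column absmax scheme is an assumption chosen for clarity; the scheme eformer actually applies may differ.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class DenseSketch:
    """Hypothetical stand-in for a regular linear layer: y = x @ kernel + bias."""

    kernel: np.ndarray  # float32, shape (in_features, out_features)
    bias: np.ndarray  # float32, shape (out_features,)

    def __call__(self, x: np.ndarray) -> np.ndarray:
        # The transformation acts on the last dimension of x.
        return x @ self.kernel + self.bias


@dataclass
class Dense8bitSketch:
    """Hypothetical 8-bit counterpart storing int8 weights plus float scales."""

    q_kernel: np.ndarray  # int8, shape (in_features, out_features)
    scale: np.ndarray  # float32, shape (1, out_features)
    bias: np.ndarray  # float32, shape (out_features,)

    @classmethod
    def from_linear(cls, linear: DenseSketch) -> "Dense8bitSketch":
        # Assumed scheme: symmetric absmax quantization, one scale per output column.
        absmax = np.max(np.abs(linear.kernel), axis=0, keepdims=True)
        scale = np.where(absmax > 0, absmax / 127.0, 1.0).astype(np.float32)
        q_kernel = np.clip(np.round(linear.kernel / scale), -127, 127).astype(np.int8)
        return cls(q_kernel=q_kernel, scale=scale, bias=linear.bias)

    def dequantized_kernel(self) -> np.ndarray:
        # Recover an approximate float32 kernel from the int8 values and scales.
        return self.q_kernel.astype(np.float32) * self.scale

    def to_linear(self) -> DenseSketch:
        # The result is a full-precision layer, but quantization error remains.
        return DenseSketch(kernel=self.dequantized_kernel(), bias=self.bias)

    def __call__(self, x: np.ndarray) -> np.ndarray:
        return x @ self.dequantized_kernel() + self.bias


rng = np.random.default_rng(0)
dense = DenseSketch(
    kernel=rng.standard_normal((16, 4)).astype(np.float32),
    bias=np.zeros(4, dtype=np.float32),
)
dense8 = Dense8bitSketch.from_linear(dense)
restored = dense8.to_linear()

x = rng.standard_normal((2, 16)).astype(np.float32)
max_weight_error = float(np.max(np.abs(restored.kernel - dense.kernel)))
print("largest absolute weight error after the round trip:", max_weight_error)
```

Under this assumed scheme the conversion back to a regular layer is lossy: each restored weight can differ from the original by up to half of its column's scale, so the restored layer approximates the original rather than reproducing it exactly.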