Paged Attention#
- easydel.layers.caching.paged_attention.__init__
ActiveSequenceBatchAllocatedPrefillPagesGenerationStepTaskHBMPageManagerInferenceSchedulerInitialSequenceRequestModelIOProcessorModelInputBatchModelOutputBatchModelOutputSummaryNextIterationPlanPagedAttentionCachePagedAttentionCacheMetaDataPagedAttentionCacheViewPagedAttentionMetadataSamplingParamsSlotPageAssignment
- easydel.layers.caching.paged_attention.paged_attention_cache
- easydel.layers.caching.paged_attention.paged_attention_cache_test