vllm.v1.worker.gpu.kv_connector ¶
ActiveKVConnector ¶
Bases: KVConnector
Source code in vllm/v1/worker/gpu/kv_connector.py
clear_metadata ¶
Clear the connector metadata. Call this after draft model runs.
KVConnector ¶
KVConnector interface used by GPUModelRunner.