Disaggregated Inference on Cognaptus

Recent content in Disaggregated Inference on Cognaptus
https://cognaptus.com/tags/disaggregated-inference/

The KV Cache Is Not a Detail: Why LLM Compression Needs a Control Plane
Wed, 27 May 2026 00:00:00 +0000
https://cognaptus.com/blog/2026-05-27-the-kv-cache-is-not-a-detail-why-llm-compression-needs-a-control-plane/
KVServe shows why KV cache compression in disaggregated LLM serving should be treated as service-aware control, not a static infrastructure tweak.