Batching on Cognaptus

Recent content in Batching on Cognaptus
https://cognaptus.com/tags/batching/

Mixed Feelings: When LLM Batching Stops Being Obviously Better
Sat, 13 Jun 2026
https://cognaptus.com/blog/2026-06-13-mixed-feelings-when-llm-batching-stops-being-obviously-better/
A systems paper shows why mixed batching is not a universal default for LLM inference, and why bandwidth-aware scheduling may matter more than scheduler fashion.