Rank and File: MatryoshkaLoRA Turns One Adapter into Many

A mechanism-first reading of MatryoshkaLoRA, showing why one diagonal training weight can make LoRA adapters usable across multiple deployment ranks.

May 27, 2026 · 17 min · Zelina

The Edge Case for LLM Routing: Why Cheap Local Inference Needs a Risk Gate

CR2 shows why mobile-edge LLM routing is not just model selection with a smaller model attached, but a two-stage deployment problem where local confidence, wireless cost, and risk control must be designed together.

May 27, 2026 · 15 min · Zelina

The Experts Are Sparse Inside: Why MoE Cost Cuts Stop at 1.2x

A mechanism-first reading of intra-expert activation sparsity in MoE models, and why large theoretical sparsity becomes modest but useful inference savings in production.

May 27, 2026 · 16 min · Zelina

The KV Cache Is Not a Detail: Why LLM Compression Needs a Control Plane

KVServe shows why KV cache compression in disaggregated LLM serving should be treated as service-aware control, not a static infrastructure tweak.

May 27, 2026 · 15 min · Zelina

AdamW and the Cost of Being Reasonable: Choosing LLM Optimizers Without Leaderboard Theater

A business-facing reading of why LLM optimizer choice is less about replacing AdamW and more about trading memory, stability, wall-clock time, and hardware fit.

May 26, 2026 · 16 min · Zelina

No More Low-Rank Detours: GPart and the Geometry of Fine-Tuning

A mechanism-first reading of GPart, a PEFT method that replaces LoRA’s bilinear adapter detour with a direct isometric map into model weight space.

May 26, 2026 · 15 min · Zelina

RL Needs a Menu, Not a Miracle

A recent arXiv paper shows why reinforcement learning works better when a model has already seen multiple verified ways to solve the same problem.

May 25, 2026 · 14 min · Zelina

The Heart of the Model: ECG Foundation Models Need the Right Backbone Before More Data

A systematic ECG foundation-model study shows why architecture fit and pretraining objective matter more than fashionable scale alone.

May 24, 2026 · 14 min · Zelina

Red Queen Receipts: AI Security Testing Needs Logs, Not Vibes

AVISE shows why AI security evaluation should move from one-off jailbreak anecdotes toward repeatable, auditable test pipelines.

May 22, 2026 · 14 min · Zelina

Context Is the New Attack Surface

A business-focused reading of Jailbreak Mimicry, explaining why LLM safety failures often live in task framing rather than forbidden words.

May 16, 2026 · 13 min · Zelina