Speculation, But With Standards: Training Draft Models That Actually Get Accepted

VSD shows why speculative decoding improves when draft models are trained for accepted paths, not merely probable tokens.

February 8, 2026 · 13 min · Zelina

Tokens, Watts, and Waste: The Hidden Energy Bill of LLM Inference

A mechanism-first reading of why LLM inference energy is shaped by prefill, decoding, prompt length, and unnecessary generation—not merely model size.

February 8, 2026 · 14 min · Zelina

Ultra‑Sparse Embeddings Without Apology

CSRv2 shows that ultra-sparse embeddings fail less because sparsity is impossible, and more because we have been training them badly.

February 8, 2026 · 19 min · Zelina

When Words Start Walking: Rethinking Semantic Search Beyond Averages

A comparison-based reading of why Word Mover’s Distance with GloVe outperforms centroid-style semantic search in statement-level retrieval, and where that lesson actually applies in business systems.

February 8, 2026 · 15 min · Zelina

Benchmarks Lie, Rooms Don’t: Why Embodied AI Fails the Moment It Enters Your House

A mechanism-first reading of TEA, an in-situ task-generation framework showing why embodied AI needs environment-specific evaluation before deployment.

February 7, 2026 · 17 min · Zelina

Beyond Cosine: When Order Beats Angle in Embedding Similarity

A business-focused reading of recos, a Rearrangement Inequality-based similarity metric that tests whether embedding similarity should care about ordered structure, not only vector angle.

February 7, 2026 · 14 min · Zelina

First Proofs, No Training Wheels

Why unpublished research lemmas expose the difference between fluent mathematical performance and proof-grade AI reasoning.

February 7, 2026 · 15 min · Zelina

Hallucination-Resistant Security Planning: When LLMs Learn to Say No

A mechanism-first reading of how abstention, lookahead, and feedback turn LLM incident-response planning from fluent guessing into calibrated decision support.

February 7, 2026 · 18 min · Zelina

When AI Forgets on Purpose: Why Memorization Is the Real Bottleneck

A mechanism-first analysis of how attention sinks can reveal and suppress harmful learning during LLM fine-tuning.

February 7, 2026 · 15 min · Zelina

When One Heatmap Isn’t Enough: Layered XAI for Brain Tumour Detection

A mechanism-first reading of why combining Grad-CAM, LRP, and SHAP can turn medical AI explanations from decorative heatmaps into a practical assurance layer.

February 7, 2026 · 17 min · Zelina