Cover image

If Logic Were Enough: Why LLMs Still Miss the Point of Conditionals

A promise is rarely just a logical operator. “If you mow the lawn, I’ll give you 50 dollars” does not sound like a philosophical exercise in truth tables. It sounds like a deal. Most people hear it as: no mowing, no money. By contrast, “If you’re hungry, there’s pizza in the oven” does not mean the pizza appears only under the metaphysical condition of your hunger. It means the pizza is there, and your hunger merely explains why I am telling you. ...

May 29, 2026 · 16 min · Zelina
Cover image

Red Queen Receipts: AI Security Testing Needs Logs, Not Vibes

Security testing is not a screenshot. A model gives a dangerous answer. Someone posts the transcript. A vendor says the model has been updated. A consultant turns the incident into a slide titled “AI risk is real.” Everyone nods gravely. Very mature. Very enterprise. The harder question is less theatrical: can the same vulnerability be tested again, under controlled conditions, with visible logs, a consistent evaluator, repeatable statistics, and enough human inspection to make the result defensible? ...

May 22, 2026 · 14 min · Zelina
Cover image

Think Less, Align Better: The New Economics of AI Reasoning

Opening — Why this matters now Enterprise AI is entering its mildly awkward teenage phase: everyone wants intelligence, nobody wants the invoice. For the last two years, much of the AI conversation has revolved around more: more context, more reasoning tokens, more chain-of-thought, more human feedback, more evaluators, more synthetic data, more agents, more dashboards to explain why the agents broke the dashboards. The operating assumption was simple enough: if the model thinks more, explains more, or trains on more feedback, it should perform better. ...

May 9, 2026 · 19 min · Zelina

The AI Stack in Plain English

A plain-English guide to the main layers of a modern AI system, from models and prompts to retrieval, tools, guardrails, and review.

April 23, 2026 · 6 min · Michelle
Cover image

CQ or Consequences: What This LLM Benchmark Reveals About AI Requirements Work

Requirements work has a reputation problem. It is rarely the part of an AI project that receives the keynote slide, the demo video, or the executive applause. Nobody opens a budget meeting by saying, “What we really need is a better way to ask the system what it must know.” They should, but apparently civilization still has limits. ...

April 22, 2026 · 17 min · Zelina
Cover image

CQ, AI & The Question of Questions

Questions look cheap. That is why they are dangerous. In most enterprise AI projects, the visible work arrives late: dashboards, RAG demos, knowledge graphs, compliance assistants, workflow copilots, and executive slides with arrows pointing to a “semantic layer.” The invisible work arrives earlier and is less glamorous: deciding what the system must actually know, answer, retrieve, distinguish, reject, and explain. ...

April 22, 2026 · 16 min · Zelina
Cover image

Graph RAG, No Smoke: Why Explainable AI in Manufacturing Needs a Memory

Factory AI has an old communication problem. The model can say, “this screw-placement attempt is likely to fail.” The operator then asks the obvious follow-up: “Because of what?” A dashboard answers with a probability. A SHAP plot answers with colored bars. A feature-importance chart answers with something that looks scientific enough to intimidate the meeting room into silence. None of these answers necessarily tells the worker, engineer, or manager what is connected to what: the screw geometry, the robot arm, the training dataset, the preprocessing step, the model, the task, and the explanation artifact. ...

April 22, 2026 · 15 min · Zelina
Cover image

When AI Learns the Trick First: Why Insight Beats Brute Force in Theorem Proving

The trick usually comes before the proof. That is not how most AI demos are staged, of course. The demo asks a model a difficult question, the model produces a long answer, and everyone pretends length is evidence of thought. Mathematics is less polite. A proof can be long, fluent, and wrong. It can also be short because the solver noticed the one move that makes the rest almost mechanical. ...

April 22, 2026 · 16 min · Zelina
Cover image

From Words to Workflows: Why AI Still Struggles to Think Like an Operations Research Analyst

A warehouse manager does not ask for “a constraint optimization problem.” She asks whether tomorrow’s orders can be shipped without overtime. A university administrator does not request “a mixed-integer formulation.” He asks whether lectures can be scheduled without room conflicts. A retail planner does not want “a MiniZinc model.” She wants to know which stores should receive scarce inventory before the promotion starts. ...

April 15, 2026 · 15 min · Zelina
Cover image

Thinking Fast, Remembering Slow: Why SWE-AGILE Fixes the Memory Crisis of AI Agents

Memory sounds like a storage problem. Give the agent a longer context window, let it keep the full conversation, and the work should become easier. This is the kind of solution that looks obvious until it meets a real software repository, a failing test suite, a long terminal log, and a model that now has to find one important clue buried somewhere in the middle of its own autobiography. ...

April 14, 2026 · 18 min · Zelina