Cover image

Mind Over Modules: How Smart Agents Learn What to See—and What to Be

TL;DR for operators Agentic AI is not only a model-selection problem. It is an environment-design problem. Two recent papers make that point from opposite ends of the stack. One studies LLM agents in a controlled repeated routing game and shows that the way history, rewards, and peer actions are represented can significantly change behaviour.1 The other proposes SwarmAgentic, a framework that automatically generates and optimises agent roles, execution policies, and collaboration structures using a language-based version of particle swarm optimisation.2 ...

June 19, 2025 · 14 min · Zelina
Cover image

Reflections in the Mirror Maze: Why LLM Reasoning Isn't Quite There Yet

TL;DR for operators Adding “reasoning” to an LLM agent is not the same as making it reason better. Wong et al. test four open-source models across dynamic SmartPlay tasks using a baseline prompt, reflection, reflection plus an Oracle that mutates heuristics, and reflection plus a Planner that simulates short future trajectories.1 The clean result is not “planning wins” or “bigger models win.” The result is more annoying, therefore more useful: the same scaffold can be a booster, a distraction, or a failure amplifier. ...

May 17, 2025 · 15 min · Zelina
Cover image

Guess How Much? Why Smart Devs Brag About Cheap AI Models

TL;DR for operators Cheap models are not a moral victory. They are useful when the surrounding system knows what to ask, how to check the answer, and when to escalate. The practical lesson from FrugalGPT and later model-routing research is that AI cost optimisation is less about picking one “best value” model and more about designing an inference pipeline that spends intelligence only where intelligence is needed.1 ...

March 30, 2025 · 17 min · Zelina
Cover image

Blind Trust, Fragile Brains: Why LoRA and Prompts Need a Confidence-Aware Backbone

TL;DR for operators LoRA and prompts are attractive because they make model adaptation feel almost too easy: add a few examples, attach a small adapter, nudge the model into a domain, and call it customised. The uncomfortable part is that adaptation changes not only what a model says, but how confidently it says it. A compliance assistant that becomes slightly more domain-specific but far more overconfident has not been improved. It has been promoted beyond its competence, a classic corporate move. ...

March 25, 2025 · 14 min · Zelina