Cover image

Greedy, but Not Blind: Teaching Optimization to Listen

Budget meetings have a familiar rhythm. Someone brings the spreadsheet. Someone brings the map. Someone else brings the sentence that ruins the spreadsheet: “This district looks inefficient on paper, but the roads are worse than the data says.” Classical optimization knows what to do with numbers. It does not naturally know what to do with that sentence. In public health planning, infrastructure rollout, retail site selection, and ESG investment, those sentences are often where the real institutional knowledge lives. Unfortunately, once the sentence enters the room, the algorithm usually leaves through the back door. Or worse, the organization pretends the sentence has been “encoded” into a weight, because apparently all human judgment becomes rigorous once it is multiplied by 0.37. ...

January 19, 2026 · 14 min · Zelina
Cover image

Explaining the Explainers: Why Faithful XAI for LLMs Finally Needs a Benchmark

Hiring. A candidate writes a personal statement. A screening model gives a score. A manager asks the AI system why. The explanation says work experience mattered most, education came next, and demographic variables barely moved the decision. Everyone relaxes, because the explanation sounds reasonable. That is the dangerous part. A reasonable explanation is not necessarily a faithful explanation. A counterfactual edit that looks plausible is not necessarily a causal counterfactual. And a model that appears insensitive to demographic concepts may not be “fair”; it may simply have learned, or been aligned, to suppress visible sensitivity in the narrow setting being tested. ...

January 17, 2026 · 15 min · Zelina
Cover image

When LLMs Stop Talking and Start Driving

Factory trouble usually begins in language. Not elegant language. Not the polished language of annual reports and transformation roadmaps. The useful trouble is buried in work orders, technician notes, supplier messages, inspection records, customer complaints, meeting minutes, and logs written by people who had better things to do than produce clean training data. ...

January 11, 2026 · 18 min · Zelina
Cover image

Model Cannibalism: When LLMs Learn From Their Own Echo

Feedback is usually sold as the civilized part of AI deployment. Users interact with the model. The product team collects prompts, outputs, ratings, usage logs, corrections, maybe a few thumbs-up signals. The model is fine-tuned. The next version is better. Everybody nods. A dashboard is opened. Someone says “continuous improvement.” The room relaxes. ...

January 9, 2026 · 19 min · Zelina
Cover image

When Three Examples Beat a Thousand GPUs

A GPU bill is usually treated as a hardware problem. Buy faster accelerators, shorten training runs, negotiate a better cloud contract. Less often asked is whether the expensive part of the pipeline began with a badly calibrated prompt. An LLM generating neural-network architectures can create thousands of candidates before training begins. If the prompt provides too little context, the model may repeatedly produce shallow variations of the same familiar design. Add more examples, and it may combine useful ideas across architectural families. Add still more, and the output can become worse, incomplete, or invalid. ...

January 3, 2026 · 15 min · Zelina
Cover image

Deployed, Retrained, Repeated: When LLMs Learn From Being Used

Acceptance is a reward, even when nobody writes reward = 1. Imagine an enterprise deploys an AI agent to generate code, reconcile invoices, or prepare operational plans. Some outputs pass automated checks and enter production. Others fail, disappear into logs, and are never seen again. Months later, the accepted outputs are collected and used to fine-tune the next model. ...

January 1, 2026 · 18 min · Zelina
Cover image

Forgetting That Never Happened: The Shallow Alignment Trap

Forgetting That Never Happened: The Shallow Alignment Trap Forgetfulness is an expensive diagnosis. When an internal AI system performs well on last month’s support taxonomy, then underperforms after being fine-tuned on this month’s compliance cases, the obvious story is simple: the model forgot. That story usually triggers an equally obvious response: replay old data, retrain more broadly, freeze more parameters, or panic politely in a meeting while calling it “model lifecycle management.” ...

December 27, 2025 · 17 min · Zelina
Cover image

Doctor GPT, But Make It Explainable

Triage begins with messy language. A patient does not usually arrive as a clean feature vector. They arrive with “I feel tired,” “my stomach is strange,” “I have fever but not always,” or the classic: “I searched online and now I am either fine or dying.” Traditional diagnostic models are not built for this level of human poetry. They prefer structured fields, stable vocabularies, and the fantasy that symptoms behave like dropdown menus. ...

December 22, 2025 · 15 min · Zelina
Cover image

Prompt-to-Parts: When Language Learns to Build

The compiler is the interesting part Blocks are easy to understand. That is why this paper is more interesting than it first looks. At the surface, Prompt-to-Parts: Generative AI for Physical Assembly and Scalable Instructions is a paper about using large language models to generate LEGO-style assemblies from natural language prompts.1 It shows a medieval castle, an International Space Station model, a modular multitool kit, and an image-to-parts helicopter conversion. Naturally, the tempting summary is: “LLMs can now design LEGO models.” ...

December 20, 2025 · 16 min · Zelina
Cover image

ID Crisis, Resolved: When Semantic IDs Stop Fighting Hash IDs

Catalogs have a boring problem. Most items are nearly invisible. A platform may have millions of products, posts, videos, restaurants, songs, or ads, but user interaction is never evenly distributed. A small number of head items collect enough clicks, saves, purchases, and dwell time to become statistically legible. The rest live in the long tail, where the system is expected to recommend them intelligently despite barely having seen them. Very democratic. Very inconvenient. ...

December 14, 2025 · 16 min · Zelina