Cover image

Kitchen Confidential: FoodMonitor and the Compliance AI Reality Check

Cameras are easy. Audits are not. That is the useful irritation inside FoodMonitor: Benchmarking MLLMs for Explainable Compliance Analysis, a new benchmark for testing multimodal large language models on commercial-kitchen compliance monitoring.1 The paper is not asking whether a model can watch a kitchen video and say something vaguely sensible about hygiene. Many systems can now do that, at least with enough confidence to impress a demo audience and mildly alarm the legal department. ...

June 13, 2026 · 15 min · Zelina
Cover image

From Genes to Memes: The Evolutionary Biology of Hugging Face's 2 Million Models

TL;DR for operators Open-model adoption is usually treated as procurement with a nicer download button: find a model, check the licence, skim the model card, run a few benchmarks, and move on. This paper makes that habit look under-specified. Laufer, Oderinwale, and Kleinberg analyse 1.86 million models on Hugging Face and reconstruct family trees linking models to fine-tuned, adapted, quantised, or merged descendants.1 The useful result is not merely that Hugging Face is large. We knew that. The useful result is that model lineages mutate their public governance signals as they spread. ...

August 12, 2025 · 19 min · Zelina
Cover image

Forgetting by Remembering: A Smarter Path to Machine Unlearning

TL;DR for operators Deletion sounds simple until the deleted record has already shaped millions of model parameters. The clean answer is to retrain the model without that record. The operational answer is usually less glamorous: nobody wants to burn a full training cycle every time a user, regulator, data-quality team, or security analyst says, “Remove this.” ...

August 1, 2025 · 16 min · Zelina