
Tunnel Vision, Literally: When Cropping Makes Multimodal Models Blind

Opening — Why this matters now. Multimodal Large Language Models (MLLMs) can reason, explain, and even philosophize about images—until they’re asked to notice something small. A number on a label. A word in a table. The relational context that turns a painted line into a parking space instead of a traffic lane. The industry’s default fix has been straightforward: crop harder, zoom further, add resolution. Yet performance stubbornly plateaus. This paper makes an uncomfortable but important claim: the problem is not missing pixels. It’s missing structure. ...

December 14, 2025 · 3 min · Zelina

Noisy but Wise: How Simple Noise Injection Beats Shortcut Learning in Medical AI

Opening — Why this matters now. In a world obsessed with bigger models and cleaner data, a modest paper from the University of South Florida offers a quiet counterpoint: what if making data noisier actually makes models smarter? In medical AI—especially when dealing with limited, privacy-constrained datasets—overfitting isn’t just a technical nuisance; it’s a clinical liability. A model that learns the quirks of one hospital’s X-ray machine instead of the biomarkers of COVID-19 could fail catastrophically in another ward. ...
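The excerpt stops short of the recipe, but the core move, perturbing training images with random noise so scanner-specific quirks stop being reliable shortcuts, can be sketched roughly as below. This is a minimal illustration assuming simple Gaussian pixel noise; the noise type, the `sigma` level, and the array shapes are placeholder assumptions, not the paper's actual configuration.

```python
import numpy as np

def inject_gaussian_noise(images, sigma=0.05, rng=None):
    """Add zero-mean Gaussian noise to a batch of images scaled to [0, 1].

    `sigma` is a hypothetical noise level; the excerpt does not specify
    the paper's actual noise distribution or magnitude.
    """
    rng = np.random.default_rng() if rng is None else rng
    noisy = images + rng.normal(0.0, sigma, size=images.shape)
    return np.clip(noisy, 0.0, 1.0)

# Example: augment a stand-in batch of single-channel X-ray crops.
batch = np.random.rand(8, 224, 224, 1)
augmented = inject_gaussian_noise(batch, sigma=0.05)
```

Applying the perturbation only at training time leaves evaluation untouched while denying the model a clean view of dataset-specific artifacts.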

November 9, 2025 · 3 min · Zelina

Spurious Minds: How Embedding Regularization Could Fix Bias at Its Roots

Why this matters now. Modern AI models are astonishingly good at pattern recognition—and dangerously bad at knowing which patterns matter. A neural network that labels birds can achieve 95% accuracy on paper yet collapse when the background changes from lake to desert. This fragility stems from spurious correlations—the model’s habit of linking labels to irrelevant cues like color, lighting, or background texture. The deeper the network, the deeper the bias embeds. ...
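As a rough illustration of what an embedding-level fix can look like, the sketch below adds a penalty that discourages the learned representation from covarying with a known spurious attribute (say, lake versus desert background). The decorrelation form, the `lam` weight, and the assumption that the spurious attribute is annotated per sample are illustrative choices; the post's actual regularizer may differ.

```python
import torch
import torch.nn.functional as F

def embedding_regularized_loss(embeddings, logits, labels, spurious_attr, lam=0.1):
    """Task loss plus a penalty on alignment between embeddings and a spurious cue.

    `spurious_attr` is assumed to be a per-sample binary annotation
    (e.g., 0 = lake background, 1 = desert background); this decorrelation
    penalty is an illustrative stand-in, not the post's exact method.
    """
    task_loss = F.cross_entropy(logits, labels)

    # Center embeddings and the spurious attribute, then penalize their
    # cross-covariance so the representation carries little background signal.
    z = embeddings - embeddings.mean(dim=0, keepdim=True)
    s = spurious_attr.float() - spurious_attr.float().mean()
    cross_cov = (z * s.unsqueeze(1)).mean(dim=0)  # one covariance per embedding dim
    return task_loss + lam * cross_cov.pow(2).sum()
```

In a training step this would replace the plain cross-entropy call, with `embeddings` of shape `[B, D]` and `logits` of shape `[B, C]` coming from the same forward pass.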

November 8, 2025 · 4 min · Zelina