Cover image

When Bigger Isn’t Smarter: Stress‑Testing LLMs in the ICU

A hospital does not buy “intelligence.” It buys a workflow. That distinction sounds obvious until an AI vendor arrives with a model that has billions of parameters, a clinical pretraining story, and the gentle implication that smaller models are now museum pieces. In the ICU, however, the useful question is not whether the model can talk like a doctor. It is whether it can detect tomorrow’s clinical deterioration from messy notes better than simpler systems that cost less, run faster, and attract fewer infrastructure headaches. ...

December 24, 2025 · 12 min · Zelina
Cover image

Black Boxes, White Coats: AI Epidemiology and the Art of Governing Without Understanding

A hospital does not need a perfect theory of neural network internals before it can notice that one clinical AI keeps recommending the wrong kind of follow-up. A bank does not need to decode every transformer layer before it can see that a credit assistant behaves oddly around post-bankruptcy applicants. A regulator does not need metaphysics. It needs repeatable measurements. ...

December 20, 2025 · 18 min · Zelina
Cover image

When Tools Think Before Tokens: What TxAgent Teaches Us About Safe Agentic AI

When Tools Think Before Tokens: What TxAgent Teaches Us About Safe Agentic AI Tools are supposed to make AI safer. That is the sales pitch, anyway. Give the model access to curated biomedical databases, let it call APIs instead of hallucinating from memory, and clinical reasoning suddenly becomes more grounded. Less improvisation, more evidence. Less theatrical confidence, more traceable work. ...

December 15, 2025 · 13 min · Zelina
Cover image

Lost in Translation: When Multilingual LLMs Miss the Medical Plot

Accuracy is a seductive number. It is tidy, executive-friendly, and easy to put in a slide deck. A model gets 82% accuracy, someone says “good enough,” and suddenly a clinical workflow is being “transformed.” Healthcare, as usual, has a way of punishing this kind of optimism. Not loudly at first. Quietly. Through false negatives, silent majority-class prediction, and a dashboard that looks reassuring until someone asks the rude question: what exactly did the model miss? ...

December 6, 2025 · 16 min · Zelina
Cover image

Fog of Neuro: Why Speech May Become the Next MRI

Fog of Neuro: Why Speech May Become the Next MRI Speech is a strange medical instrument. It does not look like one. It does not come with a scanner room, a radiology report, or a patient lying very still while a machine complains loudly. It comes out in ordinary life: a story, a pause, a word search, a sentence that loses its thread halfway through. For many neurological conditions, especially rare metabolic and neurodegenerative diseases, that ordinary speech may contain something today’s clinic often misses: the patient’s real cognitive state between appointments. ...

December 5, 2025 · 13 min · Zelina
Cover image

Scan, Plan, Report: When Agentic AI Starts Thinking Like a Radiologist

Scan, Plan, Report: When Agentic AI Starts Thinking Like a Radiologist Report writing is the visible part of radiology. It is also the part easiest for AI vendors to misunderstand. A radiology report looks like text, so the naive automation pitch is obvious: give the CT scan to a vision-language model, ask for a report, and let the model type faster than a human. Congratulations, we have reinvented autocomplete with more liability. ...

December 3, 2025 · 18 min · Zelina
Cover image

Uncertainty, But Make It Clinical: How MedBayes‑Lite Teaches LLMs to Say 'I Might Be Wrong'

A hospital does not need a chatbot that sounds certain. It needs a system that knows when certainty would be irresponsible. That sounds obvious until one remembers how most AI demos behave: fluent answer first, caveat somewhere after the damage has already put on shoes. In clinical decision support, this is not a stylistic defect. It is an operating risk. A model can be wrong in many ways, but the most dangerous version is the confidently wrong one: the triage answer that should have been escalated, the medication suggestion that should have been checked, the risk score that looks clean only because the system has no vocabulary for doubt. ...

November 22, 2025 · 16 min · Zelina
Cover image

CURE Enough: When Multimodal EHR Models Finally Grow Up

Hospitals do not run on clean datasets. They run on discharge notes, lab panels, repeated admissions, missing context, and the occasional clinical abbreviation that looks like it escaped from a tax form. That is the awkward reality behind chronic-disease prediction. The patient record is not just text. It is not just lab values. It is not just a sequence of visits. It is all three, with timing doing much of the quiet work. A patient returning after 42 days does not mean the same thing as a patient returning after 420 days, even when the diagnosis code looks identical. Healthcare operations already know this. Many AI models, bless their expensive little hearts, still behave as if they do not. ...

November 17, 2025 · 14 min · Zelina
Cover image

Karma, But Make It Causal: Why Simulation Is Finally Growing Up

A hospital monitor, a factory sensor array, and a trading dashboard have a shared irritation: they all produce time-series data that everyone wants to model, almost nobody wants to share, and absolutely nobody fully understands from correlations alone. That is the practical problem behind KarmaTS, a proposed interactive framework for constructing executable, lag-indexed causal simulations for multivariate time series.1 The paper is not trying to sell another magical causal-discovery algorithm. Good. We have enough of those wandering around with heroic acronyms and very delicate assumptions. ...

November 17, 2025 · 14 min · Zelina
Cover image

Scalpels, Agents, and Orchestrators: When Surgery Meets Autonomous Workflows

The surgeon does not need another chatbot Operating rooms already have enough things demanding attention. Monitors, tools, imaging, staff coordination, alarms, procedural checklists, and the small matter of the patient. In robotic surgery, the problem becomes sharper: the surgeon’s hands are occupied and their visual attention is locked into the console. The data may be nearby, but nearby is not the same as usable. ...

November 16, 2025 · 14 min · Zelina