Healthcare AI

Say Less: A Child-Speech Screener Designed to Stop Before Diagnosis

TL;DR for operators The paper presents a narrowly scoped screening pipeline for detecting common Polish sibilant substitutions in children.1 It does not attempt to transcribe unrestricted speech, diagnose a speech sound disorder, or replace a speech-language pathologist. That restraint is not an apology. It is the architecture. The system works by preserving non-canonical pronunciations as explicit tokens, aligning them against a known prompted pronunciation, converting the difference into a structured error description, and allowing a caregiver-facing assistant to speak only through clinician-reviewable templates. ...

No CIG, Still Checking: When Medical Guidelines Become Executable

TL;DR for operators Most organizations do not have a compliance problem because nobody wrote the rules down. They have a compliance problem because the rules exist in prose, the operational evidence exists in messy records, and the bridge between the two is usually a small group of overworked experts quietly aging in a meeting room. ...

The Clean Label Fairy Is Not Coming

TL;DR for operators Hospitals do not label images the same way. Radiologists disagree on contours. Pathologists disagree on grades. Automatically generated masks miss structures, add structures, or quietly confuse one target for another. In centralized AI, those errors are already irritating. In federated learning, they become operationally awkward because the data cannot simply be pooled, inspected, cleaned, and morally forgiven by a heroic annotation team. ...

The Missing Ingredient Wasn’t Vision: NutriMLLM and the Data Recipe for Micronutrient AI

TL;DR for operators Food-image nutrition AI is usually sold as a vision problem: recognise the meal, estimate the portion, output the nutrients, preferably with a pleasant progress spinner. NutriMLLM suggests that this is only half right. The harder missing piece is not necessarily seeing the food. It is knowing the full nutrient profile once the food is identified. ...

Chart Check: Why Clinical Summaries Need Detectors Before Alignment

Chart review is the boring part of medicine, which is exactly why AI systems should learn from it. A clinical discharge summary does not fail only when it sounds clumsy. It fails when it tells a patient something that did not happen, invents a medication change, adds a procedure, misstates a timing detail, or turns a vague note into a confident medical fact. The prose may still be smooth. The bedside manner may even be excellent. Unfortunately, a hallucination delivered in fluent patient-friendly language is not safer because it has better manners. ...

The Heart of the Model: ECG Foundation Models Need the Right Backbone Before More Data

Cost is not always about size. That is an inconvenient sentence for anyone trying to sell a larger medical foundation model by waving parameter counts like a hospital procurement trophy. In ECG modeling, the expensive question is not simply whether one can pretrain on more recordings. The harder question is whether the model architecture and pretraining task actually match the structure of the signal. ...

MARCH Orders: When AI Holds a CT Case Conference

The useful meeting, unfortunately, exists Meetings are usually where productivity goes to file a complaint. But there is one kind of meeting that high-stakes work still needs: the review session where a first draft is challenged, evidence is checked, and a senior decision-maker signs off. Radiology has long understood this. A resident may draft the report. A fellow may question the interpretation. An attending radiologist resolves the remaining uncertainty. The point is not ceremony. The point is controlled disagreement. ...

The Cost of Playing It Safe: When AI Safety Creates Harm

Refusal looks safe. That is the problem. A user says they have run out of ordinary options: the specialist is gone, the appointment is weeks away, the emergency department has already sent them home, and the remaining medication supply is not enough to bridge the gap. The user asks an AI system what to do. The model refuses to provide concrete guidance and recommends the same professional route the user has just explained is unavailable. ...

When AI Starts Writing Papers: The Rise of the Medical AI Scientist

Papers used to have a useful quality: they were difficult to produce. Not always good, unfortunately, but difficult. Someone had to identify a problem, read the literature, design the method, write the code, run the experiment, repair the code, compare the result, draw the figures, write the manuscript, and then survive peer review with only minor emotional damage. ...

When EEG Stops Thinking in Squares: Why Linear-Time Models Are Quietly Winning

The hospital problem is not that EEG is too small. It is that EEG refuses to stay the same shape. A hospital does not run machine learning inside a clean benchmark. It runs it across devices, departments, vendors, technicians, recording protocols, and patients who rarely behave like textbook signals. Electroencephalography, or EEG, makes this especially inconvenient. The signal is long, noisy, clinically useful, and structurally inconsistent. Different datasets may use different electrode counts. Different institutions may follow different montage conventions. A model that looks competent on one electrode layout can become less confident when the scalp is wired slightly differently. Apparently, brains did not agree to standardize themselves for our convenience. ...