Cover image

When LLMs Stop Guessing and Start Complying: Agentic Neuro-Symbolic Programming

The problem is not that LLMs cannot write code. It is that they write the wrong kind too confidently. A familiar scene: someone gives an LLM a task, receives a block of code that looks elegant, runs it, and discovers that it has invented an API, misunderstood the library, or solved a neighboring problem with excellent grammar. This is annoying when the target is ordinary Python. It is worse when the target is a specialized framework where the code is supposed to encode logic, constraints, and domain structure. ...

January 5, 2026 · 13 min · Zelina
Cover image

Secrets, Context, and the RAG Illusion

An employee privately tells a colleague that she plans to resign. Weeks later, she asks her AI assistant to draft an email to her manager about her future goals. The assistant searches her previous conversations, retrieves the resignation discussion, and helpfully writes that her priority is preparing for a smooth transition because she has accepted another role. ...

January 2, 2026 · 14 min · Zelina
Cover image

Silent Scholars, No More: When Uncertainty Becomes an Agent’s Survival Instinct

RAG is a very polite librarian. It fetches documents, quotes passages, and helps an agent look less ignorant in public. Then the agent closes the book, answers the user, and leaves no trace except a chat log, a cache entry, or perhaps another small pile of private “reflections” that no one else will ever see. ...

December 28, 2025 · 18 min · Zelina
Cover image

Echoes, Not Amnesia: Teaching GUI Agents to Remember What Worked

Memory is not a folder A useful employee does not fill out the same form from scratch every morning as if yesterday never happened. They remember which menu hides the export button, which warning can be ignored, which field must be filled before the “Next” button wakes up, and which apparently harmless click sends the process into a small bureaucratic swamp. ...

December 23, 2025 · 17 min · Zelina
Cover image

Doctor GPT, But Make It Explainable

Triage begins with messy language. A patient does not usually arrive as a clean feature vector. They arrive with “I feel tired,” “my stomach is strange,” “I have fever but not always,” or the classic: “I searched online and now I am either fine or dying.” Traditional diagnostic models are not built for this level of human poetry. They prefer structured fields, stable vocabularies, and the fantasy that symptoms behave like dropdown menus. ...

December 22, 2025 · 15 min · Zelina
Cover image

ASKing Smarter Questions: When Scholarly Search Learns to Explain Itself

Search used to be a polite negotiation with a database. You typed keywords. The system returned papers. You inspected titles, opened tabs, skimmed abstracts, cursed quietly, adjusted the keywords, and repeated the ritual until either the literature became clear or your soul left the building. Large language models changed the ritual, but not always for the better. Now a system can answer a research question directly, which feels magical until one remembers that “fluent” and “correct” are not synonyms. In scholarly work, this distinction is not academic decoration. It is the difference between literature discovery and very confident misinformation wearing a lab coat. ...

December 21, 2025 · 16 min · Zelina
Cover image

Let There Be Light (and Agents): Automating Quantum Experiments

Let There Be Light (and Agents): Automating Quantum Experiments A lab notebook is not just a diary. It is an institutional memory system with bad handwriting, missing parameter values, and occasional coffee damage. That is not a joke, unfortunately. In experimental science, much of the valuable knowledge sits between formal theory and physical execution: which crystal goes with which pump, how the beams should be routed, which detector timing window is plausible, which old setup can be reused, and which beautiful simulation is quietly lying through its teeth. ...

December 20, 2025 · 16 min · Zelina
Cover image

Painkillers with Foresight: Teaching Machines to Anticipate Cancer Pain

A patient says the pain is manageable. The medication chart looks stable. The latest score is not alarming. Then, sometime before the next formal reassessment, the pain breaks through. That is the operational problem behind Zhuang et al.’s study on predicting lung-cancer pain episodes with a hybrid machine-learning and large-language-model pipeline.1 The paper is not really about whether “AI can predict pain,” a sentence that sounds impressive until one remembers that dashboards have been predicting things since before consultants discovered the word “agentic.” The more interesting question is narrower and more useful: when should a hospital trust structured data, and when should it ask a language model to read the messy clinical story around the data? ...

December 19, 2025 · 15 min · Zelina
Cover image

Picking Less to Know More: When RAG Stops Ranking and Starts Thinking

Search is not judgment Search is easy to admire because it produces something visible. A ranked list. A bigger context window. A satisfying pile of passages that says, “Look, we retrieved evidence.” Very comforting. Also not the same as knowing what evidence is actually needed. That distinction is the core of Context-Picker: Dynamic Context Selection Using Multi-stage Reinforcement Learning.1 The paper studies a familiar RAG problem: if a system retrieves too little, it misses the answer; if it retrieves too much, it drags in distractors, repeats, weakly related fragments, and the usual long-context swamp where useful evidence politely disappears in the middle. ...

December 17, 2025 · 14 min · Zelina
Cover image

Ports, But Make Them Agentic: When LLMs Start Running the Yard

Ports are already full of automation. Cranes move containers, AGVs follow routes, software coordinates flows, dashboards blink reassuringly at managers who are paid to pretend that blinking equals control. Then one terminal changes its layout, closes a road, adds a vehicle restriction, or introduces a new safety corridor. Suddenly the “automated” dispatching system needs engineers, operations researchers, domain experts, test scripts, model reformulation, solver debugging, and several meetings where everyone discovers that “just adjust the rule” was not, in fact, just. ...

December 17, 2025 · 16 min · Zelina