Cover image

When Markets Dream: The Rise of Agentic AI Traders

Opening — Why this matters now The line between algorithmic trading and artificial intelligence is dissolving. What once were rigid, rules-based systems executing trades on predefined indicators are now evolving into learning entities — autonomous agents capable of adapting, negotiating, and even competing in simulated markets. The research paper under review explores this frontier, where multi-agent reinforcement learning (MARL) meets financial markets — a domain notorious for non-stationarity, strategic interaction, and limited data transparency. ...

November 5, 2025 · 3 min · Zelina
Cover image

Agents with Interest: How Fintech Taught RAG to Read the Fine Print

Opening — Why this matters now The fintech industry is an alphabet soup of acronyms and compliance clauses. For a large language model (LLM), it’s a minefield of misunderstood abbreviations, half-specified processes, and siloed documentation that lives in SharePoint purgatory. Yet financial institutions are under pressure to make sense of their internal knowledge—securely, locally, and accurately. Retrieval-Augmented Generation (RAG), the method of grounding LLM outputs in retrieved context, has emerged as the go-to approach. But as Mastercard’s recent research shows, standard RAG pipelines choke on the reality of enterprise fintech: fragmented data, undefined acronyms, and role-based access control. The paper Retrieval-Augmented Generation for Fintech: Agentic Design and Evaluation proposes a modular, multi-agent redesign that turns RAG from a passive retriever into an active, reasoning system. ...

November 4, 2025 · 4 min · Zelina
Cover image

Smarter, Not Wiser: What Happens When AI Boosts Our Efficiency but Not Our Minds

Opening — Why this matters now In a world obsessed with productivity hacks and digital assistants, a new study offers a sobering reminder: being faster is not the same as being smarter. As tools like ChatGPT quietly integrate into workplaces and classrooms, the question isn’t whether they make us more efficient — they clearly do — but whether they actually reshape the human mind. Recent findings from the Universidad de Palermo suggest they don’t. ...

November 4, 2025 · 4 min · Zelina
Cover image

The Agent Olympics: How Toolathlon Tests the Limits of AI Workflows

Opening — Why this matters now The AI world is obsessed with benchmarks. From math reasoning to coding, each new test claims to measure progress. Yet, none truly capture what businesses need from an agent — a system that doesn’t just talk, but actually gets things done. Enter Toolathlon, the new “decathlon” for AI agents, designed to expose the difference between clever text generation and real operational competence. In a world where large language models (LLMs) are being marketed as digital employees, Toolathlon arrives as the first test that treats them like one. Can your AI check emails, update a Notion board, grade homework, and send follow-up messages — all without breaking the workflow? Spoiler: almost none can. ...

November 4, 2025 · 4 min · Zelina
Cover image

The Memory Illusion: Why AI Still Forgets Who It Is

Opening — Why this matters now Every AI company wants its assistant to feel personal. Yet every conversation starts from zero. Your favorite chatbot may recall facts, summarize documents, even mimic a tone — but beneath the fluent words, it suffers from a peculiar amnesia. It remembers nothing unless reminded, apologizes often, and contradicts itself with unsettling confidence. The question emerging from Stefano Natangelo’s “Narrative Continuity Test (NCT)” is both philosophical and practical: Can an AI remain the same someone across time? ...

November 3, 2025 · 4 min · Zelina
Cover image

Two Minds in One Machine: How Agentic AI Splits—and Reunites—the Field

Opening — Why this matters now Agentic AI is the latest obsession in artificial intelligence: systems that don’t just respond but decide. They plan, delegate, and act—sometimes without asking for permission. Yet as hype grows, confusion spreads. Many conflate these new multi-agent architectures with the old, symbolic dream of reasoning machines from the 1980s. The result? Conceptual chaos. A recent comprehensive survey—Agentic AI: A Comprehensive Survey of Architectures, Applications, and Future Directions—cuts through the noise. It argues that today’s agentic systems are not the heirs of symbolic AI but the offspring of neural, generative models. In other words: we’ve been speaking two dialects of intelligence without realizing it. ...

November 3, 2025 · 4 min · Zelina
Cover image

Who Really Runs the Workflow? Ranking Agent Influence in Multi-Agent AI Systems

Opening — Why this matters now Multi-agent systems — the so-called Agentic AI Workflows — are rapidly becoming the skeleton of enterprise-grade automation. They promise autonomy, composability, and scalability. But beneath this elegant choreography lies a governance nightmare: we often have no idea which agent is actually in charge. Imagine a digital factory of LLMs: one drafts code, another critiques it, a third summarizes results, and a fourth audits everything. When something goes wrong — toxic content, hallucinated outputs, or runaway costs — who do you blame? More importantly, which agent do you fix? ...

November 3, 2025 · 5 min · Zelina
Cover image

Bias on Demand: When Synthetic Data Exposes the Moral Logic of AI Fairness

Bias on Demand: When Synthetic Data Exposes the Moral Logic of AI Fairness In the field of machine learning, fairness is often treated as a technical constraint — a line of code to be added, a metric to be optimized. But behind every fairness metric lies a moral stance: what should be equalized, for whom, and at what cost? The paper “Bias on Demand: A Modelling Framework that Generates Synthetic Data with Bias” (Baumann et al., FAccT 2023) breaks this technical illusion by offering a framework that can manufacture bias in data — deliberately, transparently, and with philosophical intent. ...

November 2, 2025 · 4 min · Zelina
Cover image

From Prototype to Profit: How IBM's CUGA Redefines Enterprise Agents

When AI agents first emerged as academic curiosities, they promised a future of autonomous systems capable of navigating apps, websites, and APIs as deftly as humans. Yet most of these experiments never left the lab. The jump from benchmark to boardroom—the point where AI must meet service-level agreements, governance rules, and cost-performance constraints—remained elusive. IBM’s recent paper, From Benchmarks to Business Impact, finally brings data to that missing bridge. The Benchmark Trap Generalist agents such as AutoGen, LangGraph, and Operator have dazzled the research community with their ability to orchestrate tasks across multiple tools. But academic triumphs often hide operational fragility. Benchmarks like AppWorld or WebArena measure intelligence; enterprises measure ROI. They need systems that are reproducible, auditable, and policy-compliant—not just clever. ...

November 2, 2025 · 4 min · Zelina
Cover image

Recursive Minds: How ReCAP Turns LLMs into Self-Correcting Planners

In long-horizon reasoning, large language models still behave like short-term thinkers. They can plan, but only in a straight line. Once the context window overflows, earlier intentions vanish, and the model forgets why it started. The new framework ReCAP (Recursive Context-Aware Reasoning and Planning)—from Stanford’s Computer Science Department and MIT Media Lab—offers a radical solution: give LLMs a recursive memory of their own reasoning. The Problem: Context Drift and Hierarchical Amnesia Sequential prompting—used in CoT, ReAct, and Reflexion—forces models to reason step by step along a linear chain. But in complex, multi-stage tasks (say, cooking or coding), early goals slide out of the window. Once the model’s focus shifts to later steps, earlier plans are irretrievable. Hierarchical prompting tries to fix this by spawning subtasks, but it often fragments information across layers—each sub-agent loses sight of the global goal. ...

November 2, 2025 · 4 min · Zelina