Textual Gradients and Workflow Evolution: How AdaptFlow Reinvents Meta-Learning for AI Agents

From Static Scripts to Living Workflows: The AI agent world has a scaling problem: most automated workflow builders generate one static orchestration per domain. Great in benchmarks, brittle in the wild. AdaptFlow, a meta-learning framework from Microsoft and Peking University, proposes a fix: treat workflow design like model training, but swap numerical gradients for natural language feedback. This small shift has a big implication: instead of re-engineering from scratch for each use case, you start from a meta-learned workflow skeleton and adapt it on the fly for each subtask. ...

August 12, 2025 · 3 min · Zelina

Search When It Hurts: How UR² Teaches Models to Retrieve Only When Needed

Most “smart” RAG stacks are actually compulsive googlers: they fetch first and think later. UR² (“Unified RAG and Reasoning”) flips that reflex. It trains a model to reason by default and retrieve only when necessary, using reinforcement learning (RL) to orchestrate the dance between internal knowledge and external evidence. Why this matters for builders: indiscriminate retrieval is the silent cost center of LLM systems—extra latency, bigger bills, brittle answers. UR² shows a way to make retrieval selective, structured, and rewarded, yielding better accuracy on exams (MMLU‑Pro, MedQA), real‑world QA (HotpotQA, Bamboogle, MuSiQue), and even math. ...

August 11, 2025 · 5 min · Zelina

Mind Over Modules: How Smart Agents Learn What to See—and What to Be

In the race to build more autonomous, more intelligent AI agents, we’re entering an era where “strategy” isn’t just about picking the next move—it’s about choosing the right mind for the job and deciding which version of the world to trust. Two recent arXiv papers—one on state representation in dynamic routing games, the other on self-generating agentic systems with swarm intelligence—show just how deeply this matters in practice. We’re no longer only asking: What should the agent do? We now must ask: ...

June 19, 2025 · 5 min · Zelina