Model First, Think Later: Why LLMs Fail Before They Reason

The schedule looked reasonable. That was the problem.

Imagine asking an AI agent to build a weekly medical schedule. It produces a neat plan. The steps are numbered. The tone is confident. The explanation is calm enough to sedate a committee.

Then someone checks the details.

A medication interval is violated. A resource is assigned twice. A prerequisite appears after the action that depends on it. Nothing looks absurd sentence by sentence, but the plan is broken as a system.

This is the kind of failure that Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling wants us to take seriously.¹ The paper’s core argument is not that LLM agents need to “think harder.” That has become the industry’s favorite reflex, usually followed by longer chain-of-thought, more tool calls, more agent loops, and occasionally a dashboard because apparently every error becomes more mature once plotted.

The paper argues something more basic: many LLM-agent planning failures happen before reasoning begins. The agent is not necessarily applying bad inference to a good problem model. It may never have built a stable problem model in the first place.

That distinction matters. If the failure is inferential, the remedy is better reasoning. If the failure is representational, the remedy is better modeling. These are not the same engineering problem.

CoT improves the route, not the map

Chain-of-Thought prompting asks the model to explain its intermediate reasoning. ReAct extends this by interleaving reasoning with actions and observations. Both paradigms helped move LLMs from answer machines toward agent-like systems.

But neither forces the model to define the world it is acting in.

That is the paper’s useful correction. In CoT, entities, states, constraints, and transitions often appear as needed. The model may mention a constraint in step two, quietly forget it in step six, and still produce prose that sounds locally coherent. ReAct improves interaction, but the state still lives inside a free-form trace. Observations may be treated as if they were derived, when they were only assumed. Global constraints may remain scattered across the conversation rather than enforced as a stable structure.

The result is a familiar agent failure pattern:

Agent behavior	Surface appearance	Underlying representational problem
Produces fluent steps	Looks thoughtful	No explicit inventory of entities or states
Uses tools or observations	Looks grounded	Observations may not update a durable state model
Mentions constraints	Looks compliant	Constraints are not represented as invariants
Revises a plan midstream	Looks adaptive	Prior commitments can drift or vanish
Gives a final plan	Looks complete	Validity was never checked against a fixed model

The important phrase here is fixed model. The paper repeatedly contrasts LLM-agent prompting with classical planning traditions such as STRIPS and PDDL, where problem definition comes before problem solving. Classical planners are rigid and not fashionable at dinner parties, but they remember something modern agents often forget: planning requires a representation of objects, actions, preconditions, effects, and constraints.

LLMs are flexible. That flexibility is useful. It is also how a planning system can politely invent its own universe halfway through a task.

Model-First Reasoning makes the agent commit before it acts

Model-First Reasoning, or MFR, is the paper’s proposed remedy. The method is intentionally simple: separate model construction from reasoning.

Before producing a plan, the LLM must explicitly define:

relevant entities;
state variables;
possible actions, including preconditions and effects;
constraints that valid solutions must respect.

Only after this model is created does the agent generate the solution. The second phase is instructed to reason using only the model defined in the first phase.

This sounds modest, but the mechanism is sharper than it looks. MFR turns the model construction step into a representational commitment. The agent is no longer allowed to solve and define the task at the same time. It must first say what exists, what can change, what actions are legal, and what rules must not be broken.

That changes the failure mode. Without MFR, an error can remain hidden inside a plausible narrative. With MFR, many errors become mismatches between the plan and the declared model. The agent can still be wrong, of course. This is not a formal verifier. But wrongness becomes easier to inspect.

The paper calls this a form of “soft symbolic grounding.” That phrase is doing useful work. MFR does not force the LLM into a rigid symbolic planner. The model can be expressed in natural language, semi-structured text, or pseudo-formal notation. The point is not to resurrect old-school AI with a nicer font. The point is to borrow the stabilizing discipline of explicit representation without giving up the flexibility of language models.

The mechanism is the evidence filter

The paper’s experiments compare CoT, ReAct, and MFR across representative constraint-driven planning tasks, including multi-step medication scheduling, route planning with temporal dependencies, resource allocation with sequential constraints, logic puzzles, and procedural synthesis tasks.

The evidence should be read carefully. The paper does not present a large quantitative benchmark leaderboard. It emphasizes qualitative assessment over selected representative tasks. Outputs are manually checked along three dimensions: constraint satisfaction, implicit assumptions, and structural clarity.

That makes the results useful, but not in the way a casual reader may expect. This is not a “MFR beats ReAct by 17.3%” paper. It is closer to a mechanism demonstration: when the model is forced to externalize problem structure before planning, the resulting plans show fewer visible constraint violations, fewer implicit assumptions, and higher structural clarity.

The central comparison is:

Reasoning strategy	Constraint violations	Implicit assumptions	Structural clarity
Chain-of-Thought	Medium	Frequent	Low
ReAct	Medium–Low	Occasional	Medium
Model-First	Low	Rare	High

The table is qualitative, so it should not be inflated into a precision claim. Its value is diagnostic. CoT tends to produce fluent reasoning but can skip intermediate states, introduce unstated actions, and lose global consistency. ReAct improves local interaction but still depends on natural-language traces for state tracking. MFR improves structure because the agent has a stable reference object: the explicit model.

The paper’s Figure 1 is best understood as a mechanism illustration. It shows CoT moving from problem description to reasoning to answer; ReAct looping through reasoning, action, and observation; and MFR inserting an explicit problem model before reasoning. Figure 2 and Table 1 are the main comparative evidence. They support the paper’s claim that explicit modeling improves constraint adherence and interpretability in the tested settings.

One caution: the abstract says ablation studies confirm that separating modeling from reasoning is critical, but the accessible paper text primarily presents the qualitative comparison among prompting paradigms rather than a detailed separate ablation table. So the safe interpretation is this: the paper supports the separation principle through comparative qualitative evidence, but it does not give enough numeric ablation detail to support fine-grained claims about which exact subcomponent contributes how much.

That boundary matters. Good analysis is allowed to be impressed without becoming obedient.

Hallucination is not always false content; sometimes it is missing structure

The paper’s most useful move is conceptual. It broadens the meaning of hallucination in agentic contexts.

In ordinary chatbot use, hallucination often means a false statement: a fake citation, a wrong date, a confident biography of a person who never existed. In planning tasks, hallucination can be subtler. The agent may invent a permissible action. It may assume a resource is available. It may treat a constraint as optional because no explicit state representation forces the constraint to persist.

In other words, hallucination becomes a representational failure.

That framing is valuable because it explains why longer reasoning traces can fail. If the model is reasoning over an incomplete or unstable representation, more reasoning may simply produce a longer path through the wrong state space. More steps do not save a bad model. They just make the mistake better documented.

This is especially important for agent deployments. A customer-support chatbot can sometimes recover from a loose answer. A scheduling agent, compliance assistant, workflow planner, procurement bot, or operations coordinator cannot simply sound sensible. It must preserve state and constraints across steps.

The business problem is not “Can the agent explain itself?” The business problem is “Can the agent maintain a valid model of the task while acting?”

Those are annoyingly different questions. Naturally, procurement will discover this after approving the pilot.

Where MFR fits in an agent stack

MFR should not be treated as a replacement for CoT, ReAct, retrieval, tools, or verification. The paper positions it as a foundational layer that can combine with existing methods.

That is the right reading. In a production setting, MFR would sit near the beginning of the agent loop:

User task
  ↓
Model construction
  ↓
Model inspection or correction
  ↓
Planning over model
  ↓
Tool execution / ReAct loop
  ↓
State update against model
  ↓
Verification before output or action

The practical value is not that MFR makes the agent magically correct. It makes the agent’s assumptions visible early enough to inspect.

For business workflows, the model-construction phase can become an audit surface. A human reviewer, rule engine, or secondary model can check whether the agent identified the right entities, constraints, action types, and state variables before allowing planning to continue.

That suggests a simple operational rule:

For low-stakes tasks, ask the agent for an answer. For high-stakes tasks, ask the agent for a model first.

The difference is not cosmetic. A generated plan is hard to debug once it has already mixed task interpretation, action selection, and justification into one fluent soup. A separate model can be reviewed, reused, corrected, and logged.

The ROI is cheaper diagnosis, not just better answers

The obvious business interpretation is that MFR may reduce agent errors in constraint-heavy tasks. True, but slightly incomplete.

The deeper ROI is diagnostic efficiency.

When an agent fails under ordinary CoT or ReAct prompting, teams often inspect the final answer and the trace, then try to infer where the task went wrong. Was the instruction ambiguous? Did retrieval miss a fact? Did the agent misunderstand the constraint? Did a tool output fail to update state? Did the model invent a step because the prompt was too loose?

This is slow and unpleasant, which is why teams often solve it with optimism. A bold choice.

MFR narrows the debugging surface. If the explicit model is wrong, the failure is in task representation. If the model is right but the plan violates it, the failure is in reasoning or execution. If both are right but the real-world outcome fails, the issue may lie in tool integration, data freshness, or external uncertainty.

That separation creates a cleaner incident analysis workflow:

Failure location	What to inspect	Likely fix
Model construction	Missing entity, state variable, action, or constraint	Improve model prompt, add schema, add domain rules
Reasoning over model	Plan violates declared precondition or constraint	Add plan verifier, stricter reasoning prompt, constrained decoding
Tool execution	Tool output does not update state correctly	Improve tool wrapper, state persistence, observation parsing
Final validation	Output is plausible but unsafe	Add policy checks, human review, or domain-specific verifier

This is where MFR becomes more than a prompt trick. It becomes an architectural pattern: separate representation from reasoning so errors can be assigned to the right layer.

What the paper directly shows, and what business teams can infer

The paper directly shows a qualitative comparison: across selected constraint-heavy planning examples, MFR produces outputs rated as lower in constraint violations, lower in implicit assumptions, and higher in structural clarity than CoT and ReAct. It also gives a mechanism: explicit model construction stabilizes entities, states, actions, and constraints before planning.

Cognaptus’ business inference is narrower than the marketing brochure version:

Paper result	Business interpretation	Boundary
MFR improves structural clarity	Agent plans become easier to inspect and debug	Clarity does not equal correctness
MFR reduces implicit assumptions	Better fit for regulated or procedural workflows	Only if the model construction step captures the right constraints
MFR lowers qualitative constraint violations	Useful for scheduling, allocation, routing, compliance, and operations tasks	Evidence is qualitative and selected, not exhaustive benchmarking
MFR needs no architectural changes	Can be added to existing LLM workflows through prompting	Token cost and latency increase
MFR externalizes the problem model	Creates an audit checkpoint before action	Still needs human or automated verification in high-risk domains

This is the correct level of enthusiasm: useful, implementable, and not a miracle wearing a lab coat.

The biggest risk is a beautifully wrong model

MFR moves the problem upstream. That is helpful, but it does not eliminate the problem.

If the model-construction phase misses a constraint, defines the wrong state variables, or misunderstands an action’s effect, the reasoning phase can become more confidently wrong. This is the classic danger of structure: once a bad structure is accepted, it becomes a machine for producing disciplined errors.

The paper acknowledges this model dependence. It also notes token overhead: explicit model construction increases prompt and output length. For occasional high-value planning tasks, that cost may be trivial. For high-volume low-margin workflows, it matters.

There is also a scope boundary. MFR is most relevant to tasks with explicit structure: planning, scheduling, resource allocation, procedural execution, compliance logic, and safety-sensitive multi-step workflows. It is less obviously valuable for open-ended ideation, stylistic writing, or tasks where the problem representation is inherently fluid.

Finally, MFR is not verification. It makes constraints visible. It does not prove that all constraints are complete, correct, or satisfied. A serious deployment would still need validators, logs, human escalation thresholds, and domain-specific guardrails.

The paper’s lesson is not “trust the agent after it models.” The lesson is “do not let the agent act before it models.”

Subtle difference. Expensive difference.

A better agent does not start by thinking

The common agent-building instinct is to add more reasoning: longer traces, stronger models, more tool calls, more elaborate loops. Sometimes that helps. But this paper points to a prior question: what exactly is the agent reasoning over?

Model-First Reasoning is valuable because it restores an old discipline to a new architecture. Before planning, define the world. Before action, define legal action. Before optimization, define constraints. Before trusting the answer, inspect the model that made the answer possible.

For business users, this means the next generation of reliable agents may not be distinguished by how dramatically they “think.” They may be distinguished by whether they can expose a stable, reviewable model of the task before they begin.

That is less glamorous than autonomous intelligence. It is also much closer to how real operations work.

The agent should model first, think later. Preferably before it reschedules the pharmacy.

Cognaptus: Automate the Present, Incubate the Future.

Gaurav Kumar and Annu Rana, “Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling,” arXiv:2512.14474, 2025. https://arxiv.org/abs/2512.14474 ↩︎

The schedule looked reasonable. That was the problem.#

CoT improves the route, not the map#

Model-First Reasoning makes the agent commit before it acts#

The mechanism is the evidence filter#

Hallucination is not always false content; sometimes it is missing structure#

Where MFR fits in an agent stack#

The ROI is cheaper diagnosis, not just better answers#

What the paper directly shows, and what business teams can infer#

The biggest risk is a beautifully wrong model#

A better agent does not start by thinking#