Drafts, Then Do Better: Teaching LLMs to Outgrow Their Own Reasoning
Opening: Why this matters now

Large language models have learned to sound confident. Unfortunately, confidence is not correctness, especially in long-horizon reasoning tasks like competition math or multi-step logic. Reinforcement learning has helped, but most RL pipelines still assume a one-shot world: generate once, score once, update once. Humans don't work that way. We draft, reread, cringe, fix, and try again. ...