When Coders Prove Theorems: Agents, Lean, and the Quiet Death of the Specialist Prover
A mechanism-first reading of Numina-Lean-Agent, showing why the real lesson is not a perfect Putnam score but a verifiable agent loop for high-stakes reasoning.
A mechanism-first reading of Numina-Lean-Agent, showing why the real lesson is not a perfect Putnam score but a verifiable agent loop for high-stakes reasoning.
A mechanism-first reading of ARK, a training-free knowledge-graph retriever that lets LLMs control when to search broadly, when to traverse locally, and when to stop.
A mechanism-first reading of how AI infrastructure enters GDP: through capex, imports, data-center services, and accounting channels—not instant productivity magic.
A mechanism-first reading of high-dimensional clustering: why better representations can still produce worse clusters when abstraction is pushed too far.
A mechanism-first reading of Deep GraphRAG, showing why hierarchical retrieval and adaptive reward balancing matter more than another benchmark table.
CRANE shows why multimodal recommendation needs recursive alignment, symmetric user-item semantics, and graph structure—not just more images and text poured into the same old model.
A mechanism-first reading of FAQ, a data-centric post-training quantization method that uses larger in-family models to regenerate calibration data and reduce quantization damage.
A mechanism-first reading of SD-RAG and what it teaches businesses about building privacy-aware RAG systems that do not rely on the answering model to protect secrets it has already seen.
A mechanism-first reading of why machine learning does not remove human control from science, but quietly redistributes it across goals, metrics, and methodological tradeoffs.
XChoice shows why AI–human alignment in constrained decisions should be audited through hidden trade-off mechanisms, not just plausible-looking outputs.