From Black-Box to Boarding Gate: When LLMs Finally Learn to Show Their Work
A mechanism-first reading of how ontology-scaffolded LLM extraction can turn airport operating manuals into traceable knowledge graphs and process maps.
A mechanism-first reading of how ontology-scaffolded LLM extraction can turn airport operating manuals into traceable knowledge graphs and process maps.
AutoB2G shows how LLM agents can turn building–grid simulation from a manual engineering workflow into a structured, executable, and repairable automation pipeline.
A mechanism-first reading of GUIDE, a training-free framework that turns tutorial videos into task-specific planning and grounding knowledge for GUI agents.
A mechanism-first reading of BeSafe-Bench and what it reveals about unsafe success in agentic AI systems.
A mechanism-first reading of AIRA2: why scalable AI research agents need shared evolutionary memory, protected evaluation, and interactive operators—not just bigger models and more GPUs.
A mechanism-first reading of PAPO, showing why separating correctness rewards from process rubrics can keep reasoning-model RL useful without paying models to perform for the judge.
A mechanism-first reading of HIVE, a prompt-selection method that cuts waste in RL training by finding the moving learning edge before expensive rollouts begin.
A mechanism-first reading of Vision Hopfield Memory Networks and what memory-centric vision backbones may mean for data-efficient, auditable AI systems.
A mechanism-first reading of Photon, a 3D medical multimodal model that makes CT-volume reasoning cheaper by pruning visual tokens according to the question being asked.
A mechanism-first reading of PIDP-Attack, showing why RAG risk emerges from the interaction between query rewriting, poisoned retrieval, and obedient generation.