## Opening — Why this matters now
AI’s recent victories in mathematics—AlphaGeometry, DeepSeek‑Prover, AlphaEvolve—have leaned on a familiar formula: overwhelming compute, evolutionary thrashing, and enough sampling to make Monte Carlo blush. Effective, yes. Elegant? Hardly.
Sphere packing, however, does not care for this style of progress. Each evaluation in the three‑point SDP framework can require days, not milliseconds. There is no room for “just try another million candidates.” Any system operating here must think, not flail.
The paper at hand—*Model‑Based and Sample‑Efficient AI‑Assisted Math Discovery in Sphere Packing*—proposes precisely that shift. Instead of brute force, it introduces a disciplined, model‑based optimisation procedure that behaves like a mathematician under time pressure: it chooses carefully, evaluates sparingly, and squeezes insight out of every datapoint.
And the result? New state‑of‑the‑art upper bounds across 12 dimensions (4–7, 9–16). In an area notorious for glacial progress, this is a jolt.
## Background — A century of constraints
Sphere packing asks a question as old as Euclid: how densely can we fill $\mathbb{R}^n$ with non‑overlapping spheres of equal radius? Optimal solutions are known only for $n = 2, 3, 8, 24$. Everything else is a battleground of upper and lower bounds.
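For context, the workhorse upper bound in high dimensions descends from the Cohn–Elkies linear programme. As usually stated (I am paraphrasing the standard theorem, so treat the normalisation as mine): if a well‑behaved function $f:\mathbb{R}^n\to\mathbb{R}$ with $\hat f(0) > 0$ satisfies

$$
f(x) \le 0 \;\text{ for all } |x| \ge r, \qquad \hat f(t) \ge 0 \;\text{ for all } t,
$$

then every sphere packing in $\mathbb{R}^n$ has density at most $\frac{f(0)}{\hat f(0)}\cdot\operatorname{vol}\big(B^n_{r/2}\big)$. The “LP Bound” column in the table further down is exactly this bound, optimised over such $f$.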
The three‑point method by Cohn, de Laat, and Salmon turns this problem into a colossal semidefinite program (SDP). Each candidate SDP, defined by geometric parameters $(r, R)$ and polynomial building blocks $(f_1, f_2)$, yields an upper bound on density.
But the search space is malicious:
- Continuous variables: $(r, R)$
- Discrete symbolic objects: polynomial monomials constructed from seven base polynomials
- Strict analytic constraints: positivity, symmetry, Fourier‑space behaviour
Brute force collapses under the weight of the SDPs themselves. Every failed guess costs days.
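Concretely, a single point in this mixed continuous–discrete search space might look like the sketch below; the class and field names are mine, purely for illustration, not the paper's data structures.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    """One candidate in the SDP Game's mixed search space (illustrative)."""
    r: float               # continuous geometric parameter (classically r = 1)
    R: float               # second radial parameter, with r <= R
    f1_tokens: list[str]   # monomials built from the seven base polynomials
    f2_tokens: list[str]   # second witness, drawn from the same vocabulary

# Evaluating one Candidate means assembling and solving a large SDP,
# which can take days -- hence the premium on sample efficiency.
```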
## Analysis — What the paper actually does
The authors reframe the entire endeavour as a sequential decision process: the SDP Game.
1. **Stage 1 — Choose geometric parameters $(r, R)$.** The agent uses Bayesian Optimisation (BO) to model the reward landscape (i.e., SDP objective values). BO’s uncertainty quantification shines here: each evaluation is expensive, so the agent must minimise regret.
2. **Stage 2 — Construct the polynomial witnesses $(f_1, f_2)$.** The agent uses Monte Carlo Tree Search (MCTS) over a constrained vocabulary of polynomial tokens. Each “sentence” corresponds to a mathematically valid pair of functions, and the grammar guarantees admissibility (continuity, sign patterns, PSD kernels, etc.).
3. **Stage 3 — Solve the induced SDP.** The environment returns the upper bound.
4. **Stage 4 — Iterate.** The surrogate models improve and the exploration routines sharpen. (A minimal code sketch of this loop follows the list.)
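To make the loop concrete, here is a minimal runnable sketch. Everything in it is an assumption for illustration: `propose_rR`, `mcts_polynomials`, and `solve_sdp` are hypothetical stand‑ins for the paper’s BO surrogate, MCTS constructor, and SDP solver, and the stub bodies below bear no relation to the real computations.

```python
import random

def propose_rR(history):
    """Stage 1 stub: the real agent fits a Bayesian-optimisation surrogate
    to past evaluations and maximises an acquisition function; we sample."""
    r = random.uniform(0.8, 1.5)           # no longer pinned to r = 1
    return r, r + random.uniform(0.0, 1.0)

def mcts_polynomials(r, R):
    """Stage 2 stub: the real agent runs MCTS over a token grammar that
    guarantees admissibility (continuity, sign patterns, PSD kernels)."""
    return ["1", "x**2"], ["x**2", "x**4"]

def solve_sdp(r, R, f1, f2):
    """Stage 3 stub: the real step solves a huge SDP and can take days."""
    return 0.64 - 0.001 * random.random()

history = []
for step in range(5):                      # tiny budget: evaluations are costly
    r, R = propose_rR(history)             # Stage 1: geometric parameters
    f1, f2 = mcts_polynomials(r, R)        # Stage 2: polynomial witnesses
    bound = solve_sdp(r, R, f1, f2)        # Stage 3: expensive evaluation
    history.append((r, R, bound))          # Stage 4: feedback for the surrogate

print(f"best bound so far: {min(b for _, _, b in history):.6f}")
```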
This is not a model-free evolutionary brawl. It’s methodical, hierarchical, and almost annoyingly efficient.
### Why this is novel
- Across all dimensions, 80–85% of the monomial structures the agent selects are new, absent from prior human‑designed constructions.
- It independently recovers human‑designed structures, confirming correctness.
- It explores the $(r, R)$ landscape beyond the classical $r=1$ regime and finds previously unstudied high‑performing regions.
- It even approximates the special sign‑structure behaviour of Viazovska’s magic function in $n = 8$.
In short: the agent isn’t guessing; it’s learning the geometry.
## Findings — What progress looks like in numbers
Below is a condensed version of the key table (all values from Table 1 in the paper):
### New Upper Bounds Across 12 Dimensions
| Dim | LP Bound | Three‑Point (Cohn et al.) | AI‑Based Bound | % New Monomials |
|---|---|---|---|---|
| 4 | 0.64771 | 0.63610733 | 0.63610277 | 83.3% |
| 5 | 0.52498 | 0.51264513 | 0.51226450 | 85.1% |
| 6 | 0.41767 | 0.41030328 | 0.41029050 | 84.7% |
| 7 | 0.32746 | 0.32114711 | 0.32111475 | 85.1% |
| 9 | 0.19456 | 0.19112042 | 0.19099063 | 83.7% |
| 10 | 0.14759 | 0.14341009 | 0.14340271 | 83.3% |
| 11 | 0.11169 | 0.10672529 | 0.10672201 | 80.8% |
| 12 | 0.08378 | 0.07971177 | 0.07970660 | 80.0% |
| 13 | 0.06248 | 0.06016444 | 0.06012103 | 84.4% |
| 14 | 0.04637 | 0.04506122 | 0.04501031 | 82.5% |
| 15 | 0.03425 | 0.03375644 | 0.03372585 | 81.6% |
| 16 | 0.02520 | 0.02499441 | 0.02492121 | 82.9% |
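To put those digits in perspective, a quick script (values copied from the table above) computes how much each AI‑based bound tightens the three‑point baseline:

```python
# Relative improvement of the AI-based bound over the three-point baseline.
# Tuples are (dim, three_point, ai_based), taken from the table above.
rows = [
    (4, 0.63610733, 0.63610277), (5, 0.51264513, 0.51226450),
    (6, 0.41030328, 0.41029050), (7, 0.32114711, 0.32111475),
    (9, 0.19112042, 0.19099063), (10, 0.14341009, 0.14340271),
    (11, 0.10672529, 0.10672201), (12, 0.07971177, 0.07970660),
    (13, 0.06016444, 0.06012103), (14, 0.04506122, 0.04501031),
    (15, 0.03375644, 0.03372585), (16, 0.02499441, 0.02492121),
]

for dim, three_pt, ai in rows:
    gain = (three_pt - ai) / three_pt          # smaller bound = tighter
    print(f"n={dim:2d}: {gain:.5%} tighter")   # e.g. n=16 gains ~0.293%
```

The absolute gains sit in the fourth to sixth decimal place, which is exactly where progress in this area lives.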
### Structural Insights
Two visual findings stand out:
- Low‑degree monomials dominate the agent’s choices (Figure 4 of the paper): the agent finds more mileage in simple, low‑degree SDP building blocks than in high‑degree ones.
- Exploration of $(r,R)$ escapes classical mathematical intuition (Figure 5). BO consistently finds pockets of tighter bounds outside $r=1$.
This is the kind of structural discovery mathematicians care about—not just the numbers but the shape of the space where good ideas live.
## Case Study — $n = 8$
The agent approximates—without being told—the key features of Viazovska’s magic function:
- Non‑positivity beyond $\sqrt{2}$
- Non‑negative Fourier transform
- Approximate root placement
- Numerical convergence toward the optimal density
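For reference, the sign structure being shadowed here is the one that makes Viazovska’s function “magic”; as usually stated (normalisation mine), one needs $f:\mathbb{R}^8\to\mathbb{R}$ with

$$
f(x) \le 0 \;\text{ for } |x| \ge \sqrt{2}, \qquad \hat f(t) \ge 0 \;\text{ for all } t, \qquad f(0) = \hat f(0) = 1,
$$

which plugs into the LP bound above with $r = \sqrt{2}$ to certify the $E_8$ density $\pi^4/384$. The agent recovers the qualitative shape of these constraints from search alone.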
This is not a replication of the modular‑form breakthrough, but it is a remarkable act of convergent reasoning.
## Implications — Why this matters beyond pure math
Three messages for industry and policy:
### 1. Model‑based search will become essential wherever evaluation is expensive
This includes:
- Drug design with costly wet‑lab trials
- Nuclear materials simulation
- Autonomous trading systems under strong latency penalties
- Regulatory simulations for systemic risk
Brute‑force AI will not scale into these domains; disciplined Bayesian agents will.
### 2. Symbolic‑numeric hybrids represent a new frontier in AI assurance
This system constructs proofs—not just predictions. It outputs certificates: SDPs whose feasibility can be independently verified. For AI governance, this is a dream scenario.
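The verification step is cheap relative to the search. Here is a toy sketch of the core check, assuming (my assumption, not the paper’s stated format) that a certificate is handed over as a set of Gram matrices, one per PSD block of the SDP:

```python
import numpy as np

def is_psd(gram: np.ndarray, tol: float = 1e-9) -> bool:
    """Check positive semidefiniteness via the spectrum.
    A rigorous verifier would redo this in exact rational arithmetic
    and also check the SDP's linear constraints."""
    gram = (gram + gram.T) / 2          # symmetrise against round-off
    return bool(np.linalg.eigvalsh(gram).min() >= -tol)

# Illustrative certificate: a list of Gram matrices, one per PSD block.
# The claimed bound is valid only if every block passes.
certificate = [np.array([[2.0, 1.0], [1.0, 2.0]])]
assert all(is_psd(G) for G in certificate)
```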
### 3. The approach hints at future AI assistance in theoretical research
Mathematicians may soon use agents to:
- Explore constrained functional spaces
- Propose admissible witness functions
- Suggest promising regimes for analytical attacks
Not replacing human insight, but accelerating its scaffolding.
## Conclusion — A shift from power to precision
This work signals a turn away from the “scale solves everything” era and toward a more ascetic AI philosophy: fewer evaluations, more structure, stronger inductive bias.
In a field as stiff and unforgiving as sphere packing, this approach didn’t just survive—it produced new world‑class results. If model‑based, sample‑efficient AI can advance a problem that resisted mathematicians for centuries, its implications for scientific computing, optimisation, and governance are profound.
Cognaptus: Automate the Present, Incubate the Future.