## Opening — Why this matters now
AI has officially entered semiconductor manufacturing—again. And once again, the promise is speed: faster mask optimization, fewer simulations, lower cost. Yet beneath the marketing gloss lies an inconvenient truth. Most AI models for optical proximity correction (OPC) and inverse lithography technology (ILT) have been trained on data that barely resembles real chips. Synthetic layouts. Isolated tiles. Zero awareness of standard-cell hierarchy. No real notion of context.
MaskOpt enters this scene with a blunt message: if you want AI to behave like a lithography engineer, you must show it what engineers actually see.
## Background — Context is not optional in lithography
Modern optical lithography operates well below the exposure wavelength. Diffraction, interference, and process variation are no longer edge cases; they are the regime. OPC and ILT exist precisely because what prints on silicon is determined as much by neighbors as by the target shape itself.
Traditional model-based OPC and ILT workflows account for this through repeated simulation across context windows—accurate, but computationally brutal. Deep learning promised a shortcut, but most existing datasets quietly amputated the hardest part of the problem: hierarchical layout structure and optical proximity context.
The result? Models that look impressive in benchmarks but prove fragile in production.
## Analysis — What MaskOpt actually changes
MaskOpt is not a new neural architecture. It is something rarer and arguably more valuable: a dataset that forces models to confront reality.
### 1. Built from real designs, not design-rule fantasies
MaskOpt is constructed from five real IC designs at the 45 nm node using the Nangate standard-cell library. Instead of rule-generated metal patterns, it clips layouts directly from placed-and-routed designs. This alone eliminates a major generalization gap present in prior benchmarks.
### 2. Cell-aware by construction
Each sample is clipped around a standard-cell placement, not an arbitrary grid. The same AND, NAND, AOI, or XOR cell appears repeatedly—exactly how hierarchical OPC exploits reuse in practice. Cell identity is preserved via an explicit cell tag, allowing models to learn reusable correction logic rather than memorizing geometry.
### 3. Context is a first-class variable
MaskOpt explicitly varies context window sizes (0–128 nm) around the 512 nm core region. This is not cosmetic. Optical proximity effects can extend hundreds of nanometers, especially for sparse via layers. MaskOpt lets models learn how much context is enough, instead of assuming one-size-fits-all.
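The clipping geometry is simple to picture in code. The sketch below is illustrative only — the function name, the 1 nm/pixel raster pitch, and the centering convention are my assumptions, not MaskOpt's actual API; the 512 nm core and 0–128 nm context range come from the description above.

```python
import numpy as np

CORE_NM = 512    # fixed core region, per the dataset description
PIXEL_NM = 1     # assumed raster pitch of 1 nm per pixel

def clip_with_context(layout: np.ndarray, cx: int, cy: int, context_nm: int) -> np.ndarray:
    """Clip a square tile of (core + 2 * context) nm centered at (cx, cy).

    The context ring is included so the model sees its optical neighbors;
    the loss would typically be evaluated only on the 512 nm core.
    """
    half = (CORE_NM // 2 + context_nm) // PIXEL_NM
    return layout[cy - half: cy + half, cx - half: cx + half]

layout = np.zeros((2048, 2048), dtype=np.uint8)
tile = clip_with_context(layout, 1024, 1024, context_nm=128)
print(tile.shape)  # (768, 768): a 128 nm ring widens the 512 nm core
```

The point of varying `context_nm` per sample is exactly what the next findings section probes: how wide that ring must be before the model has seen everything that physically matters.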
### 4. Dual ground truth: OPC and ILT
Each layout tile is paired with two golden masks:
- Model-based OPC masks
- ILT masks generated via OpenILT
This makes MaskOpt suitable for both incremental correction tasks and full inverse mask synthesis.
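Put together, each training sample bundles geometry, hierarchy, and both supervision signals. The record below is a minimal sketch of that pairing; the field names and schema are my assumptions, not the dataset's published format.

```python
from dataclasses import dataclass
import numpy as np

@dataclass
class MaskOptSample:
    layout: np.ndarray    # target tile (core + context ring), binary
    cell_tag: str         # e.g. "NAND2_X1" from the Nangate library
    context_nm: int       # width of the context ring, 0-128 nm
    opc_mask: np.ndarray  # golden mask from model-based OPC
    ilt_mask: np.ndarray  # golden mask generated via OpenILT

    def target(self, task: str) -> np.ndarray:
        # One dataset, two supervision signals: incremental correction
        # trains against the OPC mask, inverse synthesis against ILT.
        return self.opc_mask if task == "opc" else self.ilt_mask

tile = np.zeros((576, 576), dtype=np.uint8)  # 512 nm core + 2 x 32 nm context
s = MaskOptSample(tile, "NAND2_X1", 32, tile.copy(), tile.copy())
print(s.target("ilt").shape)  # (576, 576)
```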
## Findings — What the benchmarks quietly reveal
MaskOpt evaluates several established models (GAN-OPC, DAMO, Neural-ILT, CFNO) under consistent conditions. The results are less about winners and more about trade-offs.
### Context size matters—but differently
| Layer | Best-performing context | Why |
|---|---|---|
| Metal | ~32 nm | Dense patterns; too much context adds noise |
| Via | ~128 nm | Sparse features; neighbors dominate print behavior |
This alone undermines the common practice of training models on fixed-size tiles without physical justification.
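One practical consequence: the tile budget becomes a per-layer configuration rather than a global constant. A trivial sketch, with values mirroring the table above (the helper name is my own):

```python
# Per-layer context budgets from the findings table; illustrative config.
BEST_CONTEXT_NM = {"metal": 32, "via": 128}

def field_of_view_nm(layer: str, core_nm: int = 512) -> int:
    # Total clip width = core plus context on both sides.
    return core_nm + 2 * BEST_CONTEXT_NM[layer]

print(field_of_view_nm("metal"))  # 576
print(field_of_view_nm("via"))    # 768
```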
### Accuracy vs manufacturability is still a tension
| Model | Strength | Weakness |
|---|---|---|
| DAMO | Lowest L2 and EPE errors | Explodes shot count |
| GAN-OPC | Simple masks, low shot count | Loses fine detail |
| Neural-ILT | Balanced performance | No clear dominance |
AI did not magically dissolve the physics–manufacturing trade-off. It merely shifted where the compromises surface.
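To see what each axis of that table rewards, here are toy versions of the two accuracy metrics: squared-L2 distance between the printed image and the target, and a crude edge-placement-error (EPE) proxy. Real flows measure EPE at sampled edge sites after lithography simulation; this sketch only conveys the intuition.

```python
import numpy as np

def l2_error(printed: np.ndarray, target: np.ndarray) -> int:
    # Pixel-wise squared error; on binary images this is the mismatch count.
    return int(np.sum((printed.astype(int) - target.astype(int)) ** 2))

def epe_proxy(printed: np.ndarray, target: np.ndarray, row: int) -> int:
    # Horizontal displacement of the printed left edge vs. the target
    # left edge on one scanline -- an illustrative stand-in for EPE sites.
    t = int(np.argmax(target[row] > 0))
    p = int(np.argmax(printed[row] > 0))
    return abs(p - t)

target = np.zeros((8, 8), dtype=np.uint8); target[2:6, 2:6] = 1
printed = np.zeros((8, 8), dtype=np.uint8); printed[2:6, 3:6] = 1
print(l2_error(printed, target))      # 4: one 4-pixel column missing
print(epe_proxy(printed, target, 3))  # 1: left edge shifted by 1 px
```

Shot count, the third axis, is a manufacturability cost: a mask that wins on these two metrics can still be hopeless to write if it fragments into too many rectangles.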
### Cell awareness is not optional
Ablation experiments removing the cell tag consistently degrade performance, especially for via layers and ILT tasks. In other words: without knowing which cell it is fixing, the model guesses—and silicon punishes guesses.
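The ablation is easy to picture: the cell tag enters the model as an extra conditioning signal, and removing it simply zeros that signal out. The embedding scheme below is an assumption of mine — the benchmarked models each wire this in differently — but it shows what the model loses.

```python
import numpy as np

CELL_VOCAB = ["AND2", "NAND2", "AOI21", "XOR2"]  # illustrative subset

def cell_embedding(tag: str, use_tag: bool = True) -> np.ndarray:
    onehot = np.zeros(len(CELL_VOCAB), dtype=np.float32)
    if use_tag:  # the ablation (use_tag=False) zeros this out
        onehot[CELL_VOCAB.index(tag)] = 1.0
    return onehot

def model_input(tile: np.ndarray, tag: str, use_tag: bool) -> np.ndarray:
    # Broadcast the tag embedding to extra channels alongside the
    # geometry, letting the network specialize corrections per cell.
    h, w = tile.shape
    tag_planes = np.broadcast_to(
        cell_embedding(tag, use_tag)[:, None, None], (len(CELL_VOCAB), h, w))
    return np.concatenate([tile[None], tag_planes], axis=0)

x = model_input(np.zeros((64, 64), dtype=np.float32), "NAND2", use_tag=True)
print(x.shape)  # (5, 64, 64): 1 geometry channel + 4 tag channels
```

With `use_tag=False`, two geometrically identical tiles from different cells become indistinguishable, which is exactly the guessing the ablation results punish.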
## Implications — Why this dataset matters more than another model
MaskOpt’s real contribution is not higher benchmark scores. It is raising the bar for what “realistic” means in AI-for-manufacturing research.
For researchers:
- Clever architectures cannot compensate for unrealistic data.
- Context modeling must be physically motivated, not arbitrary.
For industry:
- Dataset design is now a strategic decision, not a preprocessing detail.
- Hierarchical reuse and context sensitivity are learnable—if you show them.
For the AI hardware ecosystem:
- Expect a shift from synthetic benchmarks toward process-faithful datasets.
- The bottleneck is no longer compute—it is representation.
## Conclusion — AI doesn’t fail lithography; bad abstractions do
MaskOpt makes an uncomfortable point: many past failures of AI-based OPC were not algorithmic. They were epistemic. We trained models on worlds that do not exist.
By grounding learning in real layouts, real hierarchy, and real optical context, MaskOpt doesn’t just advance mask optimization. It quietly redefines what “learning the physics” actually means in semiconductor AI.
Cognaptus: Automate the Present, Incubate the Future.