Targeted Forgetting: Why AI Can’t Just ‘Unlearn’ — And What TRU Fixes

Delete is a comforting word.

A user deletes an account. A marketplace removes a product. A shopper corrects a preference history because the recommendation engine has decided, with touching confidence, that one accidental click reveals a permanent love of baby strollers, golf gloves, or suspiciously ugly jackets.

In a normal database, deletion sounds like a row-level operation. Remove the row, update the index, move on with life. In a trained recommender model, deletion is less tidy. The deleted data may already have shaped user embeddings, item popularity, image-text fusion layers, and ranking behavior. The row is gone, but its ghost may still be politely recommending itself.

That is the problem behind TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning, an arXiv paper on unlearning in multimodal recommendation systems.¹ The paper’s central claim is not simply that AI systems need to forget. We knew that. The sharper claim is that multimodal recommenders do not forget uniformly, so a uniform reverse update is the wrong tool for the job.

This matters because many modern recommenders are no longer built only from user-item clicks. They mix collaborative graphs with item-side content: image features, text features, ID embeddings, and graph propagation. That richer structure improves personalization, but it also gives deleted information more places to hide. Very convenient for recommendation accuracy. Less convenient for privacy engineering. Life is balanced like that.

The paper proposes Targeted Reverse Update, or TRU, as a plug-and-play approximate unlearning framework. Instead of retraining the full recommender from scratch after every deletion request, TRU tries to push the existing model toward the behavior of a retrained model by applying reverse updates only where deleted-data influence is likely to remain: ranking outputs, modality branches, and sensitive layers.

The important business lesson is not “TRU is cheaper than retraining,” although that is part of the appeal. The more useful lesson is this:

Unlearning is not one operation. It is a diagnosis problem.

You first need to know where the deleted influence survives. Then you decide how aggressively to remove it without damaging everything the model still needs to know.

The mistaken mental model: deletion as stronger reverse training

A common intuition is that machine unlearning can be handled by taking the original training logic and running it backward on the data to forget. If the model learned from a user’s interactions, apply gradient ascent on those interactions. If forgetting is incomplete, turn the knob harder. Add a repair step on retained data. Congratulations, you have a privacy strategy, or at least something that looks very employable in a slide deck.

The TRU paper argues that this intuition breaks down in multimodal recommendation systems.

The reason is architectural heterogeneity. A multimodal recommender does not store influence in one neat parameter bucket. It usually contains several coupled structures:

Where influence lives	Why simple deletion fails	TRU’s response
Ranking exposure	Target items may remain popular because other retained users still interact with them	Suppress residual target-item visibility through a ranking/fusion gate
Modality branches	ID, image, and text branches respond differently to the same reverse signal	Scale reverse gradients branch by branch
Network layers	Deleted-data sensitivity concentrates in particular modules rather than the whole model	Apply reverse updates only to selected sensitive layers

This is why “just reverse harder” is too crude. A global reverse update can under-remove traces in one part of the model while over-damaging another. The paper’s framing is useful because it moves unlearning away from a moral slogan — respect deletion rights — and toward an engineering question: which part of the model still carries the deleted influence, and how much intervention can that part tolerate?

Mechanism 1: target items survive because popularity is shared

The first failure mode is target-item persistence.

In user-level deletion, the user’s interactions are removed from the training data. But the items connected to that user are often still connected to many other retained users. If a deleted user interacted with a popular product, removing that user’s edge does not make the product disappear from the collaborative graph. The item still has neighbors. It still has momentum. It may still appear in recommendations.

The paper illustrates this with Amazon-Clothing. Even after full retraining, target items still appear in Top-20 rankings; the authors report only marginal exposure reductions of 12.4% for MGCN and 18.3% for MIG-GT. The explanation is not mysterious: for most items, deleted interactions account for less than 10% of total engagement, leaving the retained graph to keep those items alive.

This is an uncomfortable result because full retraining is often treated as the gold standard for unlearning. But in recommendation, the deleted user’s influence and the item’s legitimate retained popularity are entangled. A product can be both part of a deleted user’s history and genuinely relevant to retained users. The model cannot simply erase the item from reality, tempting as that might be for compliance teams during audit season.

TRU responds by adding a ranking/fusion gate. The point is not to delete every target item from every recommendation list. The point is to suppress the residual visibility that is specifically carried through the deleted relation. In practical terms, TRU treats output ranking exposure as part of the unlearning surface, not merely a downstream metric.

That distinction matters. A deletion request is not satisfied just because training rows were removed. If the trained system still ranks target-linked items in a way that reflects deleted behavior, the operational risk remains visible at the product layer.

Mechanism 2: image, text, and ID features do not absorb forgetting equally

The second failure mode is modality imbalance.

Multimodal recommenders combine different item-side signals. ID embeddings capture collaborative identity. Image branches capture visual similarity. Text branches capture descriptions, categories, or semantic information. These branches are not interchangeable. They encode different structures and learn at different speeds.

The paper uses centered kernel alignment analysis to show weak alignment across modality representations in MGCN and MIG-GT. In plainer language: the ID, image, and text branches are not all looking at the same world through slightly different glasses. They are often building rather different internal maps.

This creates a problem for uniform unlearning. The same reverse update may be too strong for one branch and too weak for another. A fragile branch can be over-corrected, damaging retained representations. A dominant branch can remain under-corrected, preserving deleted traces. The result is the worst kind of compromise: not private enough, not accurate enough, and still somehow computationally expensive.

TRU’s branch-wise modality scaling is meant to address this. It estimates retain-side and forget-side gradient energies for each branch, then adjusts the reverse update by modality. If a branch already appears to have forgotten sufficiently, the method shifts emphasis toward preserving retained utility. If a branch still carries forget-side influence, it receives a stronger correction.

The business interpretation is simple but important: unlearning in multimodal systems should be instrumented by representation channel. A platform that treats text, image, audio, graph, and ID signals as one monolithic feature soup will struggle to explain where forgetting actually happened. That is fine until an auditor, regulator, or internal risk team asks a very old-fashioned question: “Show me.”

Mechanism 3: sensitive layers, not the whole model, need the knife

The third failure mode is layer-wise sensitivity.

A model’s parameters do not all respond equally to deletion. Some modules carry more deletion-sensitive information than others. The paper argues that conventional uniform reverse updates can over-shift early embedding modules relative to an exact retraining baseline. That is a serious warning sign: the method is not merely removing deleted influence; it is pushing parts of the model into parameter movement that retraining itself would not have produced.

TRU addresses this through capacity-aware layer selection. It estimates module sensitivity on the forget objective, selects the most deletion-sensitive modules, and expands the selected subset until it covers a minimum parameter-capacity threshold. The design tries to avoid two opposite errors: updating almost everything, which becomes a blunt global perturbation, or updating too little, which lacks enough capacity to actually forget.

This is the most operationally interesting part of the paper. It suggests that unlearning systems should produce something like a deletion heat map: which modules moved, why they moved, and how that movement compares with a retraining reference or audit baseline.

That is not just a technical nicety. In production governance, the question is rarely “Did you run the unlearning job?” The better question is:

Did the unlearning job touch the right representations, and did it avoid damaging retained behavior unnecessarily?

TRU’s layer selection gives a concrete way to ask that question.

What TRU actually does during unlearning

TRU follows the reverse-repair protocol used in prior approximate unlearning, but changes the reverse step.

At a high level, each unlearning epoch does three things:

Estimate modality-specific scaling factors from retain and forget gradient energies.
Estimate module sensitivity and build a mask for sensitive layers.
Apply a gated, scaled, and masked reverse update on the forget set, followed by the usual repair update on retained data.

The paper emphasizes that TRU is plug-and-play: it does not require redesigning the recommender backbone, changing the retained learning objective, or altering forward inference. That is important because enterprise recommendation systems do not enjoy being rebuilt every time a privacy paper gets a good result. They are, annoyingly, attached to revenue.

A compact way to understand TRU is this:

TRU component	Technical role	Operational consequence
Ranking gate	Penalizes residual target-item ranking influence at fusion-related parameters	Reduces visible recommendation traces tied to deletion requests
Branch-wise scaling	Adjusts reverse-gradient strength across ID, image, and text branches	Avoids over-correcting fragile modalities and under-correcting dominant ones
Layer selection	Restricts reverse updates to deletion-sensitive modules	Reduces collateral damage and improves efficiency versus blind updates

The framework is less about making unlearning magical and more about making it less stupid. A modest but valuable ambition.

The evidence: TRU improves the retain-forget trade-off, especially when baselines collapse

The experiments use two multimodal graph-based backbones: MGCN, which fuses ID-, image-, and text-side signals on the user-item graph, and MIG-GT, which decouples per-modality receptive fields and adds global transformer context. The datasets are three Amazon categories: Baby, Sports, and Clothing. The paper evaluates three deletion regimes: user-level, item-level, and interaction-level forgetting.

The baselines include Original, Retrain, UltraRE, MultiDelete, ScaleGUN, and MMRecUn. This matters because they represent different families: exact retraining, partition/retraining-style approaches, multimodal unlearning, graph unlearning, and the most direct prior method for multimodal recommendation unlearning.

The main table reports Recall@20 and NDCG@20 on retain and forget sets. The desired pattern is high retain-side performance and low forget-side performance, ideally approaching the exact retraining frontier.

The headline result is that TRU usually keeps retain-side accuracy closer to the original or retrained model while reducing forget-side exposure more effectively than prior approximate baselines.

A few examples show the shape of the result:

Setting	Baseline issue	TRU result to notice
MGCN / Baby	MMRecUn reduces forget metrics but loses retain utility	TRU preserves Retain Recall@20 around the original level while keeping forget metrics rounded to zero
MIG-GT / Baby	MMRecUn leaves non-trivial forget traces and weakens retain utility	TRU improves Retain Recall@20 to 0.1008 under user-level deletion while reducing Forget Recall@20 to 0.0132
MIG-GT / Clothing	MMRecUn leaves large user-level forget recall at 0.5234	TRU reduces it to 0.0022 while keeping retain utility close to original
MIG-GT / Sports	MMRecUn leaves interaction-level Forget Recall@20 at 0.1155	TRU reduces it to 0.0042

The deeper point is not that every number is perfect. It is that TRU’s gains are often strongest where uniform approaches expose their weakness: highly decoupled multimodal backbones and settings where forgetting one side without damaging the other becomes difficult.

The paper’s challenging-case analysis makes this clearer. On MIG-GT with Sports under user-level deletion, several baselines polarize. Some forget aggressively but damage retain utility; others preserve utility but fail to forget. TRU reports Retain Recall@20 of 0.1036 and Forget Recall@20 of 0.0051 in the main table, while the illustrative case table reports strong suppression of forget exposure with competitive retain performance. The exact formatting in the HTML version is messy, but the intended comparison is clear: TRU sits closer to the desirable middle ground rather than choosing one failure mode with confidence.

The security audits support the direction, not a legal guarantee

The paper also evaluates deeper erasure using membership inference attack balanced accuracy and backdoor attack success rate. This is useful because ranking metrics alone can be deceptive. A model may stop recommending a deleted item while still retaining structural information that an attack or audit can reveal.

The security results are best read carefully.

On MIG-GT with Sports, TRU reports MIA BalAcc of 0.5253, better than MMRecUn’s 0.5626 and closer to the random-guessing target of 0.5. It also ties for the lowest reported ASR in that setting. This supports the claim that TRU can reduce deeper residual traces, not merely clean up surface rankings.

On MGCN with Clothing, however, the story is less triumphant. TRU is the strongest approximate baseline in relative terms, but the MIA BalAcc remains high at 0.9388, far from the retraining reference of 0.5156. That is not a footnote to bury politely under “future work.” It is a boundary condition. TRU improves the situation, but some settings remain hard enough that approximate unlearning should not be sold as complete privacy removal.

This distinction is exactly where business readers need discipline. The paper supports better practical unlearning trade-offs. It does not prove that an approximate update automatically satisfies every legal, regulatory, or contractual interpretation of data erasure.

The ablation study is not decoration; it explains why the three mechanisms belong together

The ablation study removes TRU’s components one by one on the Baby dataset under user-level deletion. Its purpose is not to add another victory lap. It tests whether TRU’s three mechanisms are genuinely complementary.

The results are especially revealing for MGCN. The full model drives forget-side Recall@20 to zero. Removing the ranking gate causes Forget Recall@20 to jump to 0.0746. Removing modality scaling or layer selection gives smaller but still visible degradation. For MIG-GT, removing each component also weakens the forget-side profile, though the magnitudes differ.

The interpretation is straightforward:

Test	Likely purpose	What it supports	What it does not prove
Remove ranking gate	Component ablation	Ranking exposure needs direct suppression, especially for collaborative popularity inertia	That all target-item exposure is illegitimate
Remove modality scaling	Component ablation	Branches need different reverse-update strengths	That the proposed scaling rule is optimal for all modalities
Remove layer selection	Component ablation	Sensitive-module targeting reduces deeper residual traces	That selected layers are stable under all future model updates
Hyperparameter sensitivity	Robustness/sensitivity test	TRU appears to work across broad operating regions rather than one brittle setting	That production tuning will be easy under live drift
MMRecUn comparison appendix	Comparison with prior work	TRU improves over the most directly relevant prior MRS unlearning baseline	That it dominates every possible future approximate method

This is where the mechanism-first reading pays off. If you only read the paper as “TRU beats baselines,” the ablation table is just another table. If you read it as a diagnosis of non-uniform influence, the table becomes evidence that each diagnosis corresponds to a different intervention.

What businesses should take from TRU

For product and data teams, TRU points toward a more realistic architecture for deletion handling in AI-powered recommendation systems.

First, deletion should be audited at the ranking layer, not only the data layer. A platform should be able to ask whether deleted interactions still affect exposure, especially when target items remain popular through retained users.

Second, multimodal systems need channel-level unlearning diagnostics. Image, text, ID, audio, behavioral graph, and other signals may not forget at the same rate. A single model-level metric hides too much.

Third, unlearning should be logged as a targeted intervention. Which modules were selected? Which branches were scaled? Which outputs were suppressed? How did retain and forget metrics move? This is boring infrastructure. Boring infrastructure is often what separates a privacy program from a press release.

The ROI argument is also more subtle than “avoid retraining cost.” Full retraining is expensive, but the operational value of targeted unlearning is broader:

Practical need	Why TRU is relevant	Remaining uncertainty
Account deletion	User-level forgetting is explicitly tested	Real users may trigger repeated, sequential deletions over time
Product takedown	Item-level forgetting is included	Marketplaces may face richer metadata, seller networks, and policy constraints
Preference correction	Interaction-level forgetting is tested	User intent may be ambiguous; not every correction is a privacy deletion
Compliance audit	MIA and backdoor audits go beyond ranking metrics	Legal sufficiency requires jurisdiction-specific standards and evidence
Cost control	Approximate updates avoid full retraining for each request	Live systems need drift handling, monitoring, rollback, and human review

Cognaptus’ inference is that TRU-like methods are best understood as part of a broader unlearning operations layer: a workflow that receives deletion requests, maps them to model components, applies targeted updates, records evidence, and monitors retained utility afterward.

That is not glamorous. It is also much closer to what businesses actually need.

Where the paper’s evidence stops

The paper’s evidence is meaningful, but bounded.

It uses three Amazon categories, two graph-based multimodal recommender backbones, and benchmarked user-, item-, and interaction-level deletion regimes. This is a strong controlled setup, not a production deployment. Real systems may include more modalities, retrieval-ranking cascades, real-time feature stores, reinforcement loops, advertising constraints, cold-start logic, and policy filters. Each of those can create additional places where deleted influence leaks or reappears.

The experiments are also mostly single-shot deletion evaluations. Production systems face streams of deletion requests, periodic model refreshes, feature drift, changing catalogs, and model versioning. A method that works well for one carefully constructed forget set may need additional safeguards for repeated or overlapping requests.

Finally, “closer to retraining” is not the same as “legally erased.” Exact retraining itself may still preserve item exposure when retained users legitimately support the same item. Security audits can reveal residual traces even when ranking metrics look clean. So the correct reading is not that TRU solves compliance. It gives engineers a sharper tool for one part of the compliance problem.

The paper’s own future direction — auditable and continuously deployable unlearning under evolving data and richer modalities — is exactly the right place to push next.

The real contribution: unlearning becomes targeted maintenance

The most useful idea in TRU is not any single component. It is the shift from uniform reversal to targeted maintenance.

A recommender system is not a whiteboard where deleted data can be rubbed out with one dramatic gesture. It is a network of popularity effects, modality-specific representations, and sensitive parameter regions. Some parts need suppression. Some need protection. Some need to be left alone because they belong to retained users and legitimate behavior.

That is why TRU is a good paper for business readers. It does not invite the fantasy that AI systems can simply forget because a lawyer asked nicely. It shows a more practical path: diagnose where influence remains, intervene only where necessary, and measure both forgetting and collateral damage.

In the coming years, the companies that handle AI deletion well will not be the ones with the most cheerful privacy page. They will be the ones that can explain, with evidence, how deletion requests move through the model stack.

TRU is not the final answer. But it is a useful reminder that responsible AI often looks less like ethics theater and more like careful maintenance of messy machinery.

Cognaptus: Automate the Present, Incubate the Future.

Zhanting Zhou, KaHou Tam, Ziqiang Zheng, Zeyu Ma, and Yang Yang, “TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning,” arXiv:2604.02183v2, 10 April 2026, https://arxiv.org/abs/2604.02183. ↩︎

The mistaken mental model: deletion as stronger reverse training#

Mechanism 1: target items survive because popularity is shared#

Mechanism 2: image, text, and ID features do not absorb forgetting equally#

Mechanism 3: sensitive layers, not the whole model, need the knife#

What TRU actually does during unlearning#

The evidence: TRU improves the retain-forget trade-off, especially when baselines collapse#

The security audits support the direction, not a legal guarantee#

The ablation study is not decoration; it explains why the three mechanisms belong together#

What businesses should take from TRU#

Where the paper’s evidence stops#

The real contribution: unlearning becomes targeted maintenance#