Opening — Why this matters now

Social media coordination detection is stuck in an awkward adolescence. Platforms know coordinated inauthentic behavior exists, regulators know it scales faster than moderation teams, and researchers know correlation-heavy detectors are brittle. Yet most deployed systems still behave as if yesterday’s parameters will work tomorrow.

This paper introduces Adaptive Causal Coordination Detection (ACCD)—not as another accuracy tweak, but as a structural correction. Instead of freezing assumptions into static thresholds and embeddings, ACCD treats coordination detection as a learning system with memory. And that subtle shift matters more than the headline F1 score.

Background — From correlation theater to causal liability

Most coordination detectors fall into one of three camps:

  1. Correlation-based heuristics — fast, cheap, and easily fooled by trending topics.
  2. Fixed-parameter causal models — mathematically elegant, operationally fragile.
  3. Supervised classifiers — accurate until labels dry up or behaviors mutate.

Convergent Cross Mapping (CCM) helped the field move from correlation toward causation, but in practice it shipped with hidden liabilities: fixed embedding dimensions, quadratic scaling, and manual tuning that assumes yesterday’s attacks resemble today’s.

Meanwhile, behavioral classifiers quietly accumulated annotation debt—high accuracy purchased with endless human labeling.

ACCD starts from a blunt premise: coordination detection fails not because models are weak, but because they forget.

Analysis — What ACCD actually does (and why it’s different)

ACCD is a three-stage system, but more importantly, it is stateful.

Stage 1: Adaptive causal discovery (memory beats brute force)

Instead of treating CCM parameters as universal constants, ACCD stores historical performance of embedding choices $(E, \tau)$ in a long-term memory keyed by coarse context (user volume, activity span, temporal density).

Parameter selection becomes a trade-off between exploitation and exploration:

$$ (E^*, \tau^*) = \arg\max_{E, \tau} \; \alpha \cdot \text{precision}_{\text{hist}}(E, \tau) + (1-\alpha) \cdot e^{-\beta \cdot \text{usage}(E, \tau)} $$

In plain terms: what worked before, adjusted for overuse.
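The selection rule above can be sketched in a few lines. Everything here is illustrative: the context keys, the stored precision and usage numbers, and the α/β settings are assumptions for the sketch, not values from the paper.

```python
import math

# Hypothetical long-term memory: a coarse context key (user volume,
# temporal density) maps to per-(E, tau) statistics gathered from
# past runs. Values below are made up for illustration.
memory = {
    ("high_volume", "dense"): {
        (3, 1): {"precision": 0.81, "usage": 40},  # good, but heavily reused
        (5, 2): {"precision": 0.78, "usage": 5},   # nearly as good, fresher
        (7, 3): {"precision": 0.62, "usage": 2},
    },
}

def select_embedding(context, alpha=0.7, beta=0.05):
    """Trade exploitation (historical precision) against exploration
    (an exponential penalty on overused parameter choices)."""
    candidates = memory[context]
    def score(stats):
        return alpha * stats["precision"] + (1 - alpha) * math.exp(-beta * stats["usage"])
    return max(candidates, key=lambda k: score(candidates[k]))

print(select_embedding(("high_volume", "dense")))
```

With these toy numbers the exploration term tips the choice toward `(5, 2)`: its precision is slightly lower than `(3, 1)`, but its low usage count keeps the system from locking onto one setting forever.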

Computationally, ACCD avoids the $O(N^2)$ trap by clustering users first, running CCM primarily within clusters, and sampling across them. The result is a practical reduction to roughly $O(N^{1.4})$ without collapsing signal quality.
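One way to picture the pair-budget savings: enumerate all pairs within each cluster, but only a sampled fraction across clusters. This is a sketch of the strategy, not the paper's exact algorithm, and it illustrates the reduced pair count rather than deriving the $O(N^{1.4})$ figure.

```python
import itertools
import random

def candidate_pairs(user_clusters, cross_sample_rate=0.05, seed=0):
    """Build the list of user pairs to feed into CCM: dense within
    clusters, sparsely sampled across them (illustrative sketch)."""
    rng = random.Random(seed)
    pairs = []
    for cluster in user_clusters:
        # All within-cluster pairs: this is where coordination is likeliest.
        pairs.extend(itertools.combinations(cluster, 2))
    for c1, c2 in itertools.combinations(user_clusters, 2):
        # Cross-cluster pairs are sampled, not exhaustively tested.
        for u, v in itertools.product(c1, c2):
            if rng.random() < cross_sample_rate:
                pairs.append((u, v))
    return pairs

clusters = [list(range(0, 50)), list(range(50, 100))]
pairs = candidate_pairs(clusters)
full = 100 * 99 // 2  # 4950 pairs under naive all-pairs CCM
print(len(pairs), "tested vs", full, "naive")
```

For 100 users in two clusters of 50, the within-cluster pairs alone are 2,450 and the sampled cross-cluster pairs add only a small fraction of the remaining 2,500, roughly halving the CCM workload at this scale.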

Stage 2: Semi-supervised classification (labels as a scarce resource)

Rather than pretending labels are free, ACCD treats them as expensive and deploys them surgically.

  • Random Forest classifiers estimate uncertainty per user.
  • Humans label only the most ambiguous cases.
  • High-confidence predictions are cached as pseudo-labels.
  • Training follows a curriculum: easy cases first, edge cases later.
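The labeling loop above can be sketched as uncertainty sampling with a Random Forest. The data, seed-set sizes, and batch size here are toy assumptions; the "oracle" array stands in for a human annotator.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Toy per-user behavioral features; true_y plays the human oracle.
X = rng.normal(size=(500, 8))
true_y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

# Seed set with both classes present, so predict_proba has two columns.
pos = list(np.where(true_y == 1)[0][:10])
neg = list(np.where(true_y == 0)[0][:10])
labeled = pos + neg
unlabeled = [i for i in range(500) if i not in set(labeled)]

clf = RandomForestClassifier(n_estimators=100, random_state=0)

for _ in range(5):  # a few active-learning rounds
    clf.fit(X[labeled], true_y[labeled])
    proba = clf.predict_proba(X[unlabeled])[:, 1]
    margin = np.abs(proba - 0.5)          # small margin = ambiguous user
    ask = np.argsort(margin)[:10]         # humans label the 10 hardest
    newly = [unlabeled[i] for i in ask]
    labeled.extend(newly)                 # oracle reveals their labels
    unlabeled = [u for u in unlabeled if u not in set(newly)]

print(f"labeled {len(labeled)} of 500 users")
```

After five rounds only 70 of 500 users have consumed annotation budget; in ACCD the remaining high-confidence predictions would additionally be cached as pseudo-labels rather than discarded.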

The result is not theoretical elegance but operational relief—over 60% fewer manual labels without degrading accuracy.

Stage 3: Adaptive causal validation (goodbye static thresholds)

Most causal pipelines fail quietly at validation. Fixed $p$-values and one-size-fits-all thresholds assume identical data difficulty across platforms and time.

ACCD replaces this with experience-weighted validation:

$$ \theta_{\text{adapt}} = \theta_{\text{base}} \cdot e^{\gamma \cdot \text{success\_rate}} $$

Multiple causal estimators (synthetic control, causal forests, GAN-based models) are scored jointly on uncertainty, precision, and recall. Only effects that survive cross-model agreement and automated refutation tests are retained.
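A minimal sketch of both pieces, assuming a simple unanimous-agreement rule; the function names, the example scores, and the choice of γ are illustrative, and the paper's actual aggregation may weight estimators differently.

```python
import math

def adaptive_threshold(theta_base, success_rate, gamma=0.5):
    """theta_adapt = theta_base * exp(gamma * success_rate): the
    validation bar scales with the pipeline's track record in this
    context instead of staying fixed."""
    return theta_base * math.exp(gamma * success_rate)

def accept_effect(estimator_scores, theta):
    """Hypothetical cross-model agreement: retain a causal effect only
    if every estimator (synthetic control, causal forest, GAN-based)
    clears the adapted threshold."""
    return all(score >= theta for score in estimator_scores)

theta = adaptive_threshold(0.6, success_rate=0.8)
scores = {"synthetic_control": 0.93, "causal_forest": 0.91, "gan": 0.90}
print(round(theta, 3), accept_effect(scores.values(), theta))
```

An effect that one estimator rates highly but another rejects never reaches the adapted bar, which is the point: agreement across dissimilar estimators substitutes for a hand-tuned $p$-value.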

This is less about statistical purity and more about institutional memory.

Findings — What changes when systems remember

| Metric | Best Prior Baseline | ACCD |
|---|---|---|
| F1-score (Twitter IRA) | 75.8% | 87.3% |
| Manual labeling effort | 100% | −68% |
| Training time | 181 min | 72 min |
| Memory usage | 8.2 GB | 4.5 GB |

Beyond metrics, ACCD exposes structure:

  • Leader accounts exhibit strong outbound causal influence.
  • Follower accounts absorb rather than initiate signals.
  • Performance gains are consistent across hashtags, retweets, and forum-style discussions.

This matters for forensics, not just detection.

Implications — Detection systems as learning institutions

ACCD’s real contribution is philosophical:

  • Detection is not a model—it’s a process.
  • Causality without adaptation is brittle.
  • Automation without memory just scales mistakes faster.

For platforms, this suggests a path away from endless rule-tuning. For regulators, it hints at auditability grounded in causal reasoning rather than opaque correlation scores. For AI governance, it quietly reframes moderation as a form of adaptive risk management.

Conclusion — Memory is the missing primitive

ACCD does not claim to end coordinated manipulation. It does something more realistic: it builds a system that learns from its own past successes and failures.

In an adversarial environment where behaviors evolve faster than policies, memory is not a luxury—it is the only durable defense.

Cognaptus: Automate the Present, Incubate the Future.