When Models Remember Too Much: The Quiet Problem of Memorization Sinks
Opening: Why this matters now
Large language models are getting better at everything, from writing and coding to reasoning and politely apologizing when they hallucinate. Yet beneath these broad performance gains lies a quieter, more structural issue: memorization does not happen evenly. Some parts of the training data exert disproportionate influence, acting as gravitational wells that trap model capacity. These are what the paper terms memorization sinks. ...