All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

Deployment-Time Memorization in Foundation-Model Agents: Privacy-Utility Tradeoffs in Persistent Memory Systems

By

[Submitted on 8 Jun 2026]

10d ago· 2 min readenInsight

Summary

This paper introduces the concept of "deployment-time memorization" in foundation-model agents, where memory is an explicit function during deployment rather than just a property of model weights. The authors study how memory-design choices (summarization aggressiveness, retrieval breadth, and deletion mode) jointly affect personalization utility, extraction risk, and deletion fidelity. They propose metrics including Personalization Recall (PR), Adversarial Extraction Rate (AER), and Forgetting Residue Score (FRS). Key findings show that key-fact summarization reduces canary extraction by 76% on Gemma 3 12B and 64% on GPT-4o-mini while preserving personalization recall, but raw-only deletion leaves derived summary copies recoverable in ~20% of instances, requiring full-pipeline purge or tombstone redaction for complete erasure.

Source

bskyDeployment-Time Memorization in Foundation-Model Agents: Privacy-Utility Tradeoffs in Persistent Memory Systemsarxiv.org

Key quotes

· 5 pulled
Foundation-model agents are increasingly long-lived systems that remember users across interactions, making memorization an explicit deployment-time function rather than solely a property of model weights.
Existing work addresses parametric memorization or audits fixed memory configurations, but does not characterize how memory-design choices jointly shape personalization utility, extraction risk, and deletion fidelity.
On LongMemEval, key-fact summarization reduces canary extraction by 76% on Gemma 3 12B and 64% on GPT-4o-mini while preserving nearly all personalization recall; critically, once content is compressed away, increasing k no longer restores leakage.
The same compression, however, induces a deletion-fidelity failure: raw-only deletion leaves derived summary copies recoverable in approximately 20% of instances, and only full-pipeline purge or tombstone redaction drives worst-tier residue to zero.
Together, these results establish that persistent agent memory must be evaluated as a first-class memorization mechanism -- assessed by what it helps agents recall, what it makes extractable, and what it can truly erase.
Snippet from the RSS feed
Foundation-model agents are increasingly long-lived systems that remember users across interactions, making memorization an explicit deployment-time function rather than solely a property of model weights. Existing work addresses parametric memorization o

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.