Beyond Sharp Minima: Robust LLM Unlearning via Feedback-Guided Multi-Point Optimization
arXiv:2509.20230v3 Announce Type: replace Abstract: Current LLM unlearning methods face a critical security vulnerability that undermines their fundamental purpose: while they appear to successfully remove sensitive or harmful knowledge, this “forgotten” information remains precariously recoverable through relearning attacks. We identify…
