Exploring the Secondary Risks of Large Language Models
arXiv:2506.12382v3 Announce Type: replace Abstract: Ensuring the safety and alignment of Large Language Models is a significant challenge with their growing integration into critical applications and societal functions. While prior research has primarily focused on jailbreak attacks, less attention has…
