Study Reveals Vulnerabilities in Large Language Models
A recent study introduces DrAttack, a framework that raises the success rate of jailbreak attacks on large language models (LLMs) by combining three steps: prompt decomposition, implicit reconstruction, and synonym search.