Tags → #ai safety

Research Highlights Risks in General-Purpose AI Deployment

Research by Mario Fritz and his team explores the significance of Data-Instruction Separation (DIS) in enhancing the security of AI-driven systems against evolving cybersecurity threats.
SQL Injection Jailbreak Method Exposes LLM Vulnerabilities

Recent research has introduced the SQL Injection Jailbreak method, which reveals vulnerabilities in large language models and emphasizes the need for robust defense mechanisms to enhance AI security.
Study Reveals Vulnerabilities in Large Language Models

A recent study reveals that large language models are vulnerable to jailbreak attacks, prompting researchers to propose a new defense framework called AutoDefense to enhance their security.
Study Reveals Vulnerabilities in Large Language Models

A recent study has introduced a new framework called DrAttack, which enhances the success rate of jailbreak attacks on Large Language Models by employing a comprehensive approach that includes decomposition, implicit reconstruction, and synonym search.
Concerns Raised Over Safety of Large Language Models

Recent studies highlight concerns about the safety and ethical implications of large language models (LLMs), revealing vulnerabilities in current unlearning methods that could allow adversaries to access sensitive information.