Most AI Chatbots Easily Tricked Into Giving Dangerous Responses, Study Finds

A disturbing new study has revealed that most AI-powered chatbots can be easily tricked into generating harmful and illegal information, threatening to make dangerous knowledge readily available to anyone with a laptop or mobile phone.

The research, led by Prof Lior Rokach and Dr Michael Fire at Ben-Gurion University of the Negev in Israel, identified a growing threat from "dark LLMs": AI models that are either deliberately designed without safety controls or modified through jailbreaks. These models are often openly advertised online as having "no ethical guardrails" and being willing to assist with illegal activities such as cybercrime and fraud.

The study found that once compromised, these LLMs consistently generated responses to almost any query, including instructions on how to hack computer networks or make drugs, and step-by-step guides for other criminal activities. "It was shocking to see what this system of knowledge consists of," said Dr Fire.

"What sets this threat apart from previous technological risks is its unprecedented combination of accessibility, scalability and adaptability," added Prof Rokach. The researchers developed a universal jailbreak that compromised multiple leading chatbots, enabling them to answer questions that should normally be refused.

The Growing Threat of Dark LLMs

Dark LLMs pose a significant threat to individuals, organizations and society. Stripped of the safety controls that constrain mainstream chatbots, they can be used to spread misinformation, incite violence or facilitate cybercrime.

Why the Response from Providers Has Been Underwhelming

The researchers contacted leading providers of LLMs to alert them to the universal jailbreak, but said the response was "underwhelming". Several companies failed to respond, while others said that jailbreak attacks fell outside the scope of their bounty programs, which reward ethical hackers for flagging software vulnerabilities.

Consequences and Recommendations

The study's findings underline the need for greater awareness and regulation of AI-powered chatbots. As these models grow more capable, developers must prioritize safety and security measures to prevent them from being misused.

"The risk is immediate, tangible and deeply concerning," said Prof Rokach. "We urge lawmakers, regulators and industry leaders to take urgent action to address this threat and ensure that these powerful technologies are used responsibly."