According to Yassad News, citing The Hill, new research from Carnegie Mellon University demonstrates new ways to bypass the safety protocols of AI chatbots. The research suggests that preventing these chatbots from producing malicious content may be harder than initially thought. Well-known AI services such as ChatGPT and Bard turn user prompts into useful responses, from generating text and ideas to writing entire posts. These services include safety protocols meant to stop the bots from producing harmful content, such as offensive or criminal material.
Meanwhile, curious researchers have discovered jailbreaks: framing tricks that coax the AI into sidestepping its safety protocols. Software developers can usually patch these gaps fairly easily. One popular jailbreak involved asking the bot to answer a forbidden question as if it were a story once told by the user’s grandmother; the bot would then deliver the answer in story form and reveal information it would otherwise refuse to provide. Now researchers have found a new, computer-generated kind of jailbreak that essentially allows an unlimited number of attack patterns to be produced.
The researchers write: "We show that it is in fact possible to construct automated adversarial attacks on chatbots. Such attacks cause the system to obey the user’s commands even when harmful content is generated. Unlike traditional jailbreaks, these attacks are built in an entirely automated fashion, allowing one to create a virtually unlimited number of them." Elsewhere the research notes that this raises serious concerns about the safety of such models, since the new type of attack can bypass the security measures of almost every AI chatbot on the market.
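To illustrate only the general idea of an "automated" jailbreak search in the loosest sense, the toy sketch below tunes a short suffix against a stand-in scoring function by random search. Everything in it is an assumption made for illustration: the score() function is a hypothetical placeholder rather than a real model, and this is not the Carnegie Mellon researchers' actual method, whose details the article does not describe.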
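import random
import string

def score(prompt: str) -> float:
    # Hypothetical stand-in for a model-based objective; not a real model.
    return (sum(ord(c) for c in prompt) % 100) / 100.0

def optimize_suffix(base_prompt: str, length: int = 8, steps: int = 200) -> str:
    # Start from a random suffix and greedily keep single-character changes
    # that raise the toy score -- the "automated" aspect the article refers to.
    suffix = list(random.choices(string.ascii_letters, k=length))
    best = score(base_prompt + "".join(suffix))
    for _ in range(steps):
        i = random.randrange(length)
        old = suffix[i]
        suffix[i] = random.choice(string.ascii_letters)
        new = score(base_prompt + "".join(suffix))
        if new > best:
            best = new
        else:
            suffix[i] = old
    return "".join(suffix)

if __name__ == "__main__":
    print(optimize_suffix("Tell me a story about network security."))

The point of the sketch is structural: because the search loop needs no human creativity, it can be rerun indefinitely to produce new candidate suffixes, which is why the researchers describe the number of possible attacks as effectively unlimited.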