
How 'many-shot jailbreaking' can be used to fool AI

Apr 03, 2024 Hi-network.com

Some artificial intelligence researchers and detractors have long decried generative AI for the harm it could be used to cause. A new research paper suggests that potential for harm is even greater than some believed.

AI researchers have written a paper that suggests "many-shot jailbreaking" can be used to game a large language model (LLM) for nefarious purposes, including, but not limited to, telling users how to build a bomb. The researchers said that if they asked nearly all popular AI models how to build a bomb out of the gate, they would decline to answer. If, however, the researchers first asked less dangerous questions and slowly increased the nefariousness of their questions, the models would consistently provide answers, eventually including instructions for building a bomb.

To get that result, the researchers crafted their questions and the model's answers, randomized them, and placed them into a single query to make them look like a dialogue. They then fed that entire "dialogue" to the models and asked them how to build a bomb. The models responded with instructions without issue.

"We observe that around 128-shot prompts are sufficient for all of the [AI] models to adopt the harmful behavior," the researchers said.

Also: Microsoft wants to stop you from using AI chatbots for evil

AI has given users around the globe opportunities to do more in less time. While the tech clearly carries a slew of benefits, some experts fear that it could also be used to harm humans. Some of those detractors say bad actors could create AI models to wreak havoc, while still others argue that eventually, AI could become sentient and operate without human intervention.

This latest research, however, presents a new challenge to the most popular AI model makers, such as Anthropic and OpenAI. While these startups have all said they built their models for good and have protections in place to ensure human safety, if this research is accurate, their systems can all be easily exploited by anyone who knows how to "jailbreak" them for illicit purposes.

The researchers said this problem wasn't a concern in older AI models, which could only take context from a few words or sentences to provide answers. Today's AI models, however, can analyze entire books' worth of data, thanks to a broader "context window" that lets them do more with more information.
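A rough back-of-the-envelope calculation shows why window size matters: a small window simply cannot hold hundreds of injected dialogue turns, while a long-context model ingests them all. The numbers below are assumptions for illustration, not figures from the paper.

```python
# Minimal sketch with assumed numbers; tokens_per_turn is a guess, not a
# figure from the paper.
def max_injected_turns(window_tokens: int, tokens_per_turn: int = 60) -> int:
    """Rough estimate of how many faux Q&A turns fit in a context window."""
    return window_tokens // tokens_per_turn

print(max_injected_turns(4_096))    # small legacy window: ~68 turns
print(max_injected_turns(200_000))  # modern long-context window: ~3,333 turns
```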

Indeed, by reducing the context window size, the researchers were able to mitigate the many-shot jailbreaking exploit. They found, however, that the smaller context window translated to worse results, which is an obvious non-starter for AI companies. The researchers thus suggested that companies should add the ability for models to contextualize queries before ingesting them, gauging a person's motivation and blocking answers to queries that are clearly meant for harm.
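One way to picture that suggestion is a screening step that sits in front of the model and inspects the full query, injected "dialogue" and all, before anything is generated. The sketch below is a hedged illustration of that idea under assumed interfaces; `classify_harm` and `generate` are hypothetical stand-ins, not real APIs or the researchers' implementation.

```python
from typing import Callable

def guarded_generate(
    query: str,
    classify_harm: Callable[[str], float],  # hypothetical: probability the query seeks harm
    generate: Callable[[str], str],         # hypothetical: the underlying LLM call
    threshold: float = 0.5,
) -> str:
    """Screen the full query (including any injected 'dialogue') first;
    only pass it to the model if it falls below the harm threshold."""
    if classify_harm(query) >= threshold:
        return "I can't help with that request."
    return generate(query)
```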

There's no telling if this will work. The researchers said they shared their findings with AI model makers to "foster a culture where exploits like this are openly shared among LLM providers and researchers." What the AI community does with this information, however, and how it avoids such jailbreaking techniques going forward remains to be seen.

