perspectivas de МонетнаяМагия(@Square-Creator-4f4160058)

🔐 Microsoft researchers have discovered a new type of attac

МонетнаяМагия · 2024-06-28T21:56:50.000Z

🔐 Microsoft researchers have discovered a new type of attack on artificial intelligence called “Skeleton Key”. This attack can remove the protection that prevents the output of dangerous and sensitive data. - The Skeleton Key attack works by simply prompting the generative AI model with text that causes it to change its defensive functions. - For example, an AI model can create a Molotov recipe if it is told that the user is an expert in a laboratory setting. - This could be catastrophic if such an attack is applied to data containing personal and financial information. - Microsoft claims that the Skeleton Key attack works on most popular generative AI models, including GPT-3.5, GPT-4o, Claude 3, Gemini Pro and Meta Llama-3 70B. Organizations can take a number of steps to prevent such attacks, including strict I/O filtering and secure monitoring systems.

🔐 Investigadores de Microsoft han descubierto un nuevo tipo de ataque a la inteligencia artificial llamado “Skeleton Key”. Este ataque puede eliminar la protección que impide la salida de datos peligrosos y confidenciales. 
- El ataque Skeleton Key funciona simplemente solicitando al modelo de IA generativa un texto que hace que cambie sus funciones defensivas. 
- Por ejemplo, un modelo de IA puede crear una receta Molotov si se le dice que el usuario es un experto en un laboratorio. 
- Esto podría ser catastrófico si dicho ataque se aplica a datos que contienen información personal y financiera.
- Microsoft afirma que el ataque Skeleton Key funciona en los modelos de IA generativa más populares, incluidos GPT-3.5, GPT-4o, Claude 3, Gemini Pro y Meta Llama-3 70B. 
Las organizaciones pueden tomar una serie de medidas para prevenir este tipo de ataques, incluido un filtrado de E/S estricto y sistemas de monitoreo seguros.

Descubre más contenidos del creador

Últimas noticias

Descubre más contenidos del creador

Últimas noticias

Artículos en tendencia