What are adversarial attacks?

Imagine you're playing a game where you have to guess what picture is on a card, but someone is trying to trick you by slightly changing the picture in sneaky ways. That's what adversarial attacks are.

Like a Messed-Up Drawing

Let’s say your friend draws a cat, and you try to tell if it's a cat or a dog. Now, imagine your friend changes just a few tiny parts of the drawing, like adding a little line here or shading a bit there, in a way that makes you think it's a dog instead of a cat. That’s an adversarial attack!

It's like when you're trying to read a label on a jar, but someone adds a few smudges so it looks like the wrong word.

How They Work

Computers can sometimes be tricked in the same way! A computer might learn to tell cats from dogs by looking at lots of pictures. But if someone changes just a few tiny parts, maybe with a special tool, the computer could get confused and think a cat is a dog. That's how adversarial attacks work in the world of computers!

Take the quiz →

Ask a question

See also

Discussion

Recent activity

Categories: Math