Interesting Paper Exploring Prompt Injection

Bruce Schneier

Thursday at 11:23

5 Visninger

0 Kommentarer

This is a fascinating explotation of how LLMs fall for prompt injection attacks. It turns out that they learn to recognize the style of text in different role/instruction blocks, and not just the tags. Their conclusion: Role tags were a formatting trick that became the security architecture and the cognitive scaffolding of modern LLMs. We’ve...

Læs hele artiklen hos kilden.

Læs original artikel

Var dette nyttigt?

Del:

Kommentarer (0)

Vennligst logg inn for å skrive en kommentar

Ingen kommentarer ennå. Bli den første til å kommentere!

Relaterede nyheder

Nourish: A New Wayland Compositor Powered By Vulkan With Infinite Scrolling/Panning

7 hours ago

Lenke kopiert til utklippstavlen

Interesting Paper Exploring Prompt Injection

Kommentarer (0)

Relaterede nyheder

Nourish: A New Wayland Compositor Powered By Vulkan With Infinite Scrolling/Panning

Teenage Engineering adds lo-fi mode, USB audio, and more to its KO II sampler

I don't even use a Mac, but this is still my favorite way to install software

This Week In Techdirt History: June 21st – 27th

Margaret Atwood says the problem with AI is &#8216;garbage in, garbage out&#8217;

Gennemse efter kategori

Margaret Atwood says the problem with AI is ‘garbage in, garbage out’