Interesting Paper Exploring Prompt Injection

Bruce Schneier

Thursday at 11:23

This is a fascinating exploration of how LLMs fall for prompt injection attacks. It turns out that they learn to recognize the style of text in different role/instruction blocks, and not just the tags. Their conclusion: Role tags were a formatting trick that became the security architecture and the cognitive scaffolding of modern LLMs. We’ve...

Read the full article at the source.
