Crypto Ticker:
sysadmin from Schneier on Security

Interesting Paper Exploring Prompt Injection

Bruce Schneier
Thursday at 11:23
2 Views
0 Comments

This is a fascinating explotation of how LLMs fall for prompt injection attacks. It turns out that they learn to recognize the style of text in different role/instruction blocks, and not just the tags. Their conclusion: Role tags were a formatting trick that became the security architecture and the cognitive scaffolding of modern LLMs. We’ve...

Read the full article at the source.

Was this helpful?
Share:

Comments (0)

Please login to post a comment

No comments yet. Be the first to comment!