In-Context Learning Attack

Hijacking Large Language Models via Adversarial In-Context Learning featured image

Hijacking Large Language Models via Adversarial In-Context Learning

This work introduces a novel transferable attack against In-Context-Learning to hijack LLMs to generate the target response or jailbreak. We also propose a defense strategy …

avatar
Xiangyu Zhou