Large Language Models

Hijacking Large Language Models via Adversarial In-Context Learning

This work introduces a novel transferable attack against In-Context-Learning to hijack LLMs to generate the target response or jailbreak. We also propose a defense strategy …

Xiangyu Zhou

• Nov 16, 2023 • 1 min read

Large Language Models

An example preprint / working paper

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis posuere tellus ac convallis placerat. Proin tincidunt magna sed ex sollicitudin condimentum.

Xiangyu Zhou

• Apr 7, 2019 • 1 min read

Large Language Models

An example conference paper

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis posuere tellus ac convallis placerat. Proin tincidunt magna sed ex sollicitudin condimentum.

Xiangyu Zhou

• Jul 1, 2013 • 1 min read

No results found

Large Language Models

Hijacking Large Language Models via Adversarial In-Context Learning

An example preprint / working paper

An example conference paper