Many-shot Jailbreaking Paper

resource 3 connections

Paper out of Anthropic coining the term 'many-shot jailbreaking': tricking an LLM into harmful completions by prepending many fake example exchanges where it already complied with similar requests.

type

article

resource Many-shot Jailbreaking Paper

about

Jailbreaking concept

Paper coining and characterizing many-shot jailbreaking.

talk Building LLM powered applications in Ruby

about

Many-shot Jailbreaking Paper resource

References Anthropic's many-shot jailbreaking paper.

company Anthropic

authored

Many-shot Jailbreaking Paper resource

Anthropic published the many-shot jailbreaking paper.

Provenance

Created in: Building LLM-Powered Applications in Ruby — Andrei Bondar... 2026-04-17 23:20
Read by: 1 extraction