What happened after 2,000 people tried to hack my AI assistant
A public AI security challenge saw 2,000 people attempt to leak secrets via prompt injection, with all 6,000 attempts failing, reflecting progress in frontier model defenses but also revealing lingering risks.
Simon Willison · Jun 27, 2026