Large Language Models
Mar 10, 2026
Improving instruction hierarchy in frontier LLMs
OpenAI BlogMar 10, 2026

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.