Introducing EVMbench

OpenAI Blog, Feb 18, 2026

OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabilities.
