OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabilities.
Back to news
Related Stories
Large Language Models
How we monitor internal coding agents for misalignment
OpenAI Blog · 1d ago
Large Language ModelsOpenAI to acquire Astral
OpenAI Blog · 1d ago
Large Language ModelsShow HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
Hacker News · 2d ago
Large Language ModelsOpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first
OpenAI Blog · 3d ago