Friday, March 6, 2026
HomeCryptocurrencyOpenAI Researches AI Brokers Detecting Sensible Contract Flaws

OpenAI Researches AI Brokers Detecting Sensible Contract Flaws


OpenAI has launched a brand new benchmark that evaluates how nicely totally different AI fashions detect, patch, and even exploit safety vulnerabilities present in crypto sensible contracts.

OpenAI launched the “EVMbench: Evaluating AI Brokers on Sensible Contract Safety” paper on Wednesday, in collaboration with crypto funding agency Paradigm and crypto safety agency OtterSec, to judge how a lot the AI brokers might theoretically exploit from 120 sensible contract vulnerabilities.

Anthropic’s Claude Opus 4.6 got here out on prime with a median “detect award” of $37,824, adopted by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Professional at $31,623 and $25,112, respectively.

Detect awards gained by AI brokers. Supply: OpenAI

Whereas AI brokers have gotten more and more environment friendly at dealing with fundamental duties, OpenAI mentioned it’s turning into extra vital to judge their efficiency in “economically significant environments.”

“Sensible contracts safe billions of {dollars} in belongings, and AI brokers are more likely to be transformative for each attackers and defenders.”

“We anticipate agentic stablecoin funds to develop, and assist floor it in a site of rising sensible significance,” OpenAI added.