OpenAI has launched a brand new benchmark that evaluates how nicely totally different AI fashions detect, patch, and even exploit safety vulnerabilities present in crypto sensible contracts.
OpenAI launched the “EVMbench: Evaluating AI Brokers on Sensible Contract Safety” paper on Wednesday, in collaboration with crypto funding agency Paradigm and crypto safety agency OtterSec, to judge how a lot the AI brokers might theoretically exploit from 120 sensible contract vulnerabilities.
Anthropic’s Claude Opus 4.6 got here out on prime with a median “detect award” of $37,824, adopted by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Professional at $31,623 and $25,112, respectively.
Whereas AI brokers have gotten more and more environment friendly at dealing with fundamental duties, OpenAI mentioned it’s turning into extra vital to judge their efficiency in “economically significant environments.”
“Sensible contracts safe billions of {dollars} in belongings, and AI brokers are more likely to be transformative for each attackers and defenders.”
“We anticipate agentic stablecoin funds to develop, and assist floor it in a site of rising sensible significance,” OpenAI added.
Circle CEO Jeremy Allaire predicted on Jan. 22 that billions of AI brokers shall be transacting with stablecoins for on a regular basis funds on behalf of customers inside 5 years, whereas former Binance boss Changpeng “CZ” Zhao additionally just lately tipped that crypto would find yourself being the “native foreign money for AI brokers.”
The necessity to check agentic AI efficiency in recognizing safety vulnerabilities comes as attackers stole $3.4 billion value of crypto funds in 2025, a marginal enhance from 2024.
Associated: China’s AI lead will form crypto’s future
EVMbench drew on 120 curated vulnerabilities from 40 sensible contract audits, most of which had been sourced from open-source audit competitions. OpenAI mentioned it hopes the benchmark will assist monitor AI progress in recognizing and mitigating sensible contract vulnerabilities at scale.
Sensible contracts weren’t constructed for people: Dragonfly
In a submit to X on Wednesday, Dragonfly’s managing companion Haseeb Qureshi mentioned crypto’s promise of changing property rights and authorized contracts by no means materialized, not as a result of the expertise failed, however as a result of it was by no means designed for human instinct.
Qureshi mentioned it nonetheless feels “terrifying” to signal giant transactions, notably with drainer wallets and different threats at all times current, whereas financial institution transfers not often provoke the identical concern.
Dragonfly’s @hosseeb explains why AI brokers will use crypto slightly than the normal monetary system:
“You may see it proper now on Moltbook. Brokers are looking for methods to pay one another for issues. It’s extremely primitive proper now, however you’ll be able to see the place it is going.”
“If I… pic.twitter.com/oWzQuuZcWN
— TBPN (@tbpn) February 18, 2026
As an alternative, Qureshi believes the way forward for crypto transactions shall be facilitated by AI-intermediated, self-driving wallets, which can care for these threats and handle advanced operations on behalf of customers:
“A expertise typically snaps into place as soon as its complement lastly arrives. GPS needed to look ahead to the smartphone, TCP/IP needed to look ahead to the browser. For crypto, we would simply have discovered it in AI brokers.”
Journal: IronClaw rivals OpenClaw, Olas launches bots for Polymarket — AI Eye
