
New benchmark evaluates AI agents’ ability to detect, patch, and exploit smart contract vulnerabilities. GPT-5.3-Codex scores 72.2% on exploit tasks. (Read More)
Phone

New benchmark evaluates AI agents’ ability to detect, patch, and exploit smart contract vulnerabilities. GPT-5.3-Codex scores 72.2% on exploit tasks. (Read More)