[RESOURCES]

Benchmarks, guides, and compliance docs.

XOR is the verification platform for AI coding agents. One loop: detect the vulnerability, patch it with an agent, verify the fix, and feed results back so agents learn.

Current verified dataset: 128 CVE samples, 1,920 evaluations across 15 agent configurations. Target scale: 6,138+ vulnerabilities across 250+ projects.

Platform overview: Detect → Patch → Verify → Learn.

Compare 15 agents on real bugs →

[START HERE]

One loop. Patch, verify, learn.

OutcomeKnow which agents fix real vulnerabilities before you deploy them.

MechanismXOR detects the CVE, dispatches an agent to patch it, writes a verifier, confirms the fix, and feeds results back into the agent harness.

ProofCurrent verified dataset: 128 CVE samples, 1,920 evaluations.

Compare 15 agents on real bugs →

75 resources across 9 topic clusters

[BENCHMARK]

CVE-Agent-Bench

62.7% pass rate. $2.64 per fix. Real data from 1,920 evaluations.

Results & Leaderboard

62.7% pass rate. $2.64 per fix. Real data from 1,920 evaluations.

Agent Profiles→Economics→Methodology→Cost Analysis→Bug Complexity→Agent Strategies→Execution Metrics→Validation Process→Council Deliberations→Pricing Transparency→

[PLATFORM]

Patches CVEs automatically. Reviews every AI-generated PR with a pass/fail verification report.

Getting Started→Capabilities→PR Verification→Automated Patching→Dependabot Verification→Compliance Evidence→Compatibility→Command Reference→

[SECURITY]

AI agents run with real permissions. XOR verifies tool configurations, sandbox boundaries, and credential exposure.

Third-Party Risk

33% of enterprise software will be agentic by 2028. 40% of those rollouts will be canceled due to governance failures. A risk overview for CTOs.

Agent Attack Landscape→MCP Security→Security Economics→Verified AgentSkills→Building Secure Skills→Secure Agent Handoff→Supply Chain Risk Index→

[STANDARDS & COMPLIANCE]

Standards Overview

How XOR signed audit logs satisfy SOC 2, EU AI Act, PCI DSS, NIST, and other compliance requirements.

Agent Governance→OWASP Agentic Top 10→Agentic SecEcon→Agent Compliance Evidence→Agent Trajectories→

[AGENTS]

Claude Opus 4.5→Claude Opus 4.6→Codex GPT-5.2→Codex GPT-5.2 Codex→Cursor Composer 1.5→Cursor GPT-5.2→Cursor GPT-5.3 Codex→Cursor Opus 4.6→Gemini 3 Pro→Gemini 3.1 Pro→OpenCode Opus 4.5→OpenCode Opus 4.6→OpenCode Gemini 3.1→OpenCode GPT-5.2→OpenCode GPT-5.2 Codex→

[LABS]

Google→OpenAI→Anthropic→Nvidia→

[COMPARISONS]

Native vs Wrapper→Cost vs Performance→Behavioral Clusters→Cross-Agent Agreement→Model Upgrades→Ensemble Strategies→

[VULNERABILITY TYPES]

Bounds checks→Guard checks→Logic fixes→Allocation fixes→

[CODEBASES]

Text Shaping→Archive Library→Service Proxy→Web Server→Git Library→Network Switch→Data Compressor→Mesh Networking→PGP Library→Image Codec→