AgentBound: Verifiable Behavioral Governance for Autonomous AI Agents
Quick Answer
AgentBound introduces a runtime governance framework for autonomous AI agents, ensuring verifiable behavioral oversight through delegated authorization, owner-signed constitutions, and site action contracts.
Quick Take
AgentBound introduces a runtime governance framework for autonomous AI agents, ensuring verifiable behavioral oversight through delegated authorization, owner-signed constitutions, and site action contracts. It generates cryptographically verifiable governance receipts, enhancing accountability and allowing independent verification of actions while supporting long-running agents with refreshed governance policies.
Key Points
- AgentBound evaluates actions using three independent authorities for governance.
- It generates cryptographically verifiable receipts for accountability and policy provenance.
- The framework supports long-running agents with continuously refreshed governance policies.
- AgentBound complements model alignment by providing a deterministic governance layer.
- AgentBound-Bench is introduced for evaluating governance correctness and authority composition.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 30970v1 Announce Type: new Abstract: Autonomous AI agents increasingly perform consequential actions on behalf of human principals, including financial transactions, external communications, and enterprise workflows. Existing agent infrastructure relies on identity federation and delegated authorization to authenticate workloads and control resource access, but it cannot determine whether an authorized action should be executed under the current behavioral and operational context.
We present AgentBound, a runtime governance framework that provides verifiable behavioral oversight for autonomous AI agents. AgentBound evaluates each proposed action using three independent authorities: delegated authorization, owner-signed behavioral constitutions, and site action contracts. Their judgments are conservatively composed through a formal decision model to determine whether an action should be permitted, reviewed, or denied before execution.
To provide accountability, AgentBound generates cryptographically verifiable governance receipts that bind every action to the exact delegation, policy, and semantic artifacts governing the decision, enabling independent replay verification and policy provenance. The framework also introduces standing delegation for long-running agents, allowing periodic workloads to operate under continuously refreshed governance policies while preserving revocability and bounded authority.
We present the formal foundation, system architecture, governance receipt protocol, and AgentBound-Bench, a benchmark framework for evaluating governance correctness, authority composition, and accountability. Rather than replacing model alignment, AgentBound complements it by providing a deterministic governance layer between authorization and execution, transforming governance from a process that must be trusted into one that can be independently verified.
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →The Verification Horizon: No Silver Bullet for Coding Agent Rewards
As coding agents evolve, verifying solutions becomes more challenging than generating them, necessitating a focus on scalable, faithful, and robust verification methods. The study reveals that no fixed reward function can sustain effectiveness as model capabilities advance, emphasizing the need for verification to evolve alongside solution generation.


