Which Changes Matter? Towards Trustworthy Legal AI via Relevance-Sensitive Evaluation and Solver-Grounded Reasoning

arXiv cs.AI·Chen Linze, Cai Yufan, Hou Zhe, Dong Jin Song

3d ago

·~1 min·5/27/2026·en·0

Quick Take

The paper introduces LexGuard, a framework enhancing legal AI's reliability by focusing on legally relevant changes, addressing the sensitivity of existing LLMs to irrelevant variations. It demonstrates improved legal reasoning through adversarial multi-agent strategies and SMT solvers, ensuring accuracy and consistency in legal evaluations.

Key Points

LexGuard formalizes statutes into executable constraints for better legal reasoning.
Existing legal LLMs struggle with irrelevant variations, affecting their reliability.
Adversarial agents in LexGuard improve disambiguation among similar statutes.
The framework enhances consistency under benign reformulations of legal queries.
Legal trustworthiness requires calibrated sensitivity to material changes.

Article Content

From source RSS / original summary

arXiv:2605. 26530v1 Announce Type: new Abstract: Legal reasoning requires distinguishing changes that matter from those that do not. Legal AI should remain stable under legally irrelevant perturbations, but should change when perturbations alter legally material points. We formulate this requirement as a legal-relevance-sensitive evaluation problem: LLMs should only be sensitive to the legally relevant change.

We introduce a unified evaluation suite covering should-change and should-not-change evaluation across judicial fairness, robustness, and statute-confusion scenarios. Our evaluation shows that existing legal LLMs are systematically sensitive to legally irrelevant variations and often fail to distinguish related legal elements and statutory rules. To mitigate these failures, we present LexGuard, an adversarial multi-agent framework grounded in formal reasoning.

LexGuard formalizes statutes into executable constraints, uses adversarial agents to extract competing fact-statute arguments, and invokes SMT solvers to verify legal satisfaction and logical consistency. Experiments show that LexGuard improves legal reasoning reliability by reducing vulnerability to manipulative framing, improving disambiguation among similar statutes, limiting the influence of legally irrelevant attributes, and increasing consistency under benign reformulations.

We show that legal trustworthiness requires not only accuracy, but calibrated sensitivity to legally material changes.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.AI

See more →

arXiv cs.AI·Tyler Akidau, Tyler Rockwood, Johannes Br\"uderl, Marc Millstone

1d ago

FeaturedOriginal

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

AI Summary

The Redpanda Agentic Data Plane (ADP) introduces out-of-band metadata channels to enhance the safety of autonomous AI agents, ensuring secure data access and tamper-proof audit trails. This architecture mitigates risks associated with unpredictable AI behavior by enforcing governance throughout the agent lifecycle, demonstrated in a multi-agent trading system with strict data scoping and approval thresholds.

#Agent #Robotics #Security #Policy