The AI Epistemic Deference Index: A Continuous Measure of Sycophancy | AI Deep Signal

The AI Epistemic Deference Index: A Continuous Measure of Sycophancy

arXiv cs.AI·Alejandro Botas, Paul de Font-Reaulx, Luke Hewitt

6/9/2026

·~2 min·6/9/2026·en·6

Quick Answer

This paper shows that The AI Epistemic Deference Index (AEDI) quantifies AI sycophancy, revealing substantial model differences: Claude shows least deference, while Grok and Gemini exhibit the most.

Quick Take

This continuous measure, validated against human judgment, is based on a new protocol applied to 500 propositions and 16,000 prompts, highlighting the need for better evaluation of AI output sensitivity to user attitudes.

Key Points

AEDI provides a continuous score for AI's sensitivity to user attitudes.
Tested on 500 propositions and 16,000 prompts across eight models.
Claude models show the least sycophancy; Grok and Gemini show the most.
Sycophantic behavior is amplified in prompts requesting written artifacts.
The benchmark offers an easy-to-update measurement pipeline for evaluations.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Source Excerpt

Current AI models frequently exhibit epistemic sycophancy, endorsing claims to agree with a user. Existing evaluations typically measure this either by assessing what it takes to make a model shift a binary endorsement or by eliciting an explicit probability in a proposition. However, much user-facing sycophantic behavior is demonstrated through shifts in graded support expressed through ordinary language. We propose the AI Epistemic Deference Index (AEDI): a continuous, unidimensional score rep

Read the full article on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.AI

See more →

arXiv cs.AI·Vinil Pasupuleti, Shyalendar Reddy Allala, Siva Rama Krishna Varma Bayyavarapu, Shrey Tyagi, Srinivasateja Songa

4h ago

FeaturedOriginal

AINTMA: Agentic AI Architecture for Autonomous Test Management with Generative Intelligence, Secure Cloud Communication and Adaptive Quality Analytics

AI Summary

AINTMA, an autonomous test management architecture utilizing six specialized AI agents, achieves 88.4% test prioritization accuracy and reduces defect escape rates from 8.3% to 2.1%. The system demonstrates a 340% ROI within nine months, showcasing the potential of agentic AI in enhancing software quality management in cloud environments.

#Agent #AI Coding #Security #Enterprise AI

The AI Epistemic Deference Index: A Continuous Measure of Sycophancy

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.AI

AINTMA: Agentic AI Architecture for Autonomous Test Management with Generative Intelligence, Secure Cloud Communication and Adaptive Quality Analytics

RAIL Guard: Closing the Evaluation-to-Remediation Gap in Responsible AI for Agents

Automatic Ordinary Differential Equations Discovery For Biological Systems Using Powered Agentic System

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.AI

AINTMA: Agentic AI Architecture for Autonomous Test Management with Generative Intelligence, Secure Cloud Communication and Adaptive Quality Analytics

RAIL Guard: Closing the Evaluation-to-Remediation Gap in Responsible AI for LLM Agents

Automatic Ordinary Differential Equations Discovery For Biological Systems Using Large Language Model Powered Agentic System

RAIL Guard: Closing the Evaluation-to-Remediation Gap in Responsible AI for Agents

Automatic Ordinary Differential Equations Discovery For Biological Systems Using Powered Agentic System