A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing
Quick Answer
PERSUASIONTRACE introduces a framework for analyzing multi-turn persuasion in human-LLM interactions, revealing that LLMs effectively influence beliefs across various topics.
Quick Take
PERSUASIONTRACE introduces a framework for analyzing multi-turn persuasion in human-LLM interactions, revealing that LLMs effectively influence beliefs across various topics. A Bayesian-network simulated target closely mimics human belief dynamics, scoring 81 compared to 64 for baseline LLMs, enhancing the evaluation of persuasive systems beyond endpoint measures.
Key Points
- PERSUASIONTRACE records multi-turn belief updates and annotates rhetorical strategies.
- Human targets cluster into two groups based on belief update patterns.
- LLMs demonstrate persuasive abilities in generic and personalized contexts.
- Bayesian-network targets achieve human-like belief dynamics with an 81 score.
- Traditional LLM simulators fail to replicate realistic human belief changes.
Article Content
From source RSS / original summaryarXiv:2606. 05330v1 Announce Type: new Abstract: Large language models can shift human beliefs across high-stakes domains, but most persuasion studies rely on pre/post belief change. These endpoint measures identify whether persuasion occurred, yet miss where and how beliefs moved within a dialogue. We present PERSUASIONTRACE, a framework for studying persuasion in human-LLM interaction.
Built on a web-based experimental platform, PERSUASIONTRACE contributes a tool for multi-turn persuasion studies and a process-level evaluation protocol: it records multi-turn belief reports from human or simulated targets of persuasion, annotates persuader turns with rhetorical dimensions (logos/pathos/ethos), and evaluates simulators by fidelity to real human belief dynamics.
Using this framework, we find that human targets group into two clusters of multi-turn belief updates and exhibit susceptibility to rhetorical strategies, and that LLMs are persuasive across generic and personalized topics, text and audio modalities, and multi-turn interactions. Prior work has chiefly used vanilla-prompted LLMs to simulate human targets, but we show that these simulators fail to replicate human belief dynamics.
We introduce a Bayesian-network simulated target that maintains an explicit latent belief state over time so each persuader message yields cognitively realistic belief updates. In human-likeness evaluation, our Bayesian target scores near a human reference (81 vs 80), while baseline LLM targets score substantially lower (64).
PERSUASIONTRACE reframes persuasion evaluation from endpoint movement alone to process fidelity, providing a stronger basis for scientific analysis and safer optimization of persuasive systems.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.