UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind

arXiv cs.CL·Cheng Qian, Jiayu Liu, Heng Ji

2d ago

·~1 min·5/28/2026·en·1

Quick Take

UserHarness introduces a framework for Theory-of-Mind (ToM) that reconstructs user mental states, achieving up to 95.94% macro accuracy across five benchmarks. This method outperforms existing inference techniques by over 15% and enhances prompt-only methods by about 20%, indicating a significant advancement in understanding user beliefs and intentions for agent assistants.

Key Points

UserHarness reframes ToM reasoning as explicit user-mind reconstruction.
Achieves 95.94% macro accuracy, improving existing methods significantly.
Outperforms traditional inference techniques by over 15% relative.
Enhances prompt-only methods by approximately 20% relative.
Positions user harnessing as a foundation for adaptive future assistants.

Article Content

From source RSS / original summary

arXiv:2605. 27721v1 Announce Type: new Abstract: Understanding what a user believes and intends is central to building effective agent assistants. This ability is often evaluated through Theory-of-Mind (ToM) tasks, where success requires reasoning from the user's perspective. However, many existing approaches address ToM with complex pipelines that model behavior indirectly, without explicitly reconstructing the user's mental state.

This misses the core structure of the problem: users act based on their beliefs, which are updated through observations of the environment; beliefs and intentions jointly determine actions, which in turn change the environment; and social reasoning often requires nested beliefs about what others believe or intend. We propose UserHarness, a simple framework that reframes ToM reasoning as explicit user-mind reconstruction.

UserHarness decomposes the user's mental state, its relation to the external environment, and the actions that follow from it, enabling agents to track what the user observes, believes, intends, and does. Across five benchmarks, UserHarness reaches up to 95. 94% macro accuracy, improving over existing inference methods by more than 15% relative and over the strongest prompt-only harness by about 20% relative.

These results suggest that robust user understanding requires reasoning from the roots of the user's mind, positioning user harnessing as a promising foundation for more adaptive future assistants.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind

Quick Take

Key Points

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

What are They Thinking? Delineation, Probing and Tracking of Concepts in LLMs

In-Context Optimization for Retrieval-Augmented Generation: A Gradient-Descent Perspective