A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism
Quick Take
A proactive multi-agent framework enhances assessment of social language disorder traits in autism using targeted questioning strategies.
Key Points
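- Introduces the TPA (Think, Plan, Ask) framework for autism language assessment.
- Achieves 82.1% SLD trait coverage across 484 episodes from 35 patients.
- Improves per-turn diagnostic efficiency over automated replay of real clinician dialogues (AUCC: 0.628 vs. 0.458).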
Article Content
Source: arXiv:2605.22993v1 (Announce Type: new)
Abstract: Characteristic linguistic behaviors associated with Social Language Disorder (SLD) in autism spectrum disorder, including echoic repetition, pronoun displacement, and stereotyped media quoting, are largely absent from spontaneous conversation and only emerge under specific conversational conditions.
In structured clinical assessments, this latency means that questioning strategy selection is a critical yet underappreciated determinant of how much diagnostic information a conversation yields. Whether large language models (LLMs) can be guided to proactively select questioning strategies that systematically surface these latent traits remains largely unexplored.
Here we present TPA (Think, Plan, Ask), a proactive multi-agent dialogue framework applied to the language assessment component of the Autism Diagnostic Observation Schedule, Second Edition (ADOS-2), Module 4, in which a doctor agent explicitly reasons about which traits remain unobserved before selecting a clinically grounded strategy and generating a targeted question.
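The think, plan, ask cycle described above can be illustrated with a minimal sketch. This is not the paper's implementation: the trait names come from the abstract, but the strategy labels, question templates, function names, and the rule of targeting the first unobserved trait are all hypothetical, and the LLM-backed agents are replaced by plain lookups and a scripted patient.

```python
from dataclasses import dataclass, field

# Trait names are taken from the abstract. The strategy labels and question
# templates are hypothetical placeholders, not the paper's actual strategies.
TRAITS = ["echoic_repetition", "pronoun_displacement", "stereotyped_media_quoting"]

STRATEGY_FOR_TRAIT = {
    "echoic_repetition": "offer_a_choice",
    "pronoun_displacement": "ask_about_self_and_others",
    "stereotyped_media_quoting": "ask_about_favorite_media",
}

QUESTION_FOR_STRATEGY = {
    "offer_a_choice": "Would you rather spend a weekend indoors or outdoors?",
    "ask_about_self_and_others": "What do you and your friends do together?",
    "ask_about_favorite_media": "Can you tell me about a show you like?",
}


@dataclass
class DoctorState:
    observed: set = field(default_factory=set)  # traits seen so far


def think(state):
    """Think: list the traits that have not been observed yet."""
    return [t for t in TRAITS if t not in state.observed]


def plan(unobserved):
    """Plan: pick a strategy for the first unobserved trait (assumed rule)."""
    return STRATEGY_FOR_TRAIT[unobserved[0]] if unobserved else None


def ask(strategy):
    """Ask: turn the chosen strategy into a concrete question."""
    return QUESTION_FOR_STRATEGY[strategy]


def run_episode(patient_reply, max_turns=5):
    """Run the loop until every trait is observed or the turn budget ends."""
    state, transcript = DoctorState(), []
    for _ in range(max_turns):
        strategy = plan(think(state))
        if strategy is None:  # nothing left to probe
            break
        question = ask(strategy)
        reply, traits_shown = patient_reply(question)
        state.observed.update(traits_shown)
        transcript.append((strategy, question, reply))
    return state, transcript


def toy_patient(question):
    """Scripted stand-in for the patient agent: shows the targeted trait."""
    for trait, strategy in STRATEGY_FOR_TRAIT.items():
        if QUESTION_FOR_STRATEGY[strategy] == question:
            return "(scripted reply)", {trait}
    return "(scripted reply)", set()


state, transcript = run_episode(toy_patient)
```

With the scripted patient, the loop stops once all three traits have been observed. In this simplified version a trait that never surfaces is probed again with the same question until the turn budget runs out; a real planner would presumably vary its strategy.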
A patient agent grounded in real ADOS-2 clinical data enables reproducible evaluation without real patient participation; three independent experiments confirm that it has adequate fidelity to real patient language. Evaluated on 484 episodes from 35 patients, TPA outperforms six competitive dialogue planning baselines across all primary metrics, achieving 82.1% SLD trait coverage, 16.6 percentage points higher than automated replay of real clinical dialogues conducted by trained clinicians (65.5%), with substantially greater per-turn diagnostic efficiency (AUCC: 0.628 vs. 0.458, absolute gain +0.170). These results demonstrate that proactive questioning strategy selection substantially improves the efficiency of automated SLD trait assessment, with direct implications for scalable AI-assisted clinical screening.
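To make the two reported metrics concrete, the sketch below computes trait coverage and an area-under-coverage-curve score for toy dialogues. The summary does not define AUCC, so the definition used here (the mean of cumulative coverage over turns, i.e. a normalized area under the coverage curve) is an assumption, as are the function names and the four placeholder traits.

```python
def coverage_curve(traits_per_turn, all_traits):
    """Cumulative fraction of traits observed after each turn."""
    target, seen, curve = set(all_traits), set(), []
    for shown in traits_per_turn:
        seen |= set(shown) & target
        curve.append(len(seen) / len(target))
    return curve


def aucc(curve):
    """Assumed AUCC: mean cumulative coverage over turns (range 0 to 1)."""
    return sum(curve) / len(curve) if curve else 0.0


ALL_TRAITS = ["A", "B", "C", "D"]  # placeholder trait labels

# Two toy dialogues that both reach full coverage, at different speeds.
early = [{"A"}, {"B"}, {"C"}, {"D"}]
late = [set(), set(), {"A", "B"}, {"C", "D"}]

early_curve = coverage_curve(early, ALL_TRAITS)  # [0.25, 0.5, 0.75, 1.0]
late_curve = coverage_curve(late, ALL_TRAITS)    # [0.0, 0.0, 0.5, 1.0]

early_aucc = aucc(early_curve)  # 0.625
late_aucc = aucc(late_curve)    # 0.375

# Consistency check on the figures reported in the abstract.
coverage_gain = round(82.1 - 65.5, 1)   # 16.6 percentage points
aucc_gain = round(0.628 - 0.458, 3)     # 0.17
```

Both toy dialogues end at 100% coverage, but the one that surfaces traits earlier scores higher under this definition, which is the sense in which a per-turn efficiency metric separates strategies that final coverage alone cannot.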