Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies
Quick Take
The study introduces Keyphrase Generative Representation to enhance youth crisis conversation analysis beyond static taxonomies.
Key Points
- Analyzed 703,975 youth SMS conversations for mental health insights.
- Expanded taxonomy from 19 to 39 labels with high expert accuracy.
- KGR improved topic retrieval accuracy significantly over manual methods.
Article Content
From source RSS / original summaryarXiv:2605. 27546v1 Announce Type: new Abstract: Crisis Responders (CRs) rapidly assess thousands of youth SMS conversations each year to identify mental health concerns and guide support. Yet youth distress is increasingly expressed through evolving and context-specific language that often does not fit fixed-label taxonomies. This work analyzed 703,975 de-identified Kids Help Phone conversations (2018-2023) and expanded KHP's 19-label issue taxonomy into a 39-label hierarchical schema.
We then introduce Keyphrase Generative Representation (KGR), a constrained LLM generating concise, conversation-specific keyphrases, evaluated across 129 conversations and 387 expert annotations. The expanded taxonomy achieved expert consensus reliability, with an accuracy of 0. 96, and expert review found that 81% of keyphrases accurately reflected content and 74% improved clarity.
KGR surfaced identity-linked themes absent from the fixed taxonomy, including immigration problems and caregiver burden, and supported a topic-retrieval workflow that increased accuracy from 0. 25 to 0. 70 (+0. 45) over the manual analyst process. KGR marks a shift toward hybrid, interpretable generative representations that extend crisis response beyond static taxonomies to surface emerging and culturally grounded patterns of youth distress.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The reliability of LLM judges for evaluating deep research agents is critically assessed using the REFLECT benchmark.