Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies

arXiv cs.CL·Abeer Badawi, Will Aitken, Lydia Sequeira, Jocelyn Rankin, Maia Norman, Elham Dolatabadi

3h ago

·~1 min·5/28/2026·en·0

Quick Take

The study introduces Keyphrase Generative Representation to enhance youth crisis conversation analysis beyond static taxonomies.

Key Points

Analyzed 703,975 youth SMS conversations for mental health insights.
Expanded taxonomy from 19 to 39 labels with high expert accuracy.
KGR improved topic retrieval accuracy significantly over manual methods.

Article Content

From source RSS / original summary

arXiv:2605. 27546v1 Announce Type: new Abstract: Crisis Responders (CRs) rapidly assess thousands of youth SMS conversations each year to identify mental health concerns and guide support. Yet youth distress is increasingly expressed through evolving and context-specific language that often does not fit fixed-label taxonomies. This work analyzed 703,975 de-identified Kids Help Phone conversations (2018-2023) and expanded KHP's 19-label issue taxonomy into a 39-label hierarchical schema.

We then introduce Keyphrase Generative Representation (KGR), a constrained LLM generating concise, conversation-specific keyphrases, evaluated across 129 conversations and 387 expert annotations. The expanded taxonomy achieved expert consensus reliability, with an accuracy of 0. 96, and expert review found that 81% of keyphrases accurately reflected content and 74% improved clarity.

KGR surfaced identity-linked themes absent from the fixed taxonomy, including immigration problems and caregiver burden, and supported a topic-retrieval workflow that increased accuracy from 0. 25 to 0. 70 (+0. 45) over the manual analyst process. KGR marks a shift toward hybrid, interpretable generative representations that extend crisis response beyond static taxonomies to surface emerging and culturally grounded patterns of youth distress.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies

Quick Take

Key Points

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

In-Context Optimization for Retrieval-Augmented Generation: A Gradient-Descent Perspective

Extracting Training Data from Diffusion Language Models via Infilling