Toxicity in Twitch Chats: An LLM-Based Analysis Across Gaming Communities

arXiv cs.CL·Ronja Fuchs, Florian Rupp, Timo Bertram, Kai Eckert, Alexander Dockhorn

1d ago

·~2 min·5/26/2026·en·0

Quick Take

Analysis of 20 million Twitch chat messages reveals significant toxicity variations across gaming communities.

Key Points

2.4% of messages classified as toxic overall.
MOBA games have the highest toxicity rate at 3.2%.
Game-specific norms influence toxicity beyond genre.

Article Content

From source RSS / original summary

arXiv:2605. 24000v1 Announce Type: new Abstract: Toxicity in online gaming communities remains a persistent challenge, manifesting across genres, platforms, and player interactions. While much research is focused on in-game toxicity, less is known about how toxic behavior varies between gaming communities on streaming platforms. To address this shortcoming, we analyze approximately 20 million chat messages from 4,452 streams, spanning seven game genres on Twitch.

We categorize messages according to Twitch's toxicity taxonomy with a pre-trained Large Language Model using zero-shot classification. The taxonomy comprises four categories and eight subclasses, including harassment, discrimination, sexual content, and profanity. Our approach achieves an F1 score of 94. 5% on the TextDetox dataset and demonstrates human-model agreement comparable to inter-human agreement. Our analysis reveals that 2.

4% of all messages are classified as toxic, with notable differences across genres: streams of MOBA games exhibit the highest relative rate of toxicity (3. 2%), and sports games show the lowest rate (2%). Furthermore, results indicate that individual games differ significantly in their toxicity distributions, even within genres, suggesting the existence of game-specific community norms and mechanics that shape toxic behavior beyond genre-level effects.

These findings offer empirical insights into genre- and game-specific toxicity patterns on Twitch and can inform more targeted moderation strategies for gaming communities.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

Toxicity in Twitch Chats: An LLM-Based Analysis Across Gaming Communities

Quick Take

Key Points

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

In-Context Optimization for Retrieval-Augmented Generation: A Gradient-Descent Perspective

Extracting Training Data from Diffusion Language Models via Infilling

Related in this space

Alignment Tuning for Large Language Models: A Data-Centric Lens on Alignment Data Pipelines

How Far Will They Go? Red-Teaming Online Influence with Large Language Models