Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

arXiv cs.CL·Aya Vera-Jimenez, Samuel Jaeger, Calvin Ibenye, Dhrubajyoti Ghosh

3h ago

·~2 min·6/4/2026·en·0

Quick Take

This study demonstrates that a random forest classifier can effectively detect AI-generated fake news across different prompts, achieving AUC values between 0.988 and 1.000. By analyzing interpretable linguistic features such as lexical diversity and readability, the model shows robust performance despite variations in prompting strategies, indicating stable properties of AI-generated text.

Key Points

Classifier trained on one prompt tested successfully on others with high AUC scores.
AI-generated texts show higher lexical diversity and lower emotional intensity.
Performance remains strong despite variations in prompting strategies.
Study utilizes three datasets combining AI-generated and real news articles.
Feature-based approaches can enhance detection of AI-generated fake news.

Article Content

From source RSS / original summary

arXiv:2606. 04199v1 Announce Type: new Abstract: The increasing use of large language models has raised concerns about the spread of AI-generated fake news, particularly under varying prompting strategies. Most existing detection models are trained and evaluated under a single generation setting, leaving their ability to generalize across unseen prompts unclear.

In this study, we investigate cross-prompt generalization in fake news detection using three datasets of AI-generated articles produced under distinct prompts, combined with real news articles. We extract interpretable linguistic features capturing lexical diversity, readability, and emotion-based characteristics and evaluate a random forest classifier under a cross-prompt framework, where models trained on one prompt are tested on another.

Across all six train-test combinations, performance remains consistently high, with AUC values ranging from 0. 988 to 1. 000. Analysis of feature distributions shows that AI-generated text exhibits increased lexical diversity, reduced readability, and substantially lower emotional intensity compared to the overall dataset, with variations across prompts.

Despite these distributional shifts, the classifier maintains strong performance, indicating that these features capture stable properties of AI-generated text that generalize across prompting strategies. These findings suggest that feature-based approaches can provide robust detection of AI-generated fake news under prompt variability.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Leyao Wang, Yanan He, Peng Chen, Asaf Yehudai, Yixin Liu, Rex Ying, Michal Shmueli-Scheuer, Arman Cohan

2w ago

FeaturedOriginal

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

AI Summary

The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.

#LLM #Agent #Inference #Policy

Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

Quick Take

Key Points

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Discourse-Role Labels as Presentation-Time Variables for Context Use in Language Models

SENSE: Semantic Embedding Navigation with Soft-gated Evaluation for Retrieval-based Speculative Decoding

Related in this space

Deploy Self-Evolving Agents for Faster, More Secure Research with a Hermes Agent and NVIDIA NemoClaw

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems