Does AI Reviewer See the Full Picture? Attacking and Defending… | AI Deep Signal

Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review

arXiv cs.CL·Xinyu Zhao, Rana Muhammad Shahroz Khan, Zhen Xu, Zhen Tan, Tianlong Chen

6/12/2026

·~2 min·6/12/2026·en·3

Quick Answer

This paper shows that The integration of Large Language Models (LLMs) into peer review exposes vulnerabilities to targeted attacks, prompting the introduction of PaperGuard, a benchmark designed to evaluate and defend against these multimodal adversarial manipulations.

Quick Take

The framework includes a multimodal dataset, a suite of targeted attacks, and a defense mechanism using chunk-based embedding search, revealing that AI reviewers are significantly susceptible to manipulation.

Key Points

Current AI peer-review studies focus primarily on text, neglecting multimodal vulnerabilities.
PaperGuard features a comprehensive dataset across various scientific domains.
The framework includes black-box and white-box attack methodologies targeting both text and figures.
Experiments confirm that AI reviewers are widely vulnerable to domain-specific attacks.
PaperGuard establishes essential protocols for resilient AI-assisted scholarly reviewing.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Source Excerpt

arXiv:2606. 12716v1 Announce Type: new Abstract: The integration of (LLMs) and Multimodal LLMs (MLLMs) into scientific peer-review workflows introduces novel and significant risks for adversarial manipulation, especially given the multimodal nature of scientific papers where figures, not just text, convey core evidence. This creates a significant gap: current robustness studies on AI peer-review are overwhelmingly text-only. …

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Isabel Xu (The Overlake School), Cynthia Xu (The Overlake School), Rachel Ren (Edwards Vacuum Inc.), Cong Guo (The University of Memphis), Jiacheng Ding (The University of Memphis)

5d ago

FeaturedOriginal

TriAgent: Divergence-Aware Committees for Cost-Efficient Financial Sentiment Analysis

AI Summary

TriAgent introduces a cost-efficient multi-agent system for financial sentiment analysis, combining VADER, FinBERT, and Qwen2.5. It achieves an F1 score of ~0.87 with significant savings of $9.3M/year at a 10M-user scale compared to GPT-4o-mini, while also detecting hallucinations with an AUC of 0.90.

#LLM #Agent #AI Startup #Enterprise AI

Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

TriAgent: Divergence-Aware Committees for Cost-Efficient Financial Sentiment Analysis

RF-Agent: A Practical Framework for Building Language Agents for RFIC Design

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

TriAgent: Divergence-Aware Multi-Agent Committees for Cost-Efficient Financial Sentiment Analysis

RF-Agent: A Practical Framework for Building Language Agents for RFIC Design

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

TriAgent: Divergence-Aware Committees for Cost-Efficient Financial Sentiment Analysis