Code-Switching Reveals Language Anchoring in Multilingual LLMs | AI Deep Signal

arXiv cs.CL·Jeonghyun Park, Seunghyun Yoon, Yonghyun Jun, Hwanhee Lee

6/19/2026

Quick Answer

This paper shows that Multilingual Large Language Models (MLLMs) perform worse on Code-Switched (CS) inputs than on their monolingual counterparts, and it introduces Anchor Bias, a geometric measure of language anchoring, to analyze that degradation.

Quick Take

The proposed CANVAS intervention improves Question Answering (QA) F1 scores across various MLLMs by aligning target-language hidden states with source anchors during inference.

Key Points

Anchor Bias quantifies language anchoring in MLLMs, revealing a grammar-frame effect.
Source-framed CS maintains source anchoring, while target-framed CS shows greater QA degradation.
The CANVAS intervention recovers QA F1 scores across diverse MLLMs and CS conditions.
Internal anchoring signals can mitigate CS inference failures in multilingual models.
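
To make the geometric idea in the points above concrete, here is a minimal Python sketch. It is an illustrative stand-in, not the paper's Anchor Bias definition or the CANVAS procedure, neither of which is spelled out in this brief. The function names `anchoring_score` and `shift_toward_source`, the projection-based score, and the interpolation weight `alpha` are all assumptions made for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def _dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def anchoring_score(h_cs: Vector, h_src: Vector, h_tgt: Vector) -> float:
    """Hypothetical score: where h_cs falls along the source-to-target axis.

    0.0 means the code-switched state projects onto the source state,
    1.0 means it projects onto the target state. This is an assumed
    stand-in, not the paper's Anchor Bias formula.
    """
    axis = [t - s for s, t in zip(h_src, h_tgt)]
    norm_sq = _dot(axis, axis)
    if norm_sq == 0.0:
        raise ValueError("source and target states coincide")
    offset = [c - s for s, c in zip(h_src, h_cs)]
    return _dot(offset, axis) / norm_sq


def shift_toward_source(h: Vector, h_src: Vector, alpha: float) -> List[float]:
    """Assumed steering step: linear interpolation toward the source state.

    alpha = 0.0 leaves h unchanged; alpha = 1.0 returns h_src.
    """
    return [(1.0 - alpha) * x + alpha * s for x, s in zip(h, h_src)]


# Toy 2-D hidden states (made-up numbers, for illustration only).
h_src = [0.0, 0.0]
h_tgt = [2.0, 0.0]
h_cs = [0.5, 1.0]

score = anchoring_score(h_cs, h_src, h_tgt)          # closer to 0 = source-anchored
steered = shift_toward_source(h_tgt, h_src, alpha=0.5)
```

In this toy setup a score near 0 would correspond to a source-anchored CS state and a score near 1 to a target-anchored one. A real analysis would use hidden states extracted from a model layer, and the paper's actual measure and intervention may differ substantially from this sketch.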

Source Excerpt

Multilingual Large Language Models (MLLMs) are increasingly expected to handle Code-Switched (CS) inputs, yet mixing languages frequently degrades performance relative to source- or target-language monolingual counterparts. To understand this degradation, we use grammar-forced CS as a controlled diagnostic setting for locating CS representations relative to their source and target counterparts. We introduce Anchor Bias, a geometric measure that quantifies language anchoring, whether a CS hidden

Read the full article on arxiv.org
