Universal Quantum Transformer

arXiv cs.AI·Sungyong Chung, Alireza Talebpour

6/2/2026


Quick Answer

The paper introduces the Universal Quantum Transformer (UQT), a quantum-native architecture that targets exact mathematical reasoning on multi-qubit systems. The authors report that it avoids the stochastic instability and over-parameterization seen in classical networks.

Quick Take

Running on a 5-qubit substrate, the UQT is reported to learn cyclic modular arithmetic ($\mathbb{Z}_{11}$) and non-Abelian algebra (the $S_4$ permutation group) exactly. The authors claim theoretical compute and memory advantages over classical self-attention, and they also deploy the architecture on IBM Quantum hardware.

Key Points

UQT uses parameterized geometric phase embedding for exact mathematical reasoning.
Achieves deterministic generalization, termed crystallization, beyond classical grokking.
Successfully learns cyclic modular arithmetic and non-Abelian algebra on 5 qubits.
Claims theoretical computational and memory advantages over classical self-attention.
Deployed on current IBM Quantum (NISQ) hardware as a demonstration of viability.


Article Content

From source RSS / original summary

arXiv:2606.00045v1 (announce type: new)

Abstract: Classical continuous-space neural networks fundamentally struggle to lock into exact mathematical symmetries, such as modular arithmetic and non-commutative algebra. To approximate these discrete logical rules, they often rely on massive parameter scaling, resulting in stochastic instability even after delayed generalization phenomena known as grokking.

Here, we introduce the Universal Quantum Transformer (UQT), a fundamentally novel, quantum-native computing architecture that uses the physical properties of multi-qubit systems as a universal inductive bias for exact mathematical and algebraic reasoning. Rather than translating classical neural mechanisms, our framework relies entirely on parameterized geometric phase embedding and $SU(2)$ wave-interference.
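To give a feel for why a phase embedding suits cyclic structure, here is a minimal classical sketch. It is not the authors' circuit: it assumes each residue k is mapped to the unit phase exp(2πik/n), and the names `embed`, `decode`, and `add_via_phase` are hypothetical. Multiplying two phases adds their angles, so modular addition is built into the representation, up to floating-point rounding.

```python
import cmath

N = 11  # example modulus, chosen for this sketch (assumption)

def embed(k: int, n: int = N) -> complex:
    """Map residue k to the unit-circle phase exp(2*pi*i*k/n)."""
    return cmath.exp(2j * cmath.pi * k / n)

def decode(z: complex, n: int = N) -> int:
    """Recover the residue from a phase by rounding its angle."""
    angle = cmath.phase(z) % (2 * cmath.pi)  # shift into [0, 2*pi)
    return round(angle * n / (2 * cmath.pi)) % n

def add_via_phase(a: int, b: int, n: int = N) -> int:
    """Add two residues by multiplying their phases (angles add)."""
    return decode(embed(a, n) * embed(b, n), n)

# Check every pair of residues against ordinary modular addition.
all_exact = all(
    add_via_phase(a, b) == (a + b) % N
    for a in range(N)
    for b in range(N)
)
print(all_exact)
```

The wrap-around at the modulus comes for free because phases are periodic, which is the kind of built-in symmetry the abstract calls an inductive bias. How the UQT parameterizes and trains such phases is not specified in this summary.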

We demonstrate that the quantum attention circuit, operating on a highly compact 5-qubit substrate, perfectly learns two highly distinct formal classes: cyclic modular arithmetic ($\mathbb{Z}_{11}$) and non-Abelian algebra (the $S_4$ permutation group). While classical attention-based networks exhibit stochastic instability at convergence, the UQT achieves mathematically exact, deterministic generalization. We refer to this phenomenon as crystallization: a step beyond the well-known phenomenon of grokking.
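The two task families can be written down as complete operation tables. The sketch below builds them classically, assuming the $\mathbb{Z}_{11}$ task is addition modulo 11 and the $S_4$ task is permutation composition (the abstract does not state the exact task encoding). The helper names are hypothetical.

```python
from itertools import permutations

def z_n_table(n: int = 11) -> dict:
    """All (a, b) -> (a + b) mod n pairs for the cyclic group Z_n."""
    return {(a, b): (a + b) % n for a in range(n) for b in range(n)}

# The 24 elements of S_4, each stored as a tuple p where p[i] is the image of i.
S4 = list(permutations(range(4)))

def compose(p: tuple, q: tuple) -> tuple:
    """Return the permutation 'p after q': apply q first, then p."""
    return tuple(p[q[i]] for i in range(4))

def s4_table() -> dict:
    """All (p, q) -> compose(p, q) pairs for S_4."""
    return {(p, q): compose(p, q) for p in S4 for q in S4}

z11 = z_n_table()
s4 = s4_table()

# Count ordered pairs whose product depends on the order of the factors.
non_commuting = sum(
    1 for p in S4 for q in S4 if compose(p, q) != compose(q, p)
)
print(len(z11), len(s4), non_commuting > 0)
```

Both tables are small (121 and 576 entries), but they differ in kind: addition modulo 11 is commutative, while many pairs in $S_4$ give different results depending on order. That difference is why the abstract treats them as two distinct formal classes.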

Crucially, this framework yields massive computational and memory advantages by theoretically bypassing the quadratic bottleneck of classical self-attention, and by logarithmically compressing the required representation dimension to eliminate the massive over-parameterization inherent to classical networks. Finally, we deploy this architecture on noisy intermediate-scale quantum (NISQ) hardware, proving its viability on current IBM Quantum computers.
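One way to read the two scaling claims is as simple counting. The sketch below assumes that "logarithmic compression" refers to the fact that n qubits span 2^n basis states, and that the "quadratic bottleneck" refers to the number of pairwise attention scores in classical self-attention. The abstract does not spell out its own accounting, so both function names and this interpretation are assumptions.

```python
def qubits_needed(num_elements: int) -> int:
    """Smallest n with 2**n >= num_elements (exact integer arithmetic)."""
    return (num_elements - 1).bit_length()

def attention_score_count(seq_len: int) -> int:
    """Pairwise scores computed by classical self-attention for one head."""
    return seq_len * seq_len

z11_qubits = qubits_needed(11)   # residues of Z_11
s4_qubits = qubits_needed(24)    # elements of S_4
hilbert_dim = 2 ** 5             # basis states available on 5 qubits

print(z11_qubits, s4_qubits, hilbert_dim)
print(attention_score_count(128), attention_score_count(256))
```

Under this reading, labeling the 24 elements of $S_4$ takes 5 qubits, which is consistent with the 5-qubit substrate, although the abstract does not say this is how the size was chosen. Doubling the sequence length quadruples the classical score count, which is the growth the authors say their design bypasses in theory.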

These results establish parameterized quantum topology as a universally superior physical substrate for exact artificial intelligence.
