GiLT: Augmenting Transformer Language Models with Dependency Graphs

arXiv cs.CL·Tianyu Huang, Yida Zhao, Chuyan Zhou, Kewei Tu

4d ago

·~2 min·5/18/2026·en·3

Quick Take

GiLT enhances Transformer models by integrating dependency graphs for improved syntactic generalization.

Key Points

Utilizes dependency graphs without adding structural tokens.
Modulates attention weights for language modeling.
Achieves better performance on downstream tasks.

📖 Reader Mode

~2 min read

[Submitted on 15 May 2026]

View PDF HTML (experimental)

Abstract:Augmenting Transformers with linguistic structures effectively enhances the syntactic generalization performance of language models. Previous work in this direction focuses on syntactic tree structures of languages, in particular constituency tree structures. We propose Graph-Infused Layers Transformer Language Model (GiLT) which leverages dependency graphs for augmenting Transformer language models. Unlike most previous work, GiLT does not insert extra structural tokens in language modeling; instead, it injects structural information into language modeling by modulating attention weights in the Transformer with features extracted from the dependency graph that is incrementally constructed along with token prediction. In our experiments, GiLT with semantic dependency graphs achieves better syntactic generalization while maintaining competitive perplexity in comparison with Transformer language model baselines. In addition, GiLT can be finetuned from a pretrained language model to achieve improved downstream task performance. Our code is released at this https URL.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2605.15562 [cs.CL]
	(or arXiv:2605.15562v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2605.15562 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Tianyu Huang [view email]
[v1] Fri, 15 May 2026 03:08:49 UTC (236 KB)

— Originally published at arxiv.org

Continue reading on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

GiLT: Augmenting Transformer Language Models with Dependency Graphs

Quick Take

Key Points

📖 Reader Mode

Submission history

Want this in your inbox every morning?

More from arXiv cs.CL

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

Comparing LLM and Fine-Tuned Model Performance on NVDRS Circumstance Extraction with Varying Prompt Complexity

Related in this space

From Prompts to Protocols: An AI Agent for Laboratory Automation

Agentic Trading: When LLM Agents Meet Financial Markets