Personalization as Inverse Planning: Learning Latent Design Intents for Agentic Slide Generation via Structural Denoising

arXiv cs.AI·Tianci Liu, Zihan Dong, Linjun Zhang, Haoyu Wang, jing Gao, Emre Kiciman, Ranveer Chandra, Wei-Ting Chen

3h ago

·~1 min·7/2/2026·en·0

Quick Answer

This study introduces SPIRE, a framework for Page-level Slide Personalization (PSP) that formulates design intent learning as an inverse planning problem.

Quick Take

This study introduces SPIRE, a framework for Page-level Slide Personalization (PSP) that formulates design intent learning as an inverse planning problem. By employing structural denoising and reinforcement learning, SPIRE effectively refines slide designs without relying on specific tools, demonstrating superior performance in experiments.

Key Points

SPIRE addresses the challenge of fine-grained slide design personalization.
It formulates Page-level Slide Personalization as an inverse planning problem.
The framework uses structural denoising to refine slide designs collaboratively.
Reinforcement learning is employed to optimize the design process.
Extensive experiments validate SPIRE's superior performance over existing methods.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Content

From source RSS / original summary

arXiv:2607. 00407v1 Announce Type: new Abstract: Slide design requires personalizing both deck themes and page layouts. Yet, current AI agent-based methods struggle with fine-grained, page-level design. Solely relying on prespecified templates or user verbose instructions, they fail to capture latent design intents, leaving Page-level Slide Personalization (PSP) unresolved. To close this gap, this work formulates PSP as an inverse planning problem.

We propose to learn a design intent without assuming any knowledge of the specific executing tools (e. g. , PowerPoint, Beamer) being used. However, relinquishing control over these tools makes the problem intractable to optimize end-to-end. To overcome this, we propose SPIRE, a principled framework to solve PSP approximately.

By intentionally corrupting the visual structures of clean slides, SPIRE creates a verifiable task to denoise the corruption, whereby two agents learn to collaboratively refine executable designs via reinforcement learning (RL). We present a proof that structural denoising is a consistent surrogate for PSP, and that the formulation strictly reduces policy gradient variance in RL. Extensive experiments demonstrate the superiority of SPIRE.

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.AI

See more →

arXiv cs.AI·Binghai Wang, Chenlong Zhang, Dayiheng Liu, Jiajun Zhang, Jiawei Chen, Mouxiang Chen, Rongyao Fang, Siyuan Zhang, Xuwu Wang, Yuheng Jing, Zeyao Ma, Zeyu Cui

6d ago

FeaturedOriginal

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

AI Summary

As coding agents evolve, verifying solutions becomes more challenging than generating them, necessitating a focus on scalable, faithful, and robust verification methods. The study reveals that no fixed reward function can sustain effectiveness as model capabilities advance, emphasizing the need for verification to evolve alongside solution generation.

#Agent #AI Coding #Inference #Policy