Personalization as Inverse Planning: Learning Latent Design Intents for Agentic Slide Generation via Structural Denoising
Quick Answer
This study introduces SPIRE, a framework for Page-level Slide Personalization (PSP) that formulates design intent learning as an inverse planning problem.
Quick Take
This study introduces SPIRE, a framework for Page-level Slide Personalization (PSP) that formulates design intent learning as an inverse planning problem. By employing structural denoising and reinforcement learning, SPIRE effectively refines slide designs without relying on specific tools, demonstrating superior performance in experiments.
Key Points
- SPIRE addresses the challenge of fine-grained slide design personalization.
- It formulates Page-level Slide Personalization as an inverse planning problem.
- The framework uses structural denoising to refine slide designs collaboratively.
- Reinforcement learning is employed to optimize the design process.
- Extensive experiments validate SPIRE's superior performance over existing methods.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2607. 00407v1 Announce Type: new Abstract: Slide design requires personalizing both deck themes and page layouts. Yet, current AI agent-based methods struggle with fine-grained, page-level design. Solely relying on prespecified templates or user verbose instructions, they fail to capture latent design intents, leaving Page-level Slide Personalization (PSP) unresolved. To close this gap, this work formulates PSP as an inverse planning problem.
We propose to learn a design intent without assuming any knowledge of the specific executing tools (e. g. , PowerPoint, Beamer) being used. However, relinquishing control over these tools makes the problem intractable to optimize end-to-end. To overcome this, we propose SPIRE, a principled framework to solve PSP approximately.
By intentionally corrupting the visual structures of clean slides, SPIRE creates a verifiable task to denoise the corruption, whereby two agents learn to collaboratively refine executable designs via reinforcement learning (RL). We present a proof that structural denoising is a consistent surrogate for PSP, and that the formulation strictly reduces policy gradient variance in RL. Extensive experiments demonstrate the superiority of SPIRE.
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →The Verification Horizon: No Silver Bullet for Coding Agent Rewards
As coding agents evolve, verifying solutions becomes more challenging than generating them, necessitating a focus on scalable, faithful, and robust verification methods. The study reveals that no fixed reward function can sustain effectiveness as model capabilities advance, emphasizing the need for verification to evolve alongside solution generation.