Any2Poster: Any-Source Poster Generation Across Modalities and Domains
Quick Take
Any2Poster introduces a benchmark for automatic poster generation, evaluating systems across eight input modalities and five content domains. The Any2Poster Agent achieves an average accuracy of 87.25% and significantly outperforms previous models, enhancing factual retention and visual quality assessments.
Key Points
- Any2Poster Bench evaluates poster generation across PDFs, URLs, PPTX, DOCX, Markdown, LaTeX, notebooks, and videos.
- The benchmark assesses visual quality, layout, readability, and logical flow using VLM-based judgments.
- Any2Poster Agent achieves 87.25% accuracy on the benchmark, improving over previous models.
- The model enhances factual retention from 51.06% to 72.58% in PaperQuiz-style evaluations.
- Any2Poster provides a reusable resource for multimodal, domain-general poster generation research.
Article Content
From source RSS / original summaryarXiv:2606. 02915v1 Announce Type: new Abstract: Visual posters are a compact medium for communicating dense information, yet progress on automatic poster generation remains difficult to measure because existing evaluations are often restricted to paper-only inputs, narrow domains, or surface-level visual similarity.
We introduce Any2Poster Bench, a benchmark for any-source poster generation that evaluates systems across eight input modalities--PDFs, URLs, PPTX, DOCX, Markdown, LaTeX, notebooks, and videos--and five content domains.
Any2Poster Bench pairs each source with quiz-based probes of verbatim factual retention and interpretive understanding, together with VLM-based judgments of visual quality, layout, readability, content completeness, and logical flow, enabling reproducible assessment of both information fidelity and visual communication.
To instantiate and validate this benchmark, we further present Any2Poster Agent, an end-to-end reference agent that parses heterogeneous sources, organizes salient content, plans poster layouts, renders posters, and iteratively refines them using visual feedback. On Any2Poster Bench, Any2Poster Agent achieves 87. 25% average accuracy across input modalities and 87. 28% across content domains.
On PaperQuiz-style evaluation, where prior paper-to-poster agents are directly comparable, Any2Poster Agent improves over PosterAgent-4o from 51. 06-51. 33% to 72. 58% overall accuracy and from 116-121 to 145. 16 in density-augmented score. Together, Any2Poster Bench and Any2Poster Agent provide a reusable evaluation resource and a competitive baseline for studying multimodal, domain-general poster generation.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CV
See more →Plan2Map: A Multimodal Benchmark for Document-Grounded Geospatial Boundary Reconstruction from Planning Records
Plan2Map introduces a 208-case benchmark for reconstructing geospatial boundaries from UK planning documents. The GeoPlanAgent system achieves a mean IoU of 0.736, significantly outperforming baseline models, highlighting the challenges in localization and map registration.