Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling

arXiv cs.CV·Xingyu Zheng, Xianglong Liu, Yifu Ding, Weilun Feng, Junqing Lin, Jinyang Guo, Haotong Qin

3h ago

·~2 min·7/3/2026·en·0

Quick Answer

MrFlow introduces a training-free multi-resolution acceleration strategy for flow-matching models, achieving 10x inference speedup while maintaining quality within 1% of prior performance.

Quick Take

MrFlow introduces a training-free multi-resolution acceleration strategy for flow-matching models, achieving 10x inference speedup while maintaining quality within 1% of prior performance. This method utilizes a staged pipeline that combines low-resolution structure generation with high-resolution refinement, outperforming existing training-free strategies and allowing for further acceleration when combined with timestep distillation techniques.

Key Points

Achieves 10x end-to-end acceleration without any training or runtime identification.
Maintains OneIG performance within a 1% gap compared to pre-acceleration.
Combines low-resolution sampling with high-frequency resampling for detail refinement.
Can be orthogonally combined with timestep distillation for up to 25x acceleration.
Utilizes a lightweight GAN-based model for super-resolution in pixel space.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Content

From source RSS / original summary

arXiv:2607. 01642v1 Announce Type: new Abstract: Hardware-agnostic strategies for accelerating text-to-image diffusion, such as timestep distillation and feature caching, can reduce inference time without custom kernels or system-level optimization. Among them, multi-resolution generation strategies have recently received broad attention, attaining more than 5x speedup without any training.

However, the design of performing upsampling in the latent space, together with the selective modification of partial regions, causes these methods to exhibit noticeable blurring or artifacts. To this end, we propose MrFlow, a training-free multi-resolution acceleration strategy for pretrained flow-matching models built upon a staged low-to-high-resolution pipeline.

MrFlow first rapidly generates the main structure at low resolution, then performs super-resolution in the pixel space using a lightweight pretrained GAN-based model, subsequently injects low-strength noise to enable high-frequency resampling, and finally refines the details at high resolution. Quantitative and qualitative results on FLUX.

1-dev and Qwen-Image show that MrFlow exploits the quadratic token reduction and reduced step requirement of low-resolution sampling to achieve 10x end-to-end acceleration while keeping OneIG within a 1% gap relative to that before acceleration, significantly surpassing other training-free acceleration strategies, and requiring no training or runtime dynamic identification whatsoever.

MrFlow can further be directly combined orthogonally with pre-trained timestep distillation strategies, achieving even higher generation acceleration of up to 25x.

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Shahrzad Esmat, Chaunte W. Lacewell, Sameh Gobriel, Nilesh Jain, Ali Jannesari

4w ago

FeaturedOriginal

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

AI Summary

A phase-aware LLM agent optimizes human-object interaction retrieval, outperforming Optuna TPE by 33.3% and VDTuner by 34.2% on the HICO-DET benchmark. This method enhances throughput by 15.3x over UniIR and demonstrates strong transferability across vector database management systems.

#LLM #Agent #Inference #AI Startup