Pyramid Self-contrastive Learning Framework for Test-time Ultrasound Image Denoising
Quick Take
The A2A framework enhances ultrasound image denoising at test time using self-contrastive learning.
Key Points
- Eliminates domain shift and pretraining costs.
- Achieves 69.3% SNR and 34.4% CNR gains in simulation, and 84.8% SNR and 25.7% CNR gains in vivo.
- Facilitates reliable anatomical visualization in ultrasound.
Abstract: The inherent electronic and speckle noise complicates clinical interpretation of ultrasound images. Conventional denoising methods rely on explicit noise assumptions whose validity diminishes under composite noise conditions, while learning-based methods require massive labeled data and model parameters. Such pre-defined assumptions and pre-trained models entail an inevitable domain shift in complex in vivo environments, so they are limited to a specific noise type and often blur structural details. In this study, we propose a pure test-time training framework for one-shot ultrasound image denoising and apply it to synthetic aperture ultrasound (SAU), which synthesizes transmit focus from sub-aperture transmissions. Our Aperture-to-Aperture (A2A) framework disentangles anatomical similarity and noise randomness from shuffled sub-apertures through self-contrastive learning in pyramid latent spaces. The clean image is then decoded from the anatomy space, while the noise space is discarded. A2A is trained at test time on a single noisy sample of SAU signals, so it fundamentally eliminates domain shift and pretraining costs. Simulation experiments, spanning electronic noise levels of 0 to 30 dB and different inclusion geometries, demonstrated improvements of 69.3% in SNR and 34.4% in CNR with A2A. The in vivo results showed 84.8% SNR and 25.7% CNR gains using only two-aperture data of the heart in six echocardiographic views, the liver, and the kidney. A2A delivers clear images and signals across diverse imaging targets and configurations, paving the way for more reliable anatomical visualization and functional assessment by ultrasound.
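The abstract's core idea is a self-contrastive objective: anatomy codes extracted from two shuffled sub-apertures of the same target should agree, while their noise codes should be uncorrelated. Below is a hypothetical, greatly simplified sketch of such a loss on plain latent vectors; the paper's actual pyramid latent spaces, encoder, and loss formulation are not specified in the abstract, and the function names here (`a2a_contrastive_loss`, `cosine`) are illustrative assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two latent-code vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    return dot / (norm_u * norm_v)

def a2a_contrastive_loss(anat1, anat2, noise1, noise2):
    """Hypothetical self-contrastive objective: anatomy codes from two
    sub-apertures are pulled together (similarity toward 1), while their
    noise codes are pushed apart (similarity toward 0)."""
    anatomy_term = 1.0 - cosine(anat1, anat2)      # agree on shared anatomy
    noise_term = max(0.0, cosine(noise1, noise2))  # decorrelate the noise
    return anatomy_term + noise_term

# Identical anatomy codes and orthogonal noise codes give zero loss.
print(a2a_contrastive_loss([1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]))
```

Because both terms depend only on the two sub-aperture views of one noisy acquisition, an objective of this shape can be minimized at test time on a single sample, which is what lets the framework avoid pretraining entirely.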
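The reported gains are in SNR and CNR, the standard ultrasound image-quality metrics. A minimal sketch of the usual definitions (mean over standard deviation within a region; absolute mean contrast between two regions over their combined standard deviation) is shown below, assuming these common formulas rather than the paper's exact implementation:

```python
import statistics

def snr(region):
    """Signal-to-noise ratio: mean intensity over its standard deviation."""
    return statistics.mean(region) / statistics.pstdev(region)

def cnr(target, background):
    """Contrast-to-noise ratio: absolute mean difference between two
    regions, normalized by their combined standard deviation."""
    contrast = abs(statistics.mean(target) - statistics.mean(background))
    spread = (statistics.pvariance(target) + statistics.pvariance(background)) ** 0.5
    return contrast / spread

# Toy pixel intensities: a bright inclusion against a speckle background.
inclusion = [0.90, 0.85, 0.95, 0.88, 0.92]
background = [0.30, 0.35, 0.25, 0.32, 0.28]
print(f"SNR: {snr(inclusion):.2f}, CNR: {cnr(inclusion, background):.2f}")
```

A percentage gain such as the 84.8% SNR figure then compares these values before and after denoising on the same region of interest.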
| Subjects: | Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI) |
| Cite as: | arXiv:2605.12567 [cs.CV] (or arXiv:2605.12567v1 [cs.CV] for this version) |
| DOI: | https://doi.org/10.48550/arXiv.2605.12567 |
Submission history
From: Jiajing Zhang
[v1] Tue, 12 May 2026 09:27:21 UTC (28,679 KB)
— Originally published at arxiv.org