MedDiffuseMix: Preserving Diagnostic Evidence with Saliency-Aware Diffusion Medical Image Data Augmentation

arXiv cs.CV · Teerath Kumar, Raja Vavekanand, Muhammad Turab

~2 min · 6/30/2026

Quick Answer

MedDiffuseMix introduces a saliency-aware diffusion mixing framework for medical image augmentation, enhancing classification accuracy across four benchmarks.

Quick Take

MedDiffuseMix outperforms standard augmentation, Mixup, GenMix, SaliencyMix, and diffusion-based baselines on accuracy, F1-score, and ROC AUC by preserving diagnostically salient regions while limiting semantic distortion.

Key Points

MedDiffuseMix uses classifier-derived saliency maps to confine diffusion-guided mixing mainly to regions of low diagnostic importance.
Evaluated on four datasets: pneumonia chest radiography, musculoskeletal radiographs, PatchCamelyon, and breast cancer histopathology.
Improves accuracy, F1-score, and ROC AUC compared with standard augmentation, Mixup, GenMix, SaliencyMix, and diffusion-based baselines.
Adaptive mixing and saliency preservation reduce semantic distortion in augmented images.
Visual analysis shows better retention of diagnostically important features.

Article Content

From source RSS / original summary

arXiv:2606.28419v1

Abstract: Limited data availability, class imbalance, and domain variability remain major barriers to reliable medical image classification. Conventional augmentation can improve training diversity but may distort diagnostically informative structures, whereas unconstrained generative augmentation may introduce label-inconsistent content. This paper proposes MedDiffuseMix, a saliency-guided diffusion mixing framework for controlled medical image augmentation.

The method uses classifier-derived saliency maps to separate high-saliency diagnostic regions from low-saliency background areas and applies diffusion-guided mixing mainly to regions with lower diagnostic importance. Adaptive mixing, Gaussian boundary blending, and a saliency-preservation constraint reduce semantic distortion and reject or attenuate samples that shift model attention away from clinically relevant evidence.
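The steps described above can be sketched roughly as follows. This is a minimal, dependency-free illustration and not the authors' implementation: it assumes a saliency map and a diffusion-generated image are already available as 2-D lists of floats, and every name and parameter here (`saliency_guided_mix`, `preserves_saliency`, `threshold`, `strength`, `min_ratio`) is hypothetical. The acceptance rule in particular is only one plausible reading of a "saliency-preservation constraint".

```python
import math

def gaussian_kernel(sigma, radius):
    """Return 1-D Gaussian weights normalised to sum to 1."""
    weights = [math.exp(-(k * k) / (2.0 * sigma * sigma))
               for k in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur(grid, sigma=1.0, radius=2):
    """Separable Gaussian blur of a 2-D list, clamping indices at the edges."""
    kernel = gaussian_kernel(sigma, radius)
    h, w = len(grid), len(grid[0])
    offsets = range(-radius, radius + 1)

    def clamp(i, n):
        return min(max(i, 0), n - 1)

    horizontal = [[sum(kernel[k + radius] * grid[y][clamp(x + k, w)] for k in offsets)
                   for x in range(w)] for y in range(h)]
    return [[sum(kernel[k + radius] * horizontal[clamp(y + k, h)][x] for k in offsets)
             for x in range(w)] for y in range(h)]

def saliency_guided_mix(image, generated, saliency, threshold=0.5, strength=0.8):
    """Blend `generated` into `image` mainly where saliency is below `threshold`.

    `generated` stands in for the output of a diffusion model (assumption).
    """
    h, w = len(image), len(image[0])
    # Hard mask: 1.0 on low-saliency background, 0.0 on diagnostic regions.
    background = [[1.0 if saliency[y][x] < threshold else 0.0 for x in range(w)]
                  for y in range(h)]
    # Gaussian boundary blending: soften the mask so the transition is gradual.
    soft = blur(background)
    return [[(1.0 - strength * soft[y][x]) * image[y][x]
             + strength * soft[y][x] * generated[y][x]
             for x in range(w)] for y in range(h)]

def preserves_saliency(saliency_before, saliency_after, threshold=0.5, min_ratio=0.9):
    """Hypothetical acceptance test: keep a sample only if the saliency mass inside
    the originally salient region stays at or above `min_ratio` of its old value."""
    before = after = 0.0
    for row_before, row_after in zip(saliency_before, saliency_after):
        for b, a in zip(row_before, row_after):
            if b >= threshold:
                before += b
                after += a
    return before == 0.0 or after / before >= min_ratio
```

In this sketch, pixels deep inside the high-saliency region are left unchanged, pixels near the boundary receive a partial blend, and a sample whose recomputed saliency map has drifted away from the original diagnostic region would be rejected by `preserves_saliency`.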

The framework is evaluated on four public benchmarks: the Radiological Society of North America pneumonia chest radiography dataset, Musculoskeletal Radiographs, PatchCamelyon, and the Breast Cancer Histopathological Image Classification dataset.

Experiments with convolutional and transformer-based classifiers show that MedDiffuseMix improves accuracy, F1-score, and area under the receiver operating characteristic curve compared with standard augmentation, Mixup, GenMix, SaliencyMix, and diffusion-based augmentation baselines. Ablation studies confirm the importance of saliency guidance, adaptive region mixing, and smooth boundary blending. Visual attribution analysis further indicates that MedDiffuseMix better preserves diagnostically salient regions.
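For readers unfamiliar with the three reported metrics, the following sketch shows how they can be computed for a binary classifier. It is an illustration under assumptions: the summary does not say how the paper averages metrics across classes or which tooling it uses, and the function names are hypothetical. AUC is computed here by the pairwise-ranking definition (the probability that a randomly chosen positive scores higher than a randomly chosen negative, with ties counted as half).

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def roc_auc(y_true, scores, positive=1):
    """Area under the ROC curve via pairwise comparison of scores."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

The pairwise form is quadratic in the number of samples, so it suits small illustrative inputs; a sorting-based computation is the usual choice at benchmark scale.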

These results suggest that saliency-guided diffusion mixing is an effective augmentation strategy for limited-data medical image classification.
