DefocusTrackerAI -- A Generalized Framework for the Automatic Detection of Defocused Particle Images

arXiv cs.CV·Gon\c{c}alo Coutinho, Ana S. Moita, Ant\'onio L. N. Moreira, Massimiliano Rossi

3h ago

·~2 min·6/2/2026·en·0

Quick Take

DefocusTrackerAI is a deep-learning framework utilizing YOLOv9 for automatic detection of defocused particle images, outperforming Faster R-CNN with higher recall and lower uncertainty. The model achieves spatial resolution uncertainties between 0.1 and 0.4 pixels and is applicable across various optical setups and lighting conditions, demonstrating versatility in real DPT experiments.

Key Points

YOLOv9 outperforms Faster R-CNN in detecting defocused particles with higher recall.
Achieves spatial resolution uncertainties of 0.1 to 0.4 pixels at high particle densities.
Successfully applied in real DPT experiments, including fluorescence and shadowgraph data.
Available as a pre-trained model for high-accuracy defocused particle detection.
Can be integrated with calibration for effective 3D defocusing particle tracking.

Article Content

From source RSS / original summary

arXiv:2606. 00076v1 Announce Type: new Abstract: The present work introduces DefocusTrackerAI, a generalized deep-learning framework for the automatic detection and position estimation of defocused particle images from any kind of optical configuration without compromising uncertainty and recall, intended as a follow-up of the open-source project DefocusTracker.

We selected the deep neural network architecture from the direct comparison of two well-known object detection models, Faster R-CNN and YOLOv9, trained on a diverse and feature-rich synthetic image set containing astigmatic and non-astigmatic defocused particle images of varying diameters.

The model evaluation on synthetic data showed that, first, YOLOv9 outperforms Faster R-CNN, achieving higher recall and lower uncertainty, particularly at high particle image densities; and second, that YOLOv9 provides enhanced spatial resolution, with uncertainty values between 0. 1 and 0. 4 pixels for particle image densities N_s up to 0. 5, outperforming state-of-the-art algorithms.

We demonstrated that our models are able to detect astigmatic and non-astigmatic defocused particle images in multiple optical setups with varying lighting conditions. In addition, we successfully applied our models on real DPT experiments, including fluorescence and shadowgraph data, showing that they can be used beyond conventional DPT applications, including the tracking of sprays and droplets. A pre-trained, ready-to-use version of DefocusTrackerAI based on YOLOv9 is available at https://gitlab. com/goncalo.

coutinho/defocustrackerAI-main/-/tree/7e0f11f649ebad50e20dca5b9545f26ca303ebe0 and can be used for automatic detection of defocused particle images of any kind with high accuracy. In combination with a suitable calibration approach for the depth position, it can be used as an effective first step for three-dimensional defocusing particle tracking.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Taha Koleilat, Hassan Rivaz, Yiming Xiao

6d ago

FeaturedOriginal

Evi-Steer: Learning to Steer Biomedical Vision-Language Models through Efficient and Generalizable Evidential Tuning

AI Summary

Evi-Steer introduces a novel evidential tuning framework for BiomedCLIP, enabling efficient fine-tuning with only 0.11% parameter updates. It significantly enhances performance in few-shot learning and domain shifts across 15 biomedical imaging datasets, demonstrating robustness for clinical applications.

#AI Coding #Inference #Open Source