AI for Maritime Security | AI Deep Signal

AI for Maritime Security: Comparative Evaluation of CNN and Vision Transformer Architectures for Maritime Object Detection

arXiv cs.CV·Ismet Gocer, Zakirul Bhuiayn, Shakeel Ahmad, Raza Hasan

6/16/2026

·~1 min·6/16/2026·en·1

Quick Answer

This study evaluates six deep learning models for maritime object detection, including CNNs and Vision Transformers.

Quick Take

The Vision Transformer achieved 100% accuracy with the fastest processing time, highlighting its potential for maritime security applications.

Key Points

Used a dataset of 6,468 maritime images under various weather conditions.
Evaluated six architectures: CNN, Xception, VGG16, MobileNetV2, EfficientNetV2L, and Vision Transformer.
Vision Transformer outperformed others with 100% accuracy and lowest error rates.
Performance varies based on computational constraints and deployment conditions.
Lightweight models are suitable for resource-limited devices.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Source Excerpt

arXiv:2606. 14720v1 Announce Type: new Abstract: This study aims to enhance maritime security by using advanced Artificial Intelligence (AI) and Computer Vision (CV) techniques. For this purpose, it was designed and assessed intelligent object detection systems that can detect the presence of ships on the sea surface under different real-time environments.

To achieve this goal, a maritime image dataset with 6,468 images was used, covering different weather conditions like cloudy, foggy, rainy, and sunny environments. …

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Aavash Chhetri, Bibek Niroula, Eduard Vazquez, Yash Raj Shrestha, Prashnna Gyawali, Loris Bazzani, Binod Bhattarai

3w ago

FeaturedOriginal

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

AI Summary

ProMoE-FL introduces a Prototype-conditioned Mixture-of-Experts framework for multimodal federated learning, effectively addressing missing modalities. It outperforms existing methods on four chest X-ray datasets, demonstrating superior feature synthesis capabilities in both homogeneous and heterogeneous settings.

#LLM #AI Coding #AI Startup #Enterprise AI

AI for Maritime Security: Comparative Evaluation of CNN and Vision Transformer Architectures for Maritime Object Detection

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CV

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

-Guided ANN Index Optimization for Human-Object Interaction Retrieval

Eddeep: a deep-learning framework for fast eddy-current distortion correction in diffusion MRI

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CV

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

Eddeep: a deep-learning framework for fast eddy-current distortion correction in diffusion MRI

-Guided ANN Index Optimization for Human-Object Interaction Retrieval