From Local Training to Large-Scale Mapping: A Comparative Assessment of Machine Learning and Deep Learning for Transferable Satellite-Derived Bathymetry
Quick Take
This study evaluates machine learning and deep learning for satellite-derived bathymetry (SDB) using Sentinel-2 imagery, achieving RMSE as low as 0.19 m on the MagicBathyNet benchmark. The use of a Smooth Weight Function (SWF) and spatial continuity in training significantly enhances performance, particularly in complex coastal environments. Optimized architectures and pretrained weights are released for scalable application across new sites.
Key Points
- Intra-regional RMSE ranges from 1.15 to 1.92 m, with deep models showing robustness.
- Proposed networks outperform U-Net with 0.19-0.22 m RMSE on the MagicBathyNet benchmark.
- Training on multi-temporal imagery reduces noise from varying environmental conditions.
- Smooth Weight Function (SWF) emphasizes near-surface depths for improved accuracy.
- Optimized architectures and pretrained weights facilitate scalable transfer to new sites.
Article Content
From source RSS / original summaryarXiv:2606. 02764v1 Announce Type: new Abstract: Satellite-derived bathymetry (SDB) from multispectral imagery is cost-effective but scales poorly across regions, especially in optically complex coastal environments. We evaluate machine learning and deep learning for transferable SDB over the 0-20 m depth range using Sentinel-2 imagery.
A Random Forest baseline and four CNNs (ResNet-50, ResNet-101, EfficientNet-B4, ConvNeXt-Large) are trained on Pratas Island and selected Great Barrier Reef regions, then evaluated on spatially independent intra- and cross-regional test areas. Preserving spatial continuity during training, by keeping contiguous reef blocks rather than random patches, is the single most impactful design choice; we further introduce a Smooth Weight Function (SWF)-weighted RMSE loss that emphasizes near-surface depths.
With these choices, intra-regional RMSE ranges from 1. 15 to 1. 92 m over 0-20 m and is as low as 0. 26 m for depths 2. 99-3. 78 m), while the deep models stay more robust (2. 46-2. 98 m). On the public MagicBathyNet aerial-RGB benchmark (0-16 m) the proposed networks reach 0. 19-0. 22 m RMSE, outperforming a U-Net baseline and a task-specific transformer architecture with substantially fewer parameters.
We further exploit multi-temporal repeat imagery: training on it broadens diversity, and median-aggregating predictions across passes at inference reduces noise from changing sun angles, atmospheric conditions, water properties, and tides. We release optimized architectures and pretrained weights to enable scalable transfer to new sites.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CV
See more →Plan2Map: A Multimodal Benchmark for Document-Grounded Geospatial Boundary Reconstruction from Planning Records
Plan2Map introduces a 208-case benchmark for reconstructing geospatial boundaries from UK planning documents. The GeoPlanAgent system achieves a mean IoU of 0.736, significantly outperforming baseline models, highlighting the challenges in localization and map registration.