Biomazon: A Multimodal Dataset for 3D Forest Structure and Biomass Modeling in the Amazon Basin
Quick Answer
Biomazon introduces a 20 m multimodal dataset for predicting 3D forest structure and biomass in the Amazon Basin, integrating GEDI RH profiles and AGBD with multi-sensor data.
Quick Take
Biomazon introduces a 20 m multimodal dataset for predicting 3D forest structure and biomass in the Amazon Basin, integrating GEDI RH profiles and AGBD with multi-sensor data. This benchmark facilitates machine learning evaluations of forest vertical structure and biomass modeling, establishing a reference for future research.
Key Points
- Biomazon dataset pairs GEDI RH profiles with AGBD using multi-sensor predictors.
- Standardized spatial splits and evaluation protocols enhance machine learning benchmarking.
- Comprehensive ablation study evaluates model scale, modality contributions, and embeddings.
- Baseline performance compared with existing products like GEDI L4D RH10-RH98.
- Establishes a reference for structurally consistent RH-profile prediction in tropical forests.
Article Content
From source RSS / original summaryarXiv:2606. 05368v1 Announce Type: new Abstract: Accurate, spatially explicit characterization of tropical forest structure is essential for carbon accounting and ecosystem monitoring, yet most ML pipelines predict canopy-top height proxies (e. g. , RH95/RH98) or AGBD as separate scalar targets, rather than learning the forest vertical structure as an ordered profile.
The community lacks a ML-ready multimodal benchmark for predicting the entire GEDI RH profile jointly with AGBD, or for evaluating methods that enforce physically consistent ordering across RH percentiles. We address this with Biomazon, a 20 m multimodal benchmark dataset over the Amazon Basin that pairs GEDI RH and AGBD targets with multi-sensor predictors (Sentinel-1/2, ALOS-2 PALSAR-2, Copernicus DEM, Dynamic World LULC, and AlphaEarth embeddings) under standardized spatial splits and evaluation protocols.
Using a shared encoder-decoder with task-specific heads as a baseline framework, we conduct a comprehensive ablation study of (i) backbone/model scale, (ii) modality contributions, and (iii) the use of auxiliary embeddings under standalone and fusion settings, and we report both single-target and joint-target results to quantify tradeoffs under a unified training protocol.
Finally, we contextualize baseline performance through regionally aligned comparisons against existing gridded products, including GEDI L4D RH10-RH98 and AGBD, at matching temporal scale. Biomazon, together with the accompanying protocols and baseline results, establishes a reference benchmark for future work on structurally consistent RH-profile prediction and structure-biomass modeling in tropical forests.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CV
See more →LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval
A phase-aware LLM agent optimizes human-object interaction retrieval, outperforming Optuna TPE by 33.3% and VDTuner by 34.2% on the HICO-DET benchmark. This method enhances throughput by 15.3x over UniIR and demonstrates strong transferability across vector database management systems.