Applying Deep Learning for cockpit segmentation in the context of mixed reality

arXiv cs.CV·Alexandre Leles Sousa, Pedro de Oliveira Nielson, Erick Oliveira Rodrigues, Rafael Francisco dos Santos, Giovani Bernardes Vitor

3h ago

·~1 min·6/8/2026·en·0

Quick Answer

This study utilizes U-net and DeepLabV3+ convolutional neural networks for cockpit image segmentation, achieving approximately 90% accuracy.

Quick Take

This study utilizes U-net and DeepLabV3+ convolutional neural networks for cockpit image segmentation, achieving approximately 90% accuracy. The approach enhances mixed reality experiences by effectively distinguishing foreground from background in real-time images captured from a CAT793F truck simulator.

Key Points

Utilized U-net and DeepLabV3+ for image segmentation in mixed reality.
Achieved around 90% accuracy in distinguishing foreground and background.
Real images were captured using a CAT793F off-highway truck simulator.
Enhances user immersion in simulated environments through effective image processing.
Focuses on integrating virtual objects with real-world imagery.

Article Excerpt

From source RSS / original summary

arXiv:2606. 06520v1 Announce Type: new Abstract: Computer vision is an area that has been growing continuously. With the advance of technologies with a first-person view, new development opportunities have emerged inside the area. Mixed reality promotes virtual environments with objects from the physical world shown in real time. For that, it's necessary to be concerned with the immersion of the user in this simulated environment, increasingly seeking to bring it closer to a possible desired reality.

This paper proposes the development of image processing in order to perform the segmentation of images to identify what is foreground and background in order to facilitate the union of virtual and real images. Thus, the present work obtain real images of the user using the off-highway truck simulator CAT793F, through a camera, to be able to perform the segmentation of such images with artificial intelligence techniques.

The convolutional neural network architectures "U-net" and "DeepLabV3+" are applied to perform image segmentation. As a result, metrics with around 90% accuracy were presented and and the best model was determined.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Shahrzad Esmat, Chaunte W. Lacewell, Sameh Gobriel, Nilesh Jain, Ali Jannesari

3d ago

FeaturedOriginal

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

AI Summary

A phase-aware LLM agent optimizes human-object interaction retrieval, outperforming Optuna TPE by 33.3% and VDTuner by 34.2% on the HICO-DET benchmark. This method enhances throughput by 15.3x over UniIR and demonstrates strong transferability across vector database management systems.

#LLM #Agent #Inference #AI Startup

Applying Deep Learning for cockpit segmentation in the context of mixed reality

Quick Answer

Quick Take

Key Points

Article Excerpt

Want this in your inbox every morning?

More from arXiv cs.CV

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

Biomazon: A Multimodal Dataset for 3D Forest Structure and Biomass Modeling in the Amazon Basin

Optimal Transport Flow Matching by Design

Related in this space

The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

Aptiv to Deliver Production-Ready Edge AI with Long-Term Support with NVIDIA