Enhancing Distance Prediction through Monocular Depth Estimation based on Graph Convolutional Networks

Masoumian, Armin

Identifying data

Identifier: TDX:4357

Handle: https://hdl.handle.net/20.500.11797/TDX4357

Authors: Masoumian, Armin

Abstract:
As the field of robotics and autonomous vehicles advances, the demand for precise depth measurements becomes increasingly pronounced. Depth estimation (DE), a fundamental task in computer vision, plays a pivotal role in achieving this accuracy, with deep learning (DL) techniques offering a viable solution. Particularly, self-supervised monocular depth estimation (MDE) represents cutting-edge technology, allowing the estimation of object depth in a scene from a single image, eliminating the need for expensive stereoscopic or 3D cameras. Graph convolutional networks (GCNs) have further improved the accuracy of DE models by accommodating non-Euclidean data, while combining multiple loss functions has enhanced the reliability of depth predictions. This study explores the extensive applications of self-supervised MDE and provides a comprehensive review of recent advancements in the field using DL techniques. It delves into key aspects like input data shapes, training methods, and evaluation criteria while also addressing the limitations of DL-based MDE models, including challenges related to accuracy, computational efficiency, real-time feasibility, domain adaptation, and generalization. Furthermore, the research introduces an innovative MDE approach leveraging GCNs for estimating depth maps from monocular videos, outperforming existing state-of-the-art methods. Additionally, a novel deep learning framework is presented, seamlessly integrating DE and object detection within a single image, achieving impressive accuracy, particularly in outdoor scenarios. In summary, this study underscores the efficiency of the self-supervised MDE approach based on graph convolutional networks, providing both quantitative and qualitative comparisons with state-of-the-art methods, emphasizing the considerable advantages of the proposed depth prediction technique.
Other:

Publisher: Universitat Rovira i Virgili
Date: 2024-02-07, 2024-04-09T09:26:03Z, 2024-04-09T09:26:03Z
Identifier: http://hdl.handle.net/10803/690512
Department/Institute: Departament d'Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili.
Language: eng
Author: Masoumian, Armin
Supervisors: Abdellatif Fatahallah Ibrahim Mahmoud, Hatem; Cristiano Rodríguez, Julián Efrén; Puig Valls, Domènec Savi
Source: TDX (Tesis Doctorals en Xarxa)
Format: application/pdf, 180 p.

Keywords:

Depth Estimation
Computer Vision
Deep Learning
Estimación de la Profundidad
Visión por Computador
Aprendizaje Profundo
Estimació de la Profunditat
Visió per Computador
Aprenentatge Profund
Sciences
Documents:

Dissertation