Promising Depth Map Prediction Method from a Single Image Based on Conditional Generative Adversarial Network

  • Identification data

    Identifier:  imarina:9380784
    Authors:  Abdulwahab, S; Rashwan, HA; Masoumian, A; Sharaf, N; Puig, D
    Abstract:
    Pose estimation is typically performed using 3D images, whereas estimating pose from a single RGB image remains a difficult task. RGB images represent not only the objects' shape but also intensity, which depends on viewpoint, texture, and lighting conditions. By contrast, 3D pose estimation from depth images is considered a promising approach because a depth image represents only the objects' shape. It is therefore necessary to identify an appropriate method for predicting a depth image from a 2D RGB image, which can then be used for 3D pose estimation. In this paper, we propose an approach based on a deep learning model for depth estimation in order to improve 3D pose estimation. The proposed model consists of two successive networks. The first is an autoencoder network that maps from the RGB domain to the depth domain. The second is a discriminator network that compares a real depth image with a generated depth image, helping the first network to generate an accurate depth image. In this work, we do not use real depth images corresponding to the input color images. Our contribution is to use 3D CAD models of the objects appearing in the color images to render depth images from different viewpoints. These rendered images are then used as ground truth to guide the autoencoder network in learning the mapping from the image domain to the depth domain. The proposed model outperforms state-of-the-art models on the publicly available PASCAL 3D+ dataset.
  • Others:

    Link to the original source: https://ebooks.iospress.nl/doi/10.3233/FAIA210159
    APA: Abdulwahab, S; Rashwan, HA; Masoumian, A; Sharaf, N; Puig, D (2021). Promising Depth Map Prediction Method from a Single Image Based on Conditional Generative Adversarial Network. Amsterdam: IOS Press
    Paper original source: Fuzzy Logic-Based Variable Encoding For Improved Diabetic Retinopathy Prediction. 339 392-401
    Article's DOI: 10.3233/FAIA210159
    Journal publication year: 2021-01-01
    Entity: Universitat Rovira i Virgili
    Paper version: info:eu-repo/semantics/publishedVersion
    Record's date: 2026-05-09
    URV's Author/s: Abdellatif Fatahallah Ibrahim Mahmoud, Hatem / Abdulwahab, Saddam Abdulrhman Hamed / Masoumian, Armin / Puig Valls, Domènec Savi
    Department: Enginyeria Informàtica i Matemàtiques
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication Type: Proceedings Paper
    Author, as appears in the article: Abdulwahab, S; Rashwan, HA; Masoumian, A; Sharaf, N; Puig, D
    Licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Thematic Areas: Interdisciplinary, Information and documentation, General or multidisciplinary, Communication and information, Agricultural sciences I, Artificial intelligence
    Author's mail: hatem.abdellatif@urv.cat, armin.masoumian@estudiants.urv.cat, saddam.abdulwahab@urv.cat, domenec.puig@urv.cat
  • Keywords:

    Unet++
    Unet plus
    Unet
    Image to image translation
    Image segmentation
    Depth prediction
    Deep learning
    Artificial Intelligence
    Interdisciplinary
    Information and documentation
    General or multidisciplinary
    Communication and information
    Agricultural sciences I