Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots

Singh, A; Narula, R; Rashwan, HA; Abdel-Nasser, M; Puig, D; Nandi, GC

doi:10.1007/s00521-022-07273-7

Dades identificatives

Identificador: imarina:9262032

Handle: https://hdl.handle.net/20.500.11797/imarina9262032

Autors: Singh, A; Narula, R; Rashwan, HA; Abdel-Nasser, M; Puig, D; Nandi, GC

Resum:
Semantic mapping is still challenging for household collaborative robots. Deep learning models have proved their capability to extract semantics from the scene and learn robot odometry. For interfacing semantic information with robot odometry, existing approaches extract both semantics and robot odometry separately and then integrate them using fusion techniques. Such approaches face many issues while integration, and the mapping procedure requires a lot of memory and resources to process the information. In an attempt to produce accurate semantic mapping with resource-limited devices, this paper proposes an efficient deep learning-based model to simultaneously estimate robot odometry by using monocular sequence frames and detecting objects in the frames. The proposed model includes two main components: using a YOLOv3 object detector as a backbone and a convolutional long short-term (Conv-LSTM) recurrent neural network to model the changes in camera pose. The unique advantage of the proposed model is that it boycotts the need for data association and the requirement of multi-sensor fusion. We conducted the experiments on a LoCoBot robot in a laboratory environment, attaining satisfactory results with such limited computational resources. Additionally, we tested the proposed method on the Kitti dataset, reaching an average test loss of 15.93 on various sequences. The experiments are documented in this video https://www.youtube.com/watch?v=hnmqwxpaTEw.
Altres:

Enllaç font original: https://link.springer.com/article/10.1007/s00521-022-07273-7
Referència de l'ítem segons les normes APA: Singh, A; Narula, R; Rashwan, HA; Abdel-Nasser, M; Puig, D; Nandi, GC (2022). Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots. Neural Computing & Applications, 34(18), 15617-15631. DOI: 10.1007/s00521-022-07273-7
Referència a l'article segons font original: Neural Computing & Applications. 34 (18): 15617-15631
DOI de l'article: 10.1007/s00521-022-07273-7
Any de publicació de la revista: 2022-09-01
Entitat: Universitat Rovira i Virgili
Versió de l'article dipositat: info:eu-repo/semantics/acceptedVersion
Data d'alta del registre: 2026-05-09
Autor/s de la URV: Abdellatif Fatahallah Ibrahim Mahmoud, Hatem / Abdelnasser Mohamed Mahmoud, Mohamed / Puig Valls, Domènec Savi / Singh, Aditya
Departament: Enginyeria Informàtica i Matemàtiques
URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
Tipus de publicació: Journal Publications
Autor segons l'article: Singh, A; Narula, R; Rashwan, HA; Abdel-Nasser, M; Puig, D; Nandi, GC
Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
Àrees temàtiques: Software, Engenharias iv, Computer science, artificial intelligence, Artificial intelligence, Administração pública e de empresas, ciências contábeis e turismo
Adreça de correu electrònic de l'autor: hatem.abdellatif@urv.cat, hatem.abdellatif@urv.cat, mohamed.abdelnasser@urv.cat, mohamed.abdelnasser@urv.cat, aditya.singh@urv.cat, hatem.abdellatif@urv.cat, domenec.puig@urv.cat, domenec.puig@urv.cat

Paraules clau:

Visual odometry
Slam
Real-time
Object detection
Mapping
Household robots
Decent work and economic growth
Agglomerative clustering
Artificial Intelligence
Computer Science
Software
Engenharias iv
Administração pública e de empresas
ciências contábeis e turismo
Documents:

DocumentPrincipal
Cerca a google

Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots

Dades identificatives

Altres:

Paraules clau:

Documents:

Cerca a google