Autor segons l'article: Carrasco, Daniel Padilla; Rashwan, Hatem A; Garcia, Miguel Angel; Puig, Domenec
Departament: Enginyeria Informàtica i Matemàtiques
Autor/s de la URV: Abdellatif Fatahallah Ibrahim Mahmoud, Hatem / GARCIA GARCIA, MIGUEL ANGEL / Puig Valls, Domènec Savi
Paraules clau: Tiny objects Smart parking Object detection Feature extraction Detectors Convolutional neural networks Computational modeling Cameras Automobiles tiny objects smart parking feature extraction detectors convolutional neural networks computational modeling cameras automobiles
Resum: To solve real-life problems for different smart city applications, using deep Neural Network, such as parking occupancy detection, requires fine-tuning of these networks. For large parking, it is desirable to use a cenital-plane camera located at a high distance that allows the monitoring of the entire parking space or a large parking area with only one camera. Today’s most popular object detection models, such as YOLO, achieve good precision scores at real-time speed. However, if we use our own data different from that of the general-purpose datasets, such as COCO and ImageNet, we have a large margin for improvisation. In this paper, we propose a modified, yet lightweight, deep object detection model based on the YOLO-v5 architecture. The proposed model can detect large, small, and tiny objects. Specifically, we propose the use of a multi-scale mechanism to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for detecting objects in a scene (i.e., in our case vehicles). The proposed multi-scale module reduces the number of trainable parameters compared to the original YOLO-v5 architecture. The experimental results also demonstrate that precision is improved by a large margin. In fact, as shown in the experiments, the results show a small reduction from 7.28 million parameters of the YOLO-v5-S profile to 7.26 million parameters in our model. In addition, we reduced the detection speed by inferring 30 fps compared to the YOLO-v5-L/X profiles. In addition, the tiny vehicle detection performance was significantly improved by 33% compared to the YOLO-v5-X profile.
Àrees temàtiques: Telecommunications Materials science (miscellaneous) Materials science (all) General materials science General engineering General computer science Engineering, electrical & electronic Engineering (miscellaneous) Engineering (all) Engenharias iv Engenharias iii Electrical and electronic engineering Computer science, information systems Computer science (miscellaneous) Computer science (all) Ciência da computação
Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
Adreça de correu electrònic de l'autor: miguelangel.garciag@urv.cat hatem.abdellatif@urv.cat domenec.puig@urv.cat
Identificador de l'autor: 0000-0001-9972-2182 0000-0001-5421-1637 0000-0002-0562-4205
Data d'alta del registre: 2024-09-21
Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
Enllaç font original: https://ieeexplore.ieee.org/document/9658533
URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
Referència a l'article segons font original: Ieee Access. 11 22430-22440
Referència de l'ítem segons les normes APA: Carrasco, Daniel Padilla; Rashwan, Hatem A; Garcia, Miguel Angel; Puig, Domenec (2023). T-YOLO: Tiny vehicle detection based on YOLO and multi-scale convolutional neural networks. Ieee Access, 11(), 22430-22440. DOI: 10.1109/ACCESS.2021.3137638
DOI de l'article: 10.1109/ACCESS.2021.3137638
Entitat: Universitat Rovira i Virgili
Any de publicació de la revista: 2023
Tipus de publicació: Journal Publications