Articles producció científica> Enginyeria Informàtica i Matemàtiques

T-YOLO: Tiny vehicle detection based on YOLO and multi-scale convolutional neural networks

  • Dades identificatives

    Identificador: imarina:9243276
    Autors:
    Carrasco, Daniel PadillaRashwan, Hatem AGarcia, Miguel AngelPuig, Domenec
    Resum:
    To solve real-life problems for different smart city applications, using deep Neural Network, such as parking occupancy detection, requires fine-tuning of these networks. For large parking, it is desirable to use a cenital-plane camera located at a high distance that allows the monitoring of the entire parking space or a large parking area with only one camera. Today’s most popular object detection models, such as YOLO, achieve good precision scores at real-time speed. However, if we use our own data different from that of the general-purpose datasets, such as COCO and ImageNet, we have a large margin for improvisation. In this paper, we propose a modified, yet lightweight, deep object detection model based on the YOLO-v5 architecture. The proposed model can detect large, small, and tiny objects. Specifically, we propose the use of a multi-scale mechanism to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for detecting objects in a scene (i.e., in our case vehicles). The proposed multi-scale module reduces the number of trainable parameters compared to the original YOLO-v5 architecture. The experimental results also demonstrate that precision is improved by a large margin. In fact, as shown in the experiments, the results show a small reduction from 7.28 million parameters of the YOLO-v5-S profile to 7.26 million parameters in our model. In addition, we reduced the detection speed by inferring 30 fps compared to the YOLO-v5-L/X profiles. In addition, the tiny vehicle detection performance was significantly improved by 33% compared to the YOLO-v5-X profile.
  • Altres:

    Autor segons l'article: Carrasco, Daniel Padilla; Rashwan, Hatem A; Garcia, Miguel Angel; Puig, Domenec
    Departament: Enginyeria Informàtica i Matemàtiques
    Autor/s de la URV: Abdellatif Fatahallah Ibrahim Mahmoud, Hatem / GARCIA GARCIA, MIGUEL ANGEL / Puig Valls, Domènec Savi
    Paraules clau: Tiny objects Smart parking Object detection Feature extraction Detectors Convolutional neural networks Computational modeling Cameras Automobiles tiny objects smart parking feature extraction detectors convolutional neural networks computational modeling cameras automobiles
    Resum: To solve real-life problems for different smart city applications, using deep Neural Network, such as parking occupancy detection, requires fine-tuning of these networks. For large parking, it is desirable to use a cenital-plane camera located at a high distance that allows the monitoring of the entire parking space or a large parking area with only one camera. Today’s most popular object detection models, such as YOLO, achieve good precision scores at real-time speed. However, if we use our own data different from that of the general-purpose datasets, such as COCO and ImageNet, we have a large margin for improvisation. In this paper, we propose a modified, yet lightweight, deep object detection model based on the YOLO-v5 architecture. The proposed model can detect large, small, and tiny objects. Specifically, we propose the use of a multi-scale mechanism to learn deep discriminative feature representations at different scales and automatically determine the most suitable scales for detecting objects in a scene (i.e., in our case vehicles). The proposed multi-scale module reduces the number of trainable parameters compared to the original YOLO-v5 architecture. The experimental results also demonstrate that precision is improved by a large margin. In fact, as shown in the experiments, the results show a small reduction from 7.28 million parameters of the YOLO-v5-S profile to 7.26 million parameters in our model. In addition, we reduced the detection speed by inferring 30 fps compared to the YOLO-v5-L/X profiles. In addition, the tiny vehicle detection performance was significantly improved by 33% compared to the YOLO-v5-X profile.
    Àrees temàtiques: Telecommunications Materials science (miscellaneous) Materials science (all) General materials science General engineering General computer science Engineering, electrical & electronic Engineering (miscellaneous) Engineering (all) Engenharias iv Engenharias iii Electrical and electronic engineering Computer science, information systems Computer science (miscellaneous) Computer science (all) Ciência da computação
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Adreça de correu electrònic de l'autor: miguelangel.garciag@urv.cat hatem.abdellatif@urv.cat domenec.puig@urv.cat
    Identificador de l'autor: 0000-0001-9972-2182 0000-0001-5421-1637 0000-0002-0562-4205
    Data d'alta del registre: 2024-09-21
    Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    Referència a l'article segons font original: Ieee Access. 11 22430-22440
    Referència de l'ítem segons les normes APA: Carrasco, Daniel Padilla; Rashwan, Hatem A; Garcia, Miguel Angel; Puig, Domenec (2023). T-YOLO: Tiny vehicle detection based on YOLO and multi-scale convolutional neural networks. Ieee Access, 11(), 22430-22440. DOI: 10.1109/ACCESS.2021.3137638
    Entitat: Universitat Rovira i Virgili
    Any de publicació de la revista: 2023
    Tipus de publicació: Journal Publications
  • Paraules clau:

    Computer Science (Miscellaneous),Computer Science, Information Systems,Engineering (Miscellaneous),Engineering, Electrical & Electronic,Materials Science (Miscellaneous),Telecommunications
    Tiny objects
    Smart parking
    Object detection
    Feature extraction
    Detectors
    Convolutional neural networks
    Computational modeling
    Cameras
    Automobiles
    tiny objects
    smart parking
    feature extraction
    detectors
    convolutional neural networks
    computational modeling
    cameras
    automobiles
    Telecommunications
    Materials science (miscellaneous)
    Materials science (all)
    General materials science
    General engineering
    General computer science
    Engineering, electrical & electronic
    Engineering (miscellaneous)
    Engineering (all)
    Engenharias iv
    Engenharias iii
    Electrical and electronic engineering
    Computer science, information systems
    Computer science (miscellaneous)
    Computer science (all)
    Ciência da computação
  • Documents:

  • Cerca a google

    Search to google scholar