Treballs Fi de MàsterEnginyeria Informàtica i Matemàtiques

Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals

  • Identification data

    Identifier:  TFM:946
    Authors:  Farouk Marei, David George
  • Others:

    Entity: Universitat Rovira i Virgili (URV)
    Confidenciality: No
    Education area(s): Enginyeria de la Seguretat Informàtica i Intel·ligència Artificial
    Title in different languages: Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals
    Abstract: Object Detection and Depth Estimation methods can help visual impaired individuals to understand the scene in front of them. there are multiple applications that provide help to those individuals by connecting them (by a video call) to others who can help in describing the scene to them. However, We believe that I can give an alternative to those applications using light and fast machine learning models and avoid the interaction of the human support. The project is divided into Two parts. The First part is Object Detection in a scene. We used YOLOV5S model that was trained on the COCO date-set with 80 different objects. The second part in the depth estimation model. We used Midas from pytorch after trying multiple depth models. the depth estimation model will help me to estimate the distance of each object extracted from the depth model. The final goal is to take an image or a video from the user and extract the objects with the distance of each of them and send this data to the user in a sound note format.
    Subject: Enginyeria informàtica
    Academic year: 2020-2021
    Language: en
    Work's public defense date: 2021-09-10
    Subject areas: Computer engineering
    Student: Farouk Marei, David George
    Department: Enginyeria Informàtica i Matemàtiques
    Creation date in repository: 2022-03-17
    Title in original language: Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals
    Access Rights: info:eu-repo/semantics/openAccess
    Project director: Rashwan, Hatem A.; Puig, Dominque
  • Keywords:

    Enginyeria informàtica
    Computer engineering
    Ingeniería informática
  • Documents:

  • Cerca a google

    Search to google scholar