Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals

Farouk Marei, David George

Datos identificativos

Identificador: TFM:946

Handle: https://hdl.handle.net/20.500.11797/TFM946

Autores:
Farouk Marei, David George
Otros:

Título en diferentes idiomas: Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals
Áreas temàticas: Enginyeria informàtica Computer engineering Ingeniería informática
Confidencialidad: No
Curso académico: 2020-2021
Estudiante: Farouk Marei, David George
Departamento: Enginyeria Informàtica i Matemàtiques
Derechos de Accesso: info:eu-repo/semantics/openAccess
Fecha de la defensa del trabajo: 2021-09-10
Director del proyecto: Rashwan, Hatem A.; Puig, Dominque
Resumen: Els mètodes de detecció d'objectes i estimació de profunditat poden ajudar els individus amb discapacitat visual a entendre l'escena davant d'ells. Hi ha múltiples aplicacions que proporcionen ajuda a aquests individus connectant-los (per una videotrucada) a altres que poden ajudar-los a descriure l'escena. No obstant això, creiem que puc donar una alternativa a aquestes aplicacions utilitzant models d'aprenentatge automàtic ràpid i lluminós i evitar la interacció del suport humà. El projecte es divideix en dues parts. La primera part és la detecció d'objectes en una escena. Utilitzem el model YOLOV5S que es va formar en el conjunt de dates del COCO amb 80 objectes diferents. La segona part del model d'estimació de profunditat. Vam utilitzar Midas de pytorch després de provar diversos models de profunditat. El model d'estimació de profunditat m'ajudarà a estimar la distància de cada objecte extret del model de profunditat. L'objectiu final és prendre una imatge o un vídeo de l'usuari i extreure els objectes amb la distància de cadascun d'ells i enviar aquestes dades a l'usuari en un format de nota de so. Object Detection and Depth Estimation methods can help visual impaired individuals to understand the scene in front of them. there are multiple applications that provide help to those individuals by connecting them (by a video call) to others who can help in describing the scene to them. However, We believe that I can give an alternative to those applications using light and fast machine learning models and avoid the interaction of the human support. The project is divided into Two parts. The First part is Object Detection in a scene. We used YOLOV5S model that was trained on the COCO date-set with 80 different objects. The second part in the depth estimation model. We used Midas from pytorch after trying multiple depth models. the depth estimation model will help me to estimate the distance of each object extracted from the depth model. The final goal is to take an image or a video from the user and extract the objects with the distance of each of them and send this data to the user in a sound note format.
Materia: Enginyeria informàtica
Entidad: Universitat Rovira i Virgili (URV)
Idioma: en
Enseñanza(s): Enginyeria de la Seguretat Informàtica i Intel·ligència Artificial
Título en la lengua original: Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals
Fecha de alta en el repositorio: 2022-03-17

Palabras clave:

Enginyeria informàtica
Computer engineering
Ingeniería informática
Enginyeria informàtica
Documentos:

Memòria
Cerca a google

Repositori URV

Treballs Fi de Màster> Enginyeria Informàtica i Matemàtiques

Intelligent Assistant System Based on Single Camera and Deep Neural Networks for Aiding Visual Impaired Individuals

Datos identificativos

Otros:

Palabras clave:

Documentos:

Cerca a google