Tesis doctoralsDepartament d'Enginyeria Informàtica i Matemàtiques

Real-Time Localization of Multi-Oriented Text in Natural Scene Images

  • Dades identificatives

    Identificador:  TDX:3140
    Autors:  Gironés Sancho, Xavier
    Resum:
    This thesis focuses on the problem of text localization in natural scene images from the perspective of time-efficiency. Towards this end, a multi-oriented text localization method in natural images suitable for real-time processing of high-definition video on portable and mobile devices is introduced. The proposed method is based on the connected component (CC) approach: First, CCs are isolated by convolving a multi-scale pyramid with a specifically designed linear spatial filter, followed by hysteresis thresholding. Next, non-textual CCs are pruned employing a cascade of local classifiers fed with increasingly extended feature vectors, where the stroke width feature is estimated in linear time complexity by computing the maximal inscribed squares in the CCs. Candidate CCs and their neighbors are subsequently checked with a context-aware classifier that takes into account the target CCs and their vicinity. Lastly, text sequences are extracted in all pyramid levels and fused using dynamic programming. The proposed method is capable of processing 1080p HD video at nearly 30 frames per second on a standard laptop without requiring a GPU. Furthermore, when benchmarked on the ICDAR 2013 Robust Reading and on the ICDAR 2015 Incidental Scene Text datasets, it performed more than twice faster than the state-of-the-art, while still delivering competitive results in terms of precision and recall. Additionally, this thesis introduces a new family or rational approximations of the arctangent function valid in the [0, π/2] range that can be easily extended to two and four quadrants, and a new technique for vehicle license plate localization in unconstrained environments is presented as a practical use case leveraging the text localization system described in this research.
  • Altres:

    Editor: Universitat Rovira i Virgili
    Data: 2021-03-16, 2021-04-29T09:39:49Z, 2021-04-29T09:39:49Z
    Identificador: http://hdl.handle.net/10803/671518
    Departament/Institut: Departament d'Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili.
    Idioma: eng
    Autor: Gironés Sancho, Xavier
    Director: Julià Ferré, Maria Carmen
    Font: TDX (Tesis Doctorals en Xarxa)
    Format: application/pdf, application/pdf, 114
  • Paraules clau:

    License plate detection
    Real-time
    Text localization
    Detección de matrículas
    Tiempo real
    Localización de texto
    Detecció de matrícules
    Temps real
    Localització de text
    Enginyeria i arquitectura
  • Documents:

  • Cerca a google

    Search to google scholar