Ethiopic and Latin multilingual text detection from images using hybrid techniques

Atirsaw  Awoke; Menore  Tekeba

download PDF

Published:

Nov 4, 2021

Issue

Vol. 39 No. 1 (2021)

Section

Articles

Atirsaw Awoke

Menore Tekeba

Abstract

Caption and scene texts found in images contain valuable information. These texts can be used for many applications to answer questions like what, when, where, and by who to give context to the images. So, automatic text detection enhances the user’s understanding of the media content. In Ethiopia, most street posts and promotional boards are written in multilingual characters
such as Latin (English, Afaan Oromo etc.) and Ethiopic (Amharic, Tigrigna etc.). In this work, we have studied Ethiopic and Latin
multilingual text detection from images for both caption and scene texts. After the images are pre-processed, maximally stable extremal region (MSER) algorithm, aspect ratio and stroke width transform (SWT) algorithm are used to extract text regions, respectively. Then texture features are computed using local binary patterns (LBP) from the extracted regions. Finally, the support vector machine (SVM) is used to classify text region vs nontext using the computed LBP features. We prepared a new multilingual Ethiopic and Latin script image dataset to evaluate our method.

Zede Journal
Journal / Zede Journal / Vol. 39 No. 1 (2021) / Articles

Published:

Ethiopic and Latin multilingual text detection from images using hybrid techniques

Atirsaw Awoke

Menore Tekeba

Abstract

Journal Identifiers

Article Sidebar

Published:

Article Details

Main Article Content

Atirsaw Awoke

Menore Tekeba

Abstract

Journal Identifiers