Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Abiot Sinamo Boltena

Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Feb 25, 2022

Mesay Samuel Gondere, Lars Schmidt-Thieme, Durga Prasad Sharma, Abiot Sinamo Boltena

Figure 1 for Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Figure 2 for Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Figure 3 for Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Figure 4 for Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Abstract:Amharic is one of the official languages of the Federal Democratic Republic of Ethiopia. It is one of the languages that use an Ethiopic script which is derived from Gee'z, ancient and currently a liturgical language. Amharic is also one of the most widely used literature-rich languages of Ethiopia. There are very limited innovative and customized research works in Amharic optical character recognition (OCR) in general and Amharic handwritten text recognition in particular. In this study, Amharic handwritten word recognition will be investigated. State-of-the-art deep learning techniques including convolutional neural networks together with recurrent neural networks and connectionist temporal classification (CTC) loss were used to make the recognition in an end-to-end fashion. More importantly, an innovative way of complementing the loss function using the auxiliary task from the row-wise similarities of the Amharic alphabet was tested to show a significant recognition improvement over a baseline method. Such findings will promote innovative problem-specific solutions as well as will open insight to a generalized solution that emerges from problem-specific domains.

Via

Access Paper or Ask Questions

Handwritten Amharic Character Recognition Using a Convolutional Neural Network

Sep 23, 2019

Mesay Samuel Gondere, Lars Schmidt-Thieme, Abiot Sinamo Boltena, Hadi Samer Jomaa

Figure 1 for Handwritten Amharic Character Recognition Using a Convolutional Neural Network

Figure 2 for Handwritten Amharic Character Recognition Using a Convolutional Neural Network

Figure 3 for Handwritten Amharic Character Recognition Using a Convolutional Neural Network

Figure 4 for Handwritten Amharic Character Recognition Using a Convolutional Neural Network

Abstract:Amharic is the official language of the Federal Democratic Republic of Ethiopia. There are lots of historic Amharic and Ethiopic handwritten documents addressing various relevant issues including governance, science, religious, social rules, cultures and art works which are very reach indigenous knowledge. The Amharic language has its own alphabet derived from Ge'ez which is currently the liturgical language in Ethiopia. Handwritten character recognition for non Latin scripts like Amharic is not addressed especially using the advantages of the state of the art techniques. This research work designs for the first time a model for Amharic handwritten character recognition using a convolutional neural network. The dataset was organized from collected sample handwritten documents and data augmentation was applied for machine learning. The model was further enhanced using multi-task learning from the relationships of the characters. Promising results are observed from the later model which can further be applied to word prediction.

* ECDA2019 Conference Oral Presentation

Via

Access Paper or Ask Questions