Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Leveraging machine learning for less developed languages: Progress on Urdu text detection

Sep 28, 2022

Hazrat Ali

Figure 1 for Leveraging machine learning for less developed languages: Progress on Urdu text detection

Figure 2 for Leveraging machine learning for less developed languages: Progress on Urdu text detection

Figure 3 for Leveraging machine learning for less developed languages: Progress on Urdu text detection

Figure 4 for Leveraging machine learning for less developed languages: Progress on Urdu text detection

Share this with someone who'll enjoy it:

Abstract:Text detection in natural scene images has applications for autonomous driving, navigation help for elderly and blind people. However, the research on Urdu text detection is usually hindered by lack of data resources. We have developed a dataset of scene images with Urdu text. We present the use of machine learning methods to perform detection of Urdu text from the scene images. We extract text regions using channel enhanced Maximally Stable Extremal Region (MSER) method. First, we classify text and noise based on their geometric properties. Next, we use a support vector machine for early discarding of non-text regions. To further remove the non-text regions, we use histogram of oriented gradients (HoG) features obtained and train a second SVM classifier. This improves the overall performance on text region detection within the scene images. To support research on Urdu text, We aim to make the data freely available for research use. We also aim to highlight the challenges and the research gap for Urdu text detection.

* NeurIPS ML4D 2021 * Accepted at NeurIPS ML4D workshop. arXiv admin note: text overlap with arXiv:2109.08060

View paper on

Share this with someone who'll enjoy it:

Title:Leveraging machine learning for less developed languages: Progress on Urdu text detection

Paper and Code