Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Morris

Evaluation of Automated Image Descriptions for Visually Impaired Students

Jun 29, 2021

Anett Hoppe, David Morris, Ralph Ewerth

Figure 1 for Evaluation of Automated Image Descriptions for Visually Impaired Students

Abstract:Illustrations are widely used in education, and sometimes, alternatives are not available for visually impaired students. Therefore, those students would benefit greatly from an automatic illustration description system, but only if those descriptions were complete, correct, and easily understandable using a screenreader. In this paper, we report on a study for the assessment of automated image descriptions. We interviewed experts to establish evaluation criteria, which we then used to create an evaluation questionnaire for sighted non-expert raters, and description templates. We used this questionnaire to evaluate the quality of descriptions which could be generated with a template-based automatic image describer. We present evidence that these templates have the potential to generate useful descriptions, and that the questionnaire identifies problems with description templates.

* Hoppe A., Morris D., Ewerth R. (2021) Evaluation of Automated Image Descriptions for Visually Impaired Students. In: Roll I., McNamara D., Sosnovsky S., Luckin R., Dimitrova V. (eds) AIED 2021. LNCS vol 12749. Springer, Cham
* 6 pages, 12 references. Accepted for publication at the 22nd International Conference on Artificial Intelligence in Education (AIED 2021), June 14-16 2021, Utrecht, The Netherlands

Via

Access Paper or Ask Questions

SlideImages: A Dataset for Educational Image Classification

Jan 19, 2020

David Morris, Eric Müller-Budack, Ralph Ewerth

Figure 1 for SlideImages: A Dataset for Educational Image Classification

Figure 2 for SlideImages: A Dataset for Educational Image Classification

Abstract:In the past few years, convolutional neural networks (CNNs) have achieved impressive results in computer vision tasks, which however mainly focus on photos with natural scene content. Besides, non-sensor derived images such as illustrations, data visualizations, figures, etc. are typically used to convey complex information or to explore large datasets. However, this kind of images has received little attention in computer vision. CNNs and similar techniques use large volumes of training data. Currently, many document analysis systems are trained in part on scene images due to the lack of large datasets of educational image data. In this paper, we address this issue and present SlideImages, a dataset for the task of classifying educational illustrations. SlideImages contains training data collected from various sources, e.g., Wikimedia Commons and the AI2D dataset, and test data collected from educational slides. We have reserved all the actual educational images as a test dataset in order to ensure that the approaches using this dataset generalize well to new educational images, and potentially other domains. Furthermore, we present a baseline system using a standard deep neural architecture and discuss dealing with the challenge of limited training data.

* 8 pages, 2 figures, to be presented at ECIR 2020

Via

Access Paper or Ask Questions