Title: Hear Me Out: A Study on the Use of the Voice Modality for Crowdsourced Relevance Assessments

Apr 21, 2023

Nirmal Roy, Agathe Balayn, David Maxwell, Claudia Hauff

Abstract: The creation of relevance assessments by human assessors (nowadays often crowdworkers) is a vital step when building IR test collections. Prior works have investigated assessor quality and behaviour, though not the impact of a document's presentation modality on assessor efficiency and effectiveness. Given the rise of voice-based interfaces, we investigate whether it is feasible for assessors to judge the relevance of text documents via a voice-based interface. We ran a user study (n = 49) on a crowdsourcing platform where participants judged the relevance of short and long documents, sampled from the TREC Deep Learning corpus, presented to them in either the text or the voice modality. We found that: (i) participants are equally accurate in their judgements across the text and voice modalities; (ii) with increased document length, it takes participants significantly longer to make relevance judgements in the voice condition (for documents longer than 120 words, almost twice as long); and (iii) the ability of assessors to ignore stimuli that are not relevant (i.e., inhibition) impacts assessment quality in the voice modality: assessors with higher inhibition are significantly more accurate than those with lower inhibition. Our results indicate that we can reliably leverage the voice modality as a means to effectively collect relevance labels from crowdworkers.

* Accepted at SIGIR 2023
