Therese Lindström Tiedemann

University of Helsinki

Grandma Karl is 27 years old -- research agenda for pseudonymization of research data

Aug 30, 2023

Elena Volodina, Simon Dobnik, Therese Lindström Tiedemann, Xuan-Son Vu

Abstract: Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared because of the personal and sensitive information it contains, e.g. names or political opinions. The General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely the need for studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted pseudonymization project "Grandma Karl is 27 years old" addresses exactly these challenges.

* Big DataService 2023 conference, 2023 Workshop on Big Data and Machine Learning with Privacy Enhancing Tech, IEEE Catalog Number: CFP23A91-ART, ISBN: 979-8-3503-3379-4


Crowdsourcing Relative Rankings of Multi-Word Expressions: Experts versus Non-Experts

Jun 17, 2022

David Alfter, Therese Lindström Tiedemann, Elena Volodina

Abstract: In this study we investigate to what degree experts and non-experts agree on questions of difficulty in a crowdsourcing experiment. We ask non-experts (second language learners of Swedish) and two groups of experts (teachers of Swedish as a second/foreign language and CEFR experts) to rank multi-word expressions in a crowdsourcing experiment. We find that the resulting rankings by all three tested groups correlate to a very high degree, which suggests that judgments produced in a comparative setting are not influenced by professional insights into Swedish as a second language.

* Northern European Journal of Language Technology. Volume 1. 2021
