Abstract: In recent years, machine learning models have rapidly become better at generating clinical consultation notes; yet there is little work on how to properly evaluate the generated consultation notes and understand the impact they may have on both the clinician using them and the patient's clinical safety. To address this, we present an extensive human evaluation study of consultation notes in which 5 clinicians (i) listen to 57 mock consultations, (ii) write their own notes, (iii) post-edit a number of automatically generated notes, and (iv) extract all errors, both quantitative and qualitative. We then carry out a correlation study between 18 automatic quality metrics and the human judgements. We find that a simple, character-based Levenshtein distance metric performs on par with, if not better than, common model-based metrics such as BertScore. All our findings and annotations are open-sourced.
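
The claim about the character-based metric can be made concrete with a small sketch. The following is a minimal illustration, not the study's code: it scores generated notes against clinician-written references with a normalised character-level Levenshtein similarity and checks agreement with human judgements via Spearman correlation. All note texts and quality scores below are hypothetical placeholders, not data from the study.

```python
# Minimal sketch (not the study's code): character-level Levenshtein similarity
# between generated and reference notes, correlated with human quality ratings.

from scipy.stats import spearmanr


def levenshtein(a: str, b: str) -> int:
    """Character-level edit distance between strings a and b (dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]


def levenshtein_similarity(hyp: str, ref: str) -> float:
    """Normalise the distance into a 0..1 similarity (1 means identical)."""
    denom = max(len(hyp), len(ref)) or 1
    return 1.0 - levenshtein(hyp, ref) / denom


# Hypothetical generated notes, reference notes, and clinician ratings.
generated = ["Patient reports mild headache for 3 days.",
             "Pt has chest pain radiating to left arm.",
             "Follow-up in two weeks; continue current medication."]
references = ["Patient reports a mild headache lasting three days.",
              "Patient has chest pain radiating to the left arm.",
              "Review in two weeks and continue current medication."]
human_scores = [4.0, 4.5, 3.5]

metric_scores = [levenshtein_similarity(h, r) for h, r in zip(generated, references)]
rho, _ = spearmanr(metric_scores, human_scores)
print(f"Levenshtein similarities: {[round(s, 2) for s in metric_scores]}")
print(f"Spearman correlation with human scores: {rho:.2f}")
```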
Abstract: Clinical notes are an efficient way to record patient information but are notoriously hard for non-experts to decipher. Automatically simplifying medical text can empower patients with valuable information about their health while saving clinicians time. We present a novel approach to automated simplification of medical text based on word frequencies and language modelling, grounded in medical ontologies enriched with layman terms. We release a new dataset of pairs of publicly available medical sentences and versions of them simplified by clinicians. We also define a novel text simplification metric and evaluation framework, which we use to conduct a large-scale human evaluation of our method against the state of the art. Our method, based on a language model trained on medical forum data, generates simpler sentences while preserving both grammar and the original meaning, surpassing the current state of the art.
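
As a rough illustration of the frequency-based idea, the sketch below replaces medical terms with layman alternatives drawn from a hypothetical ontology-derived lexicon, picking whichever phrase is more frequent. The lexicon, the counts, and the use of raw frequencies as a stand-in for a trained language model are all assumptions made for the example, not the paper's implementation.

```python
# Minimal sketch of frequency-based lexical simplification with a hypothetical
# layman lexicon; a language-model score could replace the raw frequencies.

import re

# Stand-in for a medical ontology enriched with layman terms.
LAYMAN_LEXICON = {
    "myocardial infarction": ["heart attack"],
    "hypertension": ["high blood pressure"],
    "cephalalgia": ["headache"],
}

# Stand-in for phrase frequencies estimated from, e.g., medical forum data.
FREQUENCY = {
    "heart attack": 5200, "myocardial infarction": 310,
    "high blood pressure": 4800, "hypertension": 1900,
    "headache": 9700, "cephalalgia": 12,
}


def simplify(sentence: str) -> str:
    """Replace each known medical term with its most frequent layman synonym,
    keeping the original term when no candidate is more frequent."""
    out = sentence
    for term, candidates in LAYMAN_LEXICON.items():
        best = max(candidates + [term], key=lambda c: FREQUENCY.get(c, 0))
        if best != term:
            out = re.sub(re.escape(term), best, out, flags=re.IGNORECASE)
    return out


print(simplify("The patient suffered a myocardial infarction secondary to hypertension."))
# -> "The patient suffered a heart attack secondary to high blood pressure."
```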
Abstract: Recent literature suggests that averaged word vectors followed by simple post-processing outperform many deep learning methods on semantic textual similarity (STS) tasks. Furthermore, when averaged word vectors are trained in a supervised fashion on large corpora of paraphrases, they achieve state-of-the-art results on standard STS benchmarks. Inspired by these insights, we push the limits of word embeddings even further. We propose a novel fuzzy bag-of-words (FBoW) representation for text that contains all the words in the vocabulary simultaneously but with different degrees of membership, which are derived from similarities between word vectors. We show that max-pooled word vectors are only a special case of FBoW and should be compared via the fuzzy Jaccard index rather than cosine similarity. Finally, we propose DynaMax, a completely unsupervised and non-parametric similarity measure that dynamically extracts and max-pools good features depending on the sentence pair. This method is both efficient and easy to implement, yet it outperforms current baselines on STS tasks by a large margin and is even competitive with supervised word vectors trained to directly optimise cosine similarity.
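
The DynaMax-Jaccard measure described above can be sketched in a few lines, assuming pre-trained word vectors are available: the word vectors of both sentences form a shared feature space, each sentence is max-pooled over its (clipped) projections onto that space, and the pooled vectors are compared with the fuzzy Jaccard index. The toy embeddings below are random stand-ins for real vectors such as fastText or word2vec, so this is a sketch of the idea rather than a faithful reproduction of the paper's code.

```python
# Minimal sketch of a DynaMax-Jaccard style similarity, assuming the fuzzy
# Jaccard formulation sum(min)/sum(max) over max-pooled, clipped similarities.

import numpy as np


def dynamax_jaccard(x: np.ndarray, y: np.ndarray) -> float:
    """x: (m, d) word vectors of sentence 1; y: (n, d) word vectors of sentence 2."""
    # Universe: all word vectors from the sentence pair act as features.
    u = np.vstack([x, y])                        # (m + n, d)
    # Membership degrees: similarity of each word to every universe feature,
    # clipped at zero and max-pooled over the words of each sentence.
    x_feat = np.maximum(x @ u.T, 0).max(axis=0)  # (m + n,)
    y_feat = np.maximum(y @ u.T, 0).max(axis=0)
    # Fuzzy Jaccard index between the two pooled fuzzy sets.
    return float(np.minimum(x_feat, y_feat).sum() / np.maximum(x_feat, y_feat).sum())


rng = np.random.default_rng(0)
sent1 = rng.normal(size=(5, 300))   # 5 words, 300-dim toy embeddings
sent2 = rng.normal(size=(7, 300))   # 7 words, 300-dim toy embeddings
print(f"DynaMax-Jaccard similarity: {dynamax_jaccard(sent1, sent2):.3f}")
print(f"Self-similarity: {dynamax_jaccard(sent1, sent1):.3f}")  # 1.0 by construction
```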