Picture for Anya Belz

Anya Belz

HEDS 3.0: The Human Evaluation Data Sheet Version 3.0

Add code
Dec 10, 2024
Viaarxiv icon

Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques

Add code
May 13, 2024
Figure 1 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 2 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 3 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 4 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Viaarxiv icon

High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models

Add code
Feb 19, 2024
Figure 1 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 2 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 3 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 4 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Viaarxiv icon

Assessing the Portability of Parameter Matrices Trained by Parameter-Efficient Finetuning Methods

Add code
Jan 25, 2024
Viaarxiv icon

Data-to-text Generation for Severely Under-Resourced Languages with GPT-3.5: A Bit of Help Needed from Google Translate

Add code
Aug 19, 2023
Viaarxiv icon

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

Add code
May 02, 2023
Figure 1 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 2 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 3 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 4 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Viaarxiv icon

PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques

Add code
Apr 24, 2023
Figure 1 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 2 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 3 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 4 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Viaarxiv icon

Consultation Checklists: Standardising the Human Evaluation of Medical Note Generation

Add code
Nov 17, 2022
Viaarxiv icon

User-Driven Research of Medical Note Generation Software

Add code
May 06, 2022
Figure 1 for User-Driven Research of Medical Note Generation Software
Figure 2 for User-Driven Research of Medical Note Generation Software
Figure 3 for User-Driven Research of Medical Note Generation Software
Figure 4 for User-Driven Research of Medical Note Generation Software
Viaarxiv icon

Quantified Reproducibility Assessment of NLP Results

Add code
Apr 12, 2022
Figure 1 for Quantified Reproducibility Assessment of NLP Results
Figure 2 for Quantified Reproducibility Assessment of NLP Results
Figure 3 for Quantified Reproducibility Assessment of NLP Results
Figure 4 for Quantified Reproducibility Assessment of NLP Results
Viaarxiv icon