Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Diego Molla-Aliod

Synthetic Dialogue Dataset Generation using LLM Agents

Jan 30, 2024

Yelaman Abdullin, Diego Molla-Aliod, Bahadorreza Ofoghi, John Yearwood, Qingyang Li

Abstract:Linear programming (LP) problems are pervasive in real-life applications. However, despite their apparent simplicity, an untrained user may find it difficult to determine the linear model of their specific problem. We envisage the creation of a goal-oriented conversational agent that will engage in conversation with the user to elicit all information required so that a subsequent agent can generate the linear model. In this paper, we present an approach for the generation of sample dialogues that can be used to develop and train such a conversational agent. Using prompt engineering, we develop two agents that "talk" to each other, one acting as the conversational agent, and the other acting as the user. Using a set of text descriptions of linear problems from NL4Opt available to the user only, the agent and the user engage in conversation until the agent has retrieved all key information from the original problem description. We also propose an extrinsic evaluation of the dialogues by assessing how well the summaries generated by the dialogues match the original problem descriptions. We conduct human and automatic evaluations, including an evaluation approach that uses GPT-4 to mimic the human evaluation metrics. The evaluation results show an overall good quality of the dialogues, though research is still needed to improve the quality of the GPT-4 evaluation metrics. The resulting dialogues, including the human annotations of a subset, are available to the research community. The conversational agent used for the generation of the dialogues can be used as a baseline.

* GEM Workshop @ EMNLP 2023

Via

Access Paper or Ask Questions

On Extending Neural Networks with Loss Ensembles for Text Classification

Nov 14, 2017

Hamideh Hajiabadi, Diego Molla-Aliod, Reza Monsefi

Figure 1 for On Extending Neural Networks with Loss Ensembles for Text Classification

Figure 2 for On Extending Neural Networks with Loss Ensembles for Text Classification

Figure 3 for On Extending Neural Networks with Loss Ensembles for Text Classification

Figure 4 for On Extending Neural Networks with Loss Ensembles for Text Classification

Abstract:Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through the gradient propagation optimization method of the neural network. The approach is evaluated on several text classification datasets. We also evaluate its performance in various environments with several degrees of label noise. Experimental results indicate an improvement of the results and strong resilience against label noise in comparison with other methods.

* 5 pages, 5 tables, 1 figure. Camera-ready submitted to The 2017 Australasian Language Technology Association Workshop (ALTA 2017)

Via

Access Paper or Ask Questions

Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Aug 11, 2017

Diego Molla-Aliod

Figure 1 for Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Figure 2 for Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Figure 3 for Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Figure 4 for Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Abstract:Macquarie University's contribution to the BioASQ challenge (Task 5b Phase B) focused on the use of query-based extractive summarisation techniques for the generation of the ideal answers. Four runs were submitted, with approaches ranging from a trivial system that selected the first $n$ snippets, to the use of deep learning approaches under a regression framework. Our experiments and the ROUGE results of the five test batches of BioASQ indicate surprisingly good results for the trivial approach. Overall, most of our runs on the first three test batches achieved the best ROUGE-SU4 results in the challenge.

* Proceedings of the BioNLP 2017 Workshop (Vancouver, Canada), pages 67-75 (2017)
* As published in BioNLP2017. 9 pages, 5 figures, 4 tables

Via

Access Paper or Ask Questions