IRISA / INRIA Rennes
Abstract: The design of dialogue systems has seen many advances lately, yet acquiring large sets of data remains a hindrance to their fast development for a new task or language. Moreover, training interactive systems with batch data is not satisfactory. On-line learning is pursued in this paper as a convenient way to alleviate these difficulties. After the system modules are initialized, a single process handles data collection, annotation and use in training algorithms. A new challenge is to control the cost of the on-line learning borne by the user. Our work focuses on learning the semantic parsing and dialogue management modules (speech recognition and synthesis offer ready-to-use solutions). In this context we investigate several variants of simultaneous learning, which are tested in user trials. In our experiments, with varying merits, they all achieve good performance with only a few hundred training dialogues and surpass a handcrafted system. The analysis of these experiments gives us some insights, discussed in the paper, into how difficult it is for the system's trainers to maintain a coherent and consistent behavioural strategy that enables a fast and good-quality training phase.
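To make the single-process idea concrete, here is a minimal Python sketch of such an on-line learning loop, assuming off-the-shelf ASR/TTS components and trainable parser and manager modules; every interface name (recognize, parse, act, collect_user_feedback, update) is a hypothetical placeholder, not the authors' actual implementation.

```python
# Minimal sketch of a single-process on-line learning loop: data collection,
# annotation and training happen dialogue by dialogue. All interfaces below
# are hypothetical placeholders.

def online_learning_loop(asr, semantic_parser, dialogue_manager, tts, n_dialogues=300):
    """Collect, annotate and train in one pass, one dialogue at a time."""
    for _ in range(n_dialogues):
        history = []
        while not dialogue_manager.is_final(history):
            user_turn = asr.recognize()                    # off-the-shelf ASR
            frame = semantic_parser.parse(user_turn)       # trainable module
            action = dialogue_manager.act(frame, history)  # trainable module
            tts.speak(action)                              # off-the-shelf TTS
            history.append((user_turn, frame, action))
        # The user, acting as trainer, corrects the parses and rates the policy;
        # both modules are updated immediately, keeping annotation cost low.
        corrections, reward = collect_user_feedback(history)
        semantic_parser.update(corrections)
        dialogue_manager.update(history, reward)
```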
Abstract: Multi-Sentence Compression (MSC) aims to generate a short sentence containing the key information from a cluster of similar sentences. MSC enables summarization and question-answering systems to produce outputs that combine fully formed sentences from one or several documents. This paper describes an Integer Linear Programming method for MSC that uses a vertex-labeled graph to select different keywords, with the goal of generating more informative sentences while maintaining their grammaticality. Our system achieves good quality and outperforms the state of the art in evaluations conducted on news datasets in three languages: French, Portuguese and Spanish. We conducted both automatic and manual evaluations to measure the informativeness and grammaticality of the compressions for each dataset. In additional tests, which take advantage of the fact that the length of the compressions can be modulated, we still improve ROUGE scores with shorter output sentences.
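As an illustration of the kind of ILP involved, the sketch below models keyword selection on a vertex-labeled graph with the pulp solver: keep at most a budget of vertices while covering as many distinct keyword labels as possible. This simplified formulation is a stand-in for, not a reproduction of, the paper's exact model.

```python
# Hedged ILP sketch: on a vertex-labeled word graph, choose vertices that
# cover as many distinct keyword labels as possible under a length budget.
from pulp import LpProblem, LpVariable, LpMaximize, LpBinary, lpSum

def select_keywords(vertices, labels, budget=10):
    """vertices: dict mapping each vertex to the set of keyword labels it carries."""
    prob = LpProblem("msc_keyword_cover", LpMaximize)
    x = {v: LpVariable(f"x_{v}", cat=LpBinary) for v in vertices}  # vertex kept?
    y = {l: LpVariable(f"y_{l}", cat=LpBinary) for l in labels}    # label covered?
    prob += lpSum(y.values())                    # maximize distinct keywords covered
    prob += lpSum(x.values()) <= budget          # compression length budget
    for l in labels:
        # a label counts as covered only if some kept vertex carries it
        prob += y[l] <= lpSum(x[v] for v in vertices if l in vertices[v])
    prob.solve()
    return [v for v in vertices if x[v].value() == 1]

# Toy usage with invented labels:
# select_keywords({"obama": {"PERSON"}, "visited": set(), "paris": {"LOC"}},
#                 labels={"PERSON", "LOC"}, budget=3)
```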
Abstract: Semantic Textual Similarity (STS) is the basis of many applications in Natural Language Processing (NLP). Our system combines convolutional and recurrent neural networks to measure the semantic similarity of sentences. It uses a convolutional network to take into account the local context of words and an LSTM to capture the global context of sentences. This combination of networks helps to preserve the relevant information of sentences and improves the computation of their similarity. Our model achieves good results and is competitive with the best state-of-the-art systems.
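The described combination can be sketched in a few lines of PyTorch: a Conv1d layer encodes the local context of words, an LSTM summarizes the global context of the sentence, and cosine similarity compares the two sentence vectors. The dimensions and the use of the final hidden state are illustrative assumptions, not the paper's exact architecture.

```python
# Sketch of a conv + LSTM sentence encoder for similarity scoring.
import torch
import torch.nn as nn

class ConvLSTMEncoder(nn.Module):
    def __init__(self, vocab_size=10000, emb_dim=100, hidden=128):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.conv = nn.Conv1d(emb_dim, hidden, kernel_size=3, padding=1)  # local context
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)             # global context

    def forward(self, tokens):                        # tokens: (batch, seq_len)
        e = self.emb(tokens).transpose(1, 2)          # (batch, emb_dim, seq_len)
        c = torch.relu(self.conv(e)).transpose(1, 2)  # (batch, seq_len, hidden)
        _, (h, _) = self.lstm(c)                      # final state summarizes the sentence
        return h[-1]                                  # (batch, hidden)

encoder = ConvLSTMEncoder()
s1 = torch.randint(0, 10000, (4, 20))  # dummy token ids for a batch of sentence pairs
s2 = torch.randint(0, 10000, (4, 20))
similarity = nn.functional.cosine_similarity(encoder(s1), encoder(s2), dim=1)
```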
Abstract: Cross-Language Text Summarization (CLTS) generates summaries in a language different from the language of the source documents. Recent methods use information from both languages to generate summaries containing the most informative sentences. However, the performance of these methods can vary across languages, which can reduce the quality of the summaries. In this paper, we propose a compressive framework to generate cross-language summaries. In order to analyze performance and, especially, stability, we tested our system and extractive baselines on a dataset available in four languages (English, French, Portuguese, and Spanish), generating English and French summaries. An automatic evaluation showed that our method outperformed state-of-the-art extractive CLTS methods, with better and more stable ROUGE scores across all languages.
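A schematic view of one such compressive CLTS pipeline is sketched below: translate, cluster similar sentences, compress each cluster, then greedily select compressions under a length budget. Every helper (translate, cluster_similar, compress_cluster, informativeness) is a hypothetical placeholder for the actual components.

```python
# Schematic compressive cross-language summarization pipeline; all helper
# functions are hypothetical placeholders, not the paper's implementation.

def compressive_clts(source_docs, target_lang="en", budget_words=100):
    translated = [translate(s, target_lang) for doc in source_docs for s in doc]
    clusters = cluster_similar(translated)                 # groups of similar sentences
    candidates = [compress_cluster(c) for c in clusters]   # one compression per cluster
    # greedy selection of the most informative compressions under the budget
    candidates.sort(key=informativeness, reverse=True)
    summary, used = [], 0
    for sent in candidates:
        if used + len(sent.split()) <= budget_words:
            summary.append(sent)
            used += len(sent.split())
    return " ".join(summary)
```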
Abstract: Despite many recent advances in the design of dialogue systems, a true bottleneck remains the acquisition of the data required to train their components. Unlike many other language processing applications, dialogue systems require interactions with users, so it is difficult to develop them with pre-recorded data. Building on previous work, on-line learning is pursued here as the most convenient way to address the issue. Data collection, annotation and use in learning algorithms are performed in a single process. The main difficulties are then: to bootstrap an initial basic system, and to control the level of additional cost on the user side. Considering that well-performing solutions can be used directly off the shelf for speech recognition and synthesis, the study focuses on learning the spoken language understanding and dialogue management modules only. Several variants of joint learning are investigated and tested in user trials to confirm that overall on-line learning can be achieved after only a few hundred training dialogues and can surpass an expert-based system.
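As one possible instantiation of on-line dialogue management learning, the sketch below uses tabular Q-learning with an epsilon-greedy policy, updated after each dialogue from a success/failure reward; the state representation, action set and learner are assumptions for illustration, not the system described in the abstract.

```python
# Toy on-line dialogue policy learner: tabular Q-learning updated per dialogue.
import random
from collections import defaultdict

ACTIONS = ["ask_slot", "confirm", "inform", "close"]  # hypothetical action set
Q = defaultdict(float)                                # Q[(state, action)] -> value
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.2

def choose_action(state):
    if random.random() < EPSILON:                     # explore during on-line training
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])

def update_from_dialogue(turns, final_reward):
    """turns: list of (state, action, next_state); reward arrives at dialogue end."""
    for i, (s, a, s_next) in enumerate(turns):
        r = final_reward if i == len(turns) - 1 else 0.0
        best_next = max(Q[(s_next, a2)] for a2 in ACTIONS)
        Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])
```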
Abstract: The Internet has led to a dramatic increase in the amount of available information. In this context, reading and understanding this flow of information have become costly tasks. In recent years, to help people understand textual data, various Natural Language Processing (NLP) applications based on Combinatorial Optimization have been devised. However, for Multi-Sentence Compression (MSC), a task that reduces sentence length without removing core information, the use of optimization methods requires further study to improve MSC performance. This article describes a method for MSC that uses Combinatorial Optimization and Graph Theory to generate more informative sentences while maintaining their grammaticality. An experiment conducted on a corpus of 40 clusters of sentences shows that our system achieves very good quality and outperforms the state of the art.
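In the spirit of word-graph approaches to MSC (e.g., Filippova, 2010), the sketch below merges the words of similar sentences into one directed graph and treats compressions as low-cost start-to-end paths, using networkx; the inverse-frequency edge cost and the merging of identical surface forms are simplifying assumptions (real systems disambiguate repeated words by position or part of speech).

```python
# Word-graph sketch for MSC: merge words across sentences, then take cheap paths.
import networkx as nx
from itertools import islice

def build_word_graph(sentences):
    """Merge identical words of similar sentences into one directed graph."""
    G = nx.DiGraph()
    for sent in sentences:
        tokens = ["<start>"] + sent.lower().split() + ["<end>"]
        for u, v in zip(tokens, tokens[1:]):
            if G.has_edge(u, v):
                G.edges[u, v]["count"] += 1
            else:
                G.add_edge(u, v, count=1)
    for u, v, d in G.edges(data=True):
        d["cost"] = 1.0 / d["count"]   # frequent transitions become cheap edges
    return G

def compress(sentences, k=3):
    G = build_word_graph(sentences)
    paths = nx.shortest_simple_paths(G, "<start>", "<end>", weight="cost")
    return [" ".join(p[1:-1]) for p in islice(paths, k)]  # k cheapest compressions
```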
Abstract: This year, the DEFT campaign (Défi Fouille de Textes) includes a task which aims at identifying the session in which articles from previous TALN conferences were presented. We describe the three statistical systems developed at LIA/ADOC for this task. A fusion of these systems enables us to obtain interesting results (micro-precision score of 0.76 measured on the test corpus).
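A plausible reading of such a fusion is simple voting with a reliability-based tie-break, followed by micro-precision scoring; the sketch below illustrates this scheme without reproducing the three systems themselves, and its tie-break rule is an assumption.

```python
# Toy fusion of several classifiers' session labels plus micro-precision scoring.
from collections import Counter

def fuse(predictions):
    """predictions: list of per-system label lists, ordered by system reliability."""
    fused = []
    for votes in zip(*predictions):
        top, n = Counter(votes).most_common(1)[0]
        # on a full tie, trust the first (most reliable) system
        fused.append(top if n > 1 else votes[0])
    return fused

def micro_precision(pred, gold):
    return sum(p == g for p, g in zip(pred, gold)) / len(gold)
```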
Abstract: To transcribe speech, automatic speech recognition systems use statistical methods, particularly hidden Markov models and N-gram models. Although these techniques perform well and lead to efficient systems, they are approaching their limits. It thus seems necessary, in order to improve on current results, to use additional information, especially information bound to language. However, introducing such knowledge must take into account the specificities of spoken language (hesitations, for example) and be robust to possibly misrecognized words. This document presents a survey of this research, evaluating the impact of inserting linguistic information on the quality of the transcription.
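One classical way such linguistic information enters transcription, shown schematically below, is to rescore an N-best list of hypotheses from the acoustic decoding with an n-gram language model; the bigram probabilities and interpolation weight here are toy placeholders.

```python
# N-best rescoring sketch: combine acoustic scores with bigram LM log-probabilities.
import math

def lm_logprob(words, bigram_prob, floor=1e-6):
    """Sum of log bigram probabilities, with a floor for unseen word pairs."""
    return sum(math.log(bigram_prob.get((u, v), floor))
               for u, v in zip(words, words[1:]))

def rescore(nbest, bigram_prob, lm_weight=0.8):
    """nbest: list of (acoustic_score, hypothesis) pairs; returns best hypothesis."""
    return max(nbest,
               key=lambda h: h[0] + lm_weight * lm_logprob(h[1].split(), bigram_prob))[1]
```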