Shumpei Inoue

Towards Safer Operations: An Expert-involved Dataset of High-Pressure Gas Incidents for Preventing Future Failures

Oct 23, 2023

Shumpei Inoue, Minh-Tien Nguyen, Hiroki Mizokuchi, Tuan-Anh D. Nguyen, Huu-Hiep Nguyen, Dung Tien Le

Abstract: This paper introduces a new IncidentAI dataset for safety prevention. Unlike prior corpora, which usually cover a single task, our dataset comprises three tasks: named entity recognition, cause-effect extraction, and information retrieval. The dataset is annotated by domain experts who have at least six years of practical experience as high-pressure gas conservation managers. We validate the contribution of the dataset in the scenario of safety prevention. Preliminary results on the three tasks show that NLP techniques are beneficial for analyzing incident reports to prevent future failures. The dataset facilitates future research in the NLP and incident management communities. The IncidentAI dataset is available at https://github.com/Cinnamon/incident-ai-dataset.

* Accepted by EMNLP 2023 (The Industry Track)

Meeting Decision Tracker: Making Meeting Minutes with De-Contextualized Utterances

Oct 20, 2022

Shumpei Inoue, Hy Nguyen, Pham Viet Hoang, Tsungwei Liu, Minh-Tien Nguyen

Abstract: Meetings are a universal process for making decisions in business and project collaboration. The capability to automatically itemize the decisions made in daily meetings allows for extensive tracking of past discussions. To that end, we developed Meeting Decision Tracker, a prototype system that constructs decision items and comprises a decision utterance detector (DUD) and a decision utterance rewriter (DUR). We show that DUR makes a sizable contribution to improving the user experience by dealing with utterance collapse in natural conversation. An introduction video of our system is available at https://youtu.be/TG1pJJo0Iqo.

* 7 pages, AACL-IJCNLP 2022

Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation

Apr 08, 2022

Shumpei Inoue, Tsungwei Liu, Nguyen Hong Son, Minh-Tien Nguyen

Abstract: This paper introduces a model for incomplete utterance restoration (IUR). Unlike prior studies that work only on either extraction or abstraction datasets, we design a simple but effective model that works in both IUR scenarios. Our design simulates the nature of IUR, where tokens omitted from the context contribute to restoration. Based on this, we construct a Picker that identifies the omitted tokens. To support the Picker, we design two label creation methods (soft and hard labels), which work even when the omitted tokens are not annotated. The restoration is done by a Generator, trained jointly with the Picker. Promising results on four benchmark datasets in extraction and abstraction scenarios show that our model outperforms the pretrained T5 and non-generative language model methods in both rich and limited training data settings. The code will also be made available.

* This is the early version of the paper accepted by NAACL 2022 (10 pages, 2 figures)
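
The abstract above mentions creating labels for omitted tokens when no annotation is available. As a rough illustration only, and not the authors' actual method, one simple heuristic for a "hard" label is to mark a context token as omitted if it appears in the restored utterance but not in the incomplete one. The function name `hard_labels` and the example sentences are hypothetical.

```python
def hard_labels(context_tokens, incomplete_tokens, restored_tokens):
    """Return a 0/1 label per context token.

    Assumption (not taken from the paper): a context token counts as
    "omitted" when it occurs in the restored utterance but not in the
    incomplete utterance.
    """
    missing = set(restored_tokens) - set(incomplete_tokens)
    return [1 if tok in missing else 0 for tok in context_tokens]


# Hypothetical example dialogue, tokenized by whitespace.
context = "do you like the new phone ?".split()
incomplete = "yes , i like it".split()
restored = "yes , i like the new phone".split()

labels = hard_labels(context, incomplete, restored)
print(list(zip(context, labels)))
```

A soft-label variant would presumably assign graded scores instead of 0/1 values; the abstract does not say how, so it is not sketched here.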
