Abstract: Hallucination has been a popular topic in natural language generation (NLG). In real-world applications, unfaithful content can result in poor data quality or loss of trust from end users. Thus, it is crucial to fact-check before adopting NLG for production usage, which can be expensive if done manually. In this paper, we investigate automated faithfulness evaluation in guided NLG. We develop a rubric template and use large language models (LLMs) to score generations on quantifiable scales. We compare popular LLMs as well as widely adopted natural language inference (NLI) models in scoring quality and sensitivity. In addition, we develop methods to generate synthetic unfaithful data, as well as a heuristic to quantify the percentage of hallucination. Our results on four travel-domain industry datasets show that GPT-4 can provide accurate judgments and explanations on whether a source and a generation are factually consistent. Furthermore, we find that tuning NLI models on synthetic data can improve performance. Lastly, we present insights on the latency and cost of deploying such a system.
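A minimal sketch of the rubric-based scoring step described above, assuming a hypothetical `call_llm` helper that wraps whatever LLM API is in use; the rubric wording and the 1-5 scale are illustrative, not the paper's exact template.

```python
# Illustrative sketch of LLM-based faithfulness scoring with a rubric prompt.
# `call_llm` is a hypothetical wrapper around an LLM API (e.g., a chat endpoint);
# the rubric text and the 1-5 scale are assumptions, not the paper's template.
import json

RUBRIC_PROMPT = """You are a fact-checking assistant.
Given a SOURCE and a GENERATION, rate factual consistency on a 1-5 scale:
5 = fully supported by the source, 1 = mostly unsupported or contradictory.
Reply with a JSON object: {{"score": <1-5>, "explanation": "<one sentence>"}}.

SOURCE: {source}
GENERATION: {generation}"""

def score_faithfulness(source: str, generation: str, call_llm) -> dict:
    """Score one source/generation pair with the rubric prompt."""
    reply = call_llm(RUBRIC_PROMPT.format(source=source, generation=generation))
    return json.loads(reply)  # e.g., {"score": 2, "explanation": "..."}
```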
Abstract: Twitter can be viewed as a data source for Natural Language Processing (NLP) tasks. The continuously updating data streams on Twitter make it challenging to trace real-time topic evolution. In this paper, we propose a framework for modeling fuzzy transitions of topic clusters. We extend our previous work on crisp cluster transitions by incorporating fuzzy logic to enrich the underlying structures identified by the framework. We apply the methodology to both computer-generated clusters of nouns from tweets and human tweet annotations. The obtained fuzzy transitions are compared with the crisp transitions on both computer-generated clusters and human-labeled topic sets.
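As an illustration of what a fuzzy cluster transition could look like, the sketch below assigns each pair of clusters from consecutive time steps a membership degree based on normalized overlap; this overlap-based membership is an assumption for illustration, not necessarily the paper's exact fuzzy formulation.

```python
# Illustrative fuzzy transitions between topic clusters at consecutive time steps.
# Membership here is overlap normalized by the source cluster's size -- an
# assumption for illustration, not necessarily the paper's exact definition.

def fuzzy_transitions(clusters_t, clusters_t1):
    """Return {(i, j): membership} for cluster pairs at times t and t+1."""
    memberships = {}
    for i, a in enumerate(clusters_t):
        for j, b in enumerate(clusters_t1):
            overlap = len(set(a) & set(b))
            if overlap:
                memberships[(i, j)] = overlap / len(a)
    return memberships

# A crisp transition would keep only the argmax target per source cluster;
# the fuzzy version retains all nonzero membership degrees.
t0 = [["music", "concert", "band"], ["rain", "storm"]]
t1 = [["music", "festival"], ["storm", "flood", "rain"]]
print(fuzzy_transitions(t0, t1))  # {(0, 0): 0.333..., (1, 1): 1.0}
```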
Abstract: Twitter serves as a data source for many Natural Language Processing (NLP) tasks. Identifying topics on Twitter is challenging due to its continuously updating data stream. In this paper, we present an unsupervised graph-based framework to identify the evolution of sub-topics within two weeks of real-world Twitter data. We first employ the Markov Clustering Algorithm (MCL) with a node-removal method to identify optimal graph clusters from temporal Graph-of-Words (GoW) representations. Subsequently, we model the clustering transitions between the temporal graphs to identify topic evolution. Finally, the transition flows generated by the computational approach and by human annotations are compared to validate our framework.
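A minimal sketch of the first stage, assuming the `markov_clustering` package for MCL and a simple sliding-window co-occurrence GoW; the node-removal step and the temporal transition modeling are omitted, and the window size and inflation value are assumptions.

```python
# Illustrative pipeline: build a Graph-of-Words from tweets and cluster it with
# MCL via the `markov_clustering` package (assumed installed). Node removal and
# temporal transition modeling from the paper are omitted for brevity.
import itertools
import networkx as nx
import scipy.sparse as sp
import markov_clustering as mc

def graph_of_words(tweets, window=2):
    """Co-occurrence GoW: words are nodes, sliding-window co-occurrence adds edges."""
    g = nx.Graph()
    for tweet in tweets:
        tokens = tweet.lower().split()
        for i in range(len(tokens) - window + 1):
            for u, v in itertools.combinations(tokens[i:i + window], 2):
                if u != v:
                    w = g.get_edge_data(u, v, {"weight": 0})["weight"]
                    g.add_edge(u, v, weight=w + 1)
    return g

g = graph_of_words(["storm hits the coast", "coast storm warning issued"])
matrix = sp.csr_matrix(nx.to_numpy_array(g))
result = mc.run_mcl(matrix, inflation=2.0)   # run MCL on the adjacency matrix
clusters = mc.get_clusters(result)           # tuples of node indices
words = list(g.nodes())
print([[words[i] for i in c] for c in clusters])
```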
Abstract: Graph structures are powerful tools for modeling the relationships between textual elements. Graph-of-Words (GoW) has been adopted in many natural language tasks to encode associations between terms. However, GoW provides few document-level relationships in cases where the connections between documents are also essential. For identifying sub-events on social media such as Twitter, features at both the word and document level can be useful, as they supply different information about an event. We propose a hybrid Graph-of-Tweets (GoT) model which combines word- and document-level structures for modeling tweets. To compress the large amount of raw data, we propose a graph-merging method which utilizes FastText word embeddings to reduce the GoW. Furthermore, we present a novel method to construct the GoT from the reduced GoW and a Mutual Information (MI) measure. Finally, we identify maximal cliques to extract popular sub-events. Our model shows promising results in condensing lexical-level information and capturing keywords of sub-events.
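A sketch of the two graph steps under stated assumptions: word nodes are merged when their FastText embeddings are highly similar, then maximal cliques of the resulting graph serve as candidate sub-events. The 0.8 threshold, the toy corpus, and the MI-free merging rule are assumptions, not the paper's exact procedure.

```python
# Illustrative sketch: merge near-synonymous word nodes by FastText cosine
# similarity, then extract maximal cliques as candidate sub-events.
# The similarity threshold (0.8) and toy corpus are assumptions.
import itertools
import networkx as nx
from gensim.models import FastText

tweets = [["storm", "hits", "coast"], ["storms", "flood", "coast"]]
ft = FastText(tweets, vector_size=32, min_count=1, epochs=50)

def merge_similar_nodes(g, wv, threshold=0.8):
    """Contract pairs of word nodes whose embeddings exceed the threshold."""
    merged = True
    while merged:
        merged = False
        for u, v in itertools.combinations(list(g.nodes()), 2):
            if wv.similarity(u, v) >= threshold:
                g = nx.contracted_nodes(g, u, v, self_loops=False)
                merged = True
                break
    return g

g = nx.Graph([("storm", "coast"), ("storms", "flood"), ("flood", "coast")])
g = merge_similar_nodes(g, ft.wv)      # e.g., "storms" may fold into "storm"
sub_events = list(nx.find_cliques(g))  # maximal cliques as keyword groups
print(sub_events)
```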
Abstract: Spelling irregularities, now known as spelling mistakes, have existed for several centuries. As humans, we are able to understand most misspelled words based on their location in the sentence, perceived pronunciation, and context. Unlike humans, computer systems do not natively possess the contextual auto-completion that human brains are capable of. While many programs provide spelling-correction functionality, most do not take context into account. Moreover, Artificial Intelligence systems behave according to the data they are trained on. Since many current Natural Language Processing (NLP) systems are trained on grammatically correct text, they are vulnerable to adversarial examples, making correctly spelled input crucial. In this paper, we investigate how spelling errors can be corrected in context using the pre-trained language model BERT. We present two experiments, based on BERT and the edit-distance algorithm, for ranking and selecting candidate corrections. Our results demonstrate that, when combined properly, the contextual word embeddings of BERT and edit distance can effectively correct spelling errors.
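A minimal sketch of one way to combine the two signals, assuming the Hugging Face fill-mask pipeline: mask the suspect token, take BERT's candidates, and keep only those within a small edit distance of the observed misspelling. The specific combination rule (filter by distance, then take the highest-scoring survivor) is an assumption for illustration, not necessarily the paper's ranking scheme.

```python
# Illustrative BERT + edit-distance spelling correction: mask the suspect
# token, get BERT's fill-mask candidates, and rerank them by edit distance
# to the observed misspelling. The combination rule is an assumption.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[-1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def correct(sentence: str, typo: str, max_dist: int = 2) -> str:
    """Replace `typo` with the highest-scoring BERT candidate near it in edit distance."""
    masked = sentence.replace(typo, fill_mask.tokenizer.mask_token, 1)
    candidates = fill_mask(masked, top_k=50)  # sorted by BERT score, descending
    viable = [c for c in candidates if edit_distance(c["token_str"], typo) <= max_dist]
    return viable[0]["token_str"] if viable else typo

print(correct("I would like a cup of coffe.", "coffe"))  # likely "coffee"
```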