Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yi-Li Hsu

Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023

Jan 31, 2025

Ting-Yao E. Hsu, Yi-Li Hsu, Shaurya Rohatgi, Chieh-Yang Huang, Ho Yin Sam Ng, Ryan Rossi, Sungchul Kim, Tong Yu, Lun-Wei Ku, C. Lee Giles(+1 more)

Abstract:Since the SCICAP datasets launch in 2021, the research community has made significant progress in generating captions for scientific figures in scholarly articles. In 2023, the first SCICAP Challenge took place, inviting global teams to use an expanded SCICAP dataset to develop models for captioning diverse figure types across various academic fields. At the same time, text generation models advanced quickly, with many powerful pre-trained large multimodal models (LMMs) emerging that showed impressive capabilities in various vision-and-language tasks. This paper presents an overview of the first SCICAP Challenge and details the performance of various models on its data, capturing a snapshot of the fields state. We found that professional editors overwhelmingly preferred figure captions generated by GPT-4V over those from all other models and even the original captions written by authors. Following this key finding, we conducted detailed analyses to answer this question: Have advanced LMMs solved the task of generating captions for scientific figures?

* Accepted to TACL 2025

Via

Access Paper or Ask Questions

Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term

Oct 26, 2023

Yi-Li Hsu, Shih-Chieh Dai, Aiping Xiong, Lun-Wei Ku

Figure 1 for Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term

Figure 2 for Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term

Figure 3 for Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term

Figure 4 for Is Explanation the Cure? Misinformation Mitigation in the Short Term and Long Term

Abstract:With advancements in natural language processing (NLP) models, automatic explanation generation has been proposed to mitigate misinformation on social media platforms in addition to adding warning labels to identified fake news. While many researchers have focused on generating good explanations, how these explanations can really help humans combat fake news is under-explored. In this study, we compare the effectiveness of a warning label and the state-of-the-art counterfactual explanations generated by GPT-4 in debunking misinformation. In a two-wave, online human-subject study, participants (N = 215) were randomly assigned to a control group in which false contents are shown without any intervention, a warning tag group in which the false claims were labeled, or an explanation group in which the false contents were accompanied by GPT-4 generated explanations. Our results show that both interventions significantly decrease participants' self-reported belief in fake claims in an equivalent manner for the short-term and long-term. We discuss the implications of our findings and directions for future NLP-based misinformation debunking strategies.

* EMNLP Findings 2023

Via

Access Paper or Ask Questions

Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification

Jun 26, 2023

Chih-Yao Chen, Tun-Min Hung, Yi-Li Hsu, Lun-Wei Ku

Figure 1 for Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification

Figure 2 for Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification

Figure 3 for Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification

Figure 4 for Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification

Abstract:Fine-grained emotion classification (FEC) is a challenging task. Specifically, FEC needs to handle subtle nuance between labels, which can be complex and confusing. Most existing models only address text classification problem in the euclidean space, which we believe may not be the optimal solution as labels of close semantic (e.g., afraid and terrified) may not be differentiated in such space, which harms the performance. In this paper, we propose HypEmo, a novel framework that can integrate hyperbolic embeddings to improve the FEC task. First, we learn label embeddings in the hyperbolic space to better capture their hierarchical structure, and then our model projects contextualized representations to the hyperbolic space to compute the distance between samples and labels. Experimental results show that incorporating such distance to weight cross entropy loss substantially improves the performance with significantly higher efficiency. We evaluate our proposed model on two benchmark datasets and found 4.8% relative improvement compared to the previous state of the art with 43.2% fewer parameters and 76.9% less training time. Code is available at https: //github.com/dinobby/HypEmo.

* Accepted to ACL 2023

Via

Access Paper or Ask Questions

Ask to Know More: Generating Counterfactual Explanations for Fake Claims

Jun 14, 2022

Shih-Chieh Dai, Yi-Li Hsu, Aiping Xiong, Lun-Wei Ku

Figure 1 for Ask to Know More: Generating Counterfactual Explanations for Fake Claims

Figure 2 for Ask to Know More: Generating Counterfactual Explanations for Fake Claims

Figure 3 for Ask to Know More: Generating Counterfactual Explanations for Fake Claims

Figure 4 for Ask to Know More: Generating Counterfactual Explanations for Fake Claims

Abstract:Automated fact checking systems have been proposed that quickly provide veracity prediction at scale to mitigate the negative influence of fake news on people and on public opinion. However, most studies focus on veracity classifiers of those systems, which merely predict the truthfulness of news articles. We posit that effective fact checking also relies on people's understanding of the predictions. In this paper, we propose elucidating fact checking predictions using counterfactual explanations to help people understand why a specific piece of news was identified as fake. In this work, generating counterfactual explanations for fake news involves three steps: asking good questions, finding contradictions, and reasoning appropriately. We frame this research question as contradicted entailment reasoning through question answering (QA). We first ask questions towards the false claim and retrieve potential answers from the relevant evidence documents. Then, we identify the most contradictory answer to the false claim by use of an entailment classifier. Finally, a counterfactual explanation is created using a matched QA pair with three different counterfactual explanation forms. Experiments are conducted on the FEVER dataset for both system and human evaluations. Results suggest that the proposed approach generates the most helpful explanations compared to state-of-the-art methods.

* 11 pages

Via

Access Paper or Ask Questions