Koki Noda

QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond

Apr 29, 2024

Tomoki Fukuma, Koki Noda, Toshihide Ubukata, Kousuke Hoso, Yoshiharu Ichikawa, Kyosuke Kambe, Yu Masubuch, Fujio Toriumi

Abstract: The proliferation of social media has led to information overload and increased interest in opinion mining. We propose "Question-Answering Network Analysis" (QANA), a novel opinion mining framework that uses Large Language Models (LLMs) to generate questions from users' comments, constructs a bipartite graph based on the comments' answerability to the questions, and applies centrality measures to examine the importance of opinions. We investigate the impact of question generation styles, LLM selections, and the choice of embedding model on the quality of the constructed QA networks by comparing them with annotated Key Point Analysis datasets. In a zero-shot setting, QANA performs comparably to previous state-of-the-art supervised models on the Key Point Matching task while reducing the computational cost from quadratic to linear. For Key Point Generation, questions with high PageRank or degree centrality align well with manually annotated key points. Notably, QANA lets analysts assess the importance of key points from different aspects depending on the centrality measure they select. QANA's primary contribution lies in its flexibility to extract key points from a wide range of perspectives, which enhances the quality and impartiality of opinion mining.

* Under review as a conference paper at COLM 2024
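The graph step described in the abstract above (comments linked to the questions they can answer, then ranked by centrality) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy `answers` mapping, the node names, and the helper functions are all hypothetical, and the answerability links are hard-coded here rather than derived from an LLM and an embedding model as in the paper.

```python
from collections import defaultdict


def build_bipartite(answers):
    """Build an undirected comment-question graph.

    answers: dict mapping a comment id to the question ids it can answer
    (hypothetical toy input; comment and question ids must not collide).
    """
    adj = defaultdict(set)
    for comment, questions in answers.items():
        adj[comment]  # make sure comments with no answers still appear
        for q in questions:
            adj[comment].add(q)
            adj[q].add(comment)
    return adj


def degree_centrality(adj, nodes):
    """Raw degree: how many comments can answer each question."""
    return {n: len(adj[n]) for n in nodes}


def pagerank(adj, damping=0.85, iterations=100):
    """Plain power-iteration PageRank on the undirected graph."""
    nodes = list(adj)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by isolated nodes is spread uniformly over all nodes.
        dangling = sum(rank[v] for v in nodes if not adj[v])
        new_rank = {}
        for v in nodes:
            incoming = sum(rank[u] / len(adj[u]) for u in adj[v])
            new_rank[v] = (1 - damping) / n + damping * (incoming + dangling / n)
        rank = new_rank
    return rank


# Toy data (assumption): which comment can answer which generated question.
answers = {
    "c1": ["q_cost", "q_safety"],
    "c2": ["q_cost"],
    "c3": ["q_cost", "q_privacy"],
    "c4": ["q_safety"],
}
questions = ["q_cost", "q_safety", "q_privacy"]

graph = build_bipartite(answers)
degrees = degree_centrality(graph, questions)
ranks = pagerank(graph)
by_pagerank = sorted(questions, key=lambda q: ranks[q], reverse=True)

print("degree:", degrees)
print("questions by PageRank:", by_pagerank)
```

Swapping the scoring function (degree versus PageRank here) changes which questions surface as candidate key points, which is the kind of analyst-side choice the abstract refers to.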

Beyond Real-world Benchmark Datasets: An Empirical Study of Node Classification with GNNs

Jun 18, 2022

Seiji Maekawa, Koki Noda, Yuya Sasaki, Makoto Onizuka

Abstract: Graph Neural Networks (GNNs) have achieved great success on node classification tasks. Despite the broad interest in developing and evaluating GNNs, they have been assessed on a limited set of benchmark datasets. As a result, existing evaluations of GNNs lack fine-grained analysis across various graph characteristics. Motivated by this, we conduct extensive experiments with a synthetic graph generator that can generate graphs with controlled characteristics for fine-grained analysis. Our empirical studies clarify the strengths and weaknesses of GNNs with respect to four major characteristics of real-world graphs with class labels of nodes, i.e., 1) class size distributions (balanced vs. imbalanced), 2) edge connection proportions between classes (homophilic vs. heterophilic), 3) attribute values (biased vs. random), and 4) graph sizes (small vs. large). In addition, to foster future research on GNNs, we publicly release our codebase, which allows users to evaluate various GNNs on various graphs. We hope this work offers interesting insights for future research.

* 17 pages, 10 figures
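To make the idea of "controlled characteristics" in the abstract above concrete, here is a minimal sketch of a labeled graph generator with two knobs: class sizes and the share of same-class edges. It is a simplified stand-in written for illustration, not the generator used in the paper; the function names and parameters (`generate_graph`, `homophily`, `num_edges`) are hypothetical, and node attributes are omitted.

```python
import random


def generate_graph(class_sizes, homophily, num_edges, seed=0):
    """Sample an undirected labeled graph.

    class_sizes: number of nodes per class (balanced or imbalanced).
    homophily:   probability that a sampled edge joins two same-class nodes.
    num_edges:   target number of distinct edges.
    """
    rng = random.Random(seed)
    labels = [c for c, size in enumerate(class_sizes) for _ in range(size)]
    num_nodes = len(labels)
    by_class = {
        c: [i for i in range(num_nodes) if labels[i] == c]
        for c in range(len(class_sizes))
    }
    edges = set()
    attempts = 0
    # Cap the number of attempts so impossible settings cannot loop forever.
    while len(edges) < num_edges and attempts < 100 * num_edges:
        attempts += 1
        u = rng.randrange(num_nodes)
        if rng.random() < homophily:
            candidates = by_class[labels[u]]
        else:
            candidates = [v for v in range(num_nodes) if labels[v] != labels[u]]
        if not candidates:
            continue
        v = rng.choice(candidates)
        if u != v:
            edges.add((min(u, v), max(u, v)))
    return labels, sorted(edges)


def edge_homophily(labels, edges):
    """Fraction of edges whose endpoints share a class label."""
    same = sum(labels[u] == labels[v] for u, v in edges)
    return same / len(edges)


# Imbalanced classes (30 vs. 10 nodes) at three homophily settings.
for h in (1.0, 0.5, 0.0):
    labels, edges = generate_graph([30, 10], homophily=h, num_edges=100, seed=0)
    print(h, len(edges), round(edge_homophily(labels, edges), 2))
```

Sweeping `class_sizes` and `homophily` while holding the rest fixed gives a family of graphs on which a classifier's behavior can be compared one characteristic at a time.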
