Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Huy Quoc To

Towards Efficient Large Language Models for Scientific Text: A Review

Aug 20, 2024

Huy Quoc To, Ming Liu, Guangyan Huang

Figure 1 for Towards Efficient Large Language Models for Scientific Text: A Review

Figure 2 for Towards Efficient Large Language Models for Scientific Text: A Review

Abstract:Large language models (LLMs) have ushered in a new era for processing complex information in various fields, including science. The increasing amount of scientific literature allows these models to acquire and understand scientific knowledge effectively, thus improving their performance in a wide range of tasks. Due to the power of LLMs, they require extremely expensive computational resources, intense amounts of data, and training time. Therefore, in recent years, researchers have proposed various methodologies to make scientific LLMs more affordable. The most well-known approaches align in two directions. It can be either focusing on the size of the models or enhancing the quality of data. To date, a comprehensive review of these two families of methods has not yet been undertaken. In this paper, we (I) summarize the current advances in the emerging abilities of LLMs into more accessible AI solutions for science, and (II) investigate the challenges and opportunities of developing affordable solutions for scientific domains using LLMs.

Via

Access Paper or Ask Questions

SKT5SciSumm - A Hybrid Generative Approach for Multi-Document Scientific Summarization

Feb 27, 2024

Huy Quoc To, Hung-Nghiep Tran, Andr'e Greiner-Petter, Felix Beierle, Akiko Aizawa

Figure 1 for SKT5SciSumm - A Hybrid Generative Approach for Multi-Document Scientific Summarization

Figure 2 for SKT5SciSumm - A Hybrid Generative Approach for Multi-Document Scientific Summarization

Figure 3 for SKT5SciSumm - A Hybrid Generative Approach for Multi-Document Scientific Summarization

Figure 4 for SKT5SciSumm - A Hybrid Generative Approach for Multi-Document Scientific Summarization

Abstract:Summarization for scientific text has shown significant benefits both for the research community and human society. Given the fact that the nature of scientific text is distinctive and the input of the multi-document summarization task is substantially long, the task requires sufficient embedding generation and text truncation without losing important information. To tackle these issues, in this paper, we propose SKT5SciSumm - a hybrid framework for multi-document scientific summarization (MDSS). We leverage the Sentence-Transformer version of Scientific Paper Embeddings using Citation-Informed Transformers (SPECTER) to encode and represent textual sentences, allowing for efficient extractive summarization using k-means clustering. We employ the T5 family of models to generate abstractive summaries using extracted sentences. SKT5SciSumm achieves state-of-the-art performance on the Multi-XScience dataset. Through extensive experiments and evaluation, we showcase the benefits of our model by using less complicated models to achieve remarkable results, thereby highlighting its potential in advancing the field of multi-document summarization for scientific text.

Via

Access Paper or Ask Questions

A Survey of Pre-trained Language Models for Processing Scientific Text

Jan 31, 2024

Xanh Ho, Anh Khoa Duong Nguyen, An Tuan Dao, Junfeng Jiang, Yuki Chida, Kaito Sugimoto, Huy Quoc To, Florian Boudin, Akiko Aizawa

Abstract:The number of Language Models (LMs) dedicated to processing scientific text is on the rise. Keeping pace with the rapid growth of scientific LMs (SciLMs) has become a daunting task for researchers. To date, no comprehensive surveys on SciLMs have been undertaken, leaving this issue unaddressed. Given the constant stream of new SciLMs, appraising the state-of-the-art and how they compare to each other remain largely unknown. This work fills that gap and provides a comprehensive review of SciLMs, including an extensive analysis of their effectiveness across different domains, tasks and datasets, and a discussion on the challenges that lie ahead.

* Resources are available at https://github.com/Alab-NII/Awesome-SciLM

Via

Access Paper or Ask Questions

Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

Oct 27, 2020

Huy Quoc To, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

Figure 1 for Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

Figure 2 for Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

Figure 3 for Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

Figure 4 for Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

Abstract:As biological gender is one of the aspects of presenting individual human, much work has been done on gender classification based on people names. The proposals for English and Chinese languages are tremendous; still, there have been few works done for Vietnamese so far. We propose a new dataset for gender prediction based on Vietnamese names. This dataset comprises over 26,000 full names annotated with genders. This dataset is available on our website for research purposes. In addition, this paper describes six machine learning algorithms (Support Vector Machine, Multinomial Naive Bayes, Bernoulli Naive Bayes, Decision Tree, Random Forrest and Logistic Regression) and a deep learning model (LSTM) with fastText word embedding for gender prediction on Vietnamese names. We create a dataset and investigate the impact of each name component on detecting gender. As a result, the best F1-score that we have achieved is up to 96\% on LSTM model and we generate a web API based on our trained model.

* 6 pages, 6 figures

Via

Access Paper or Ask Questions