Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Congrui Yin

PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant

Feb 20, 2025

Congrui Yin, Evan Wei, Zhongxing Zhang, Zaifu Zhan

Abstract:In the paper, we introduce a paper reading assistant, PaperHelper, a potent tool designed to enhance the capabilities of researchers in efficiently browsing and understanding scientific literature. Utilizing the Retrieval-Augmented Generation (RAG) framework, PaperHelper effectively minimizes hallucinations commonly encountered in large language models (LLMs), optimizing the extraction of accurate, high-quality knowledge. The implementation of advanced technologies such as RAFT and RAG Fusion significantly boosts the performance, accuracy, and reliability of the LLMs-based literature review process. Additionally, PaperHelper features a user-friendly interface that facilitates the batch downloading of documents and uses the Mermaid format to illustrate structural relationships between documents. Experimental results demonstrate that PaperHelper, based on a fine-tuned GPT-4 API, achieves an F1 Score of 60.04, with a latency of only 5.8 seconds, outperforming the basic RAG model by 7\% in F1 Score.

Via

Access Paper or Ask Questions

IAPT: Instruction-Aware Prompt Tuning for Large Language Models

May 28, 2024

Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie

Figure 1 for IAPT: Instruction-Aware Prompt Tuning for Large Language Models

Figure 2 for IAPT: Instruction-Aware Prompt Tuning for Large Language Models

Figure 3 for IAPT: Instruction-Aware Prompt Tuning for Large Language Models

Figure 4 for IAPT: Instruction-Aware Prompt Tuning for Large Language Models

Abstract:Soft prompt tuning is a widely studied parameter-efficient fine-tuning method. However, it has a clear drawback: many soft tokens must be inserted into the input sequences to guarantee downstream performance. As a result, soft prompt tuning is less considered than Low-rank adaptation (LoRA) in the large language modeling (LLM) era. In this work, we propose a novel prompt tuning method, Instruction-Aware Prompt Tuning (IAPT), that requires only four soft tokens. First, we install a parameter-efficient soft prompt generator at each Transformer layer to generate idiosyncratic soft prompts for each input instruction. The generated soft prompts can be seen as a semantic summary of the input instructions and can effectively guide the output generation. Second, the soft prompt generators are modules with a bottleneck architecture consisting of a self-attention pooling operation, two linear projections, and an activation function. Pilot experiments show that prompt generators at different Transformer layers require different activation functions. Thus, we propose to learn the idiosyncratic activation functions for prompt generators automatically with the help of rational functions. We have conducted experiments on various tasks, and the experimental results demonstrate that (a) our IAPT method can outperform the recent baselines with comparable tunable parameters. (b) Our IAPT method is more efficient than LoRA under the single-backbone multi-tenant setting.

* Accepted by ACL-2024

Via

Access Paper or Ask Questions

F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

May 21, 2023

Xiangxiang Gao, Wei Zhu, Jiasheng Gao, Congrui Yin

Figure 1 for F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

Figure 2 for F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

Figure 3 for F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

Figure 4 for F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

Abstract:Computational complexity and overthinking problems have become the bottlenecks for pre-training language models (PLMs) with millions or even trillions of parameters. A Flexible-Patience-Based Early Exiting method (F-PABEE) has been proposed to alleviate the problems mentioned above for single-label classification (SLC) and multi-label classification (MLC) tasks. F-PABEE makes predictions at the classifier and will exit early if predicted distributions of cross-layer are consecutively similar. It is more flexible than the previous state-of-the-art (SOTA) early exiting method PABEE because it can simultaneously adjust the similarity score thresholds and the patience parameters. Extensive experiments show that: (1) F-PABEE makes a better speedup-accuracy balance than existing early exiting strategies on both SLC and MLC tasks. (2) F-PABEE achieves faster inference and better performances on different PLMs such as BERT and ALBERT. (3) F-PABEE-JSKD performs best for F-PABEE with different similarity measures.

* accepted by ICASSP-2023

Via

Access Paper or Ask Questions