Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Weimao Ke

Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Nov 01, 2024

Lixiao Yang, Mengyang Xu, Weimao Ke

Figure 1 for Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Figure 2 for Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Figure 3 for Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Figure 4 for Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Abstract:Question-answering (QA) is an important application of Information Retrieval (IR) and language models, and the latest trend is toward pre-trained large neural networks with embedding parameters. Augmenting QA performances with these LLMs requires intensive computational resources for fine-tuning. We propose an innovative approach to improve QA task performances by integrating optimized vector retrievals and instruction methodologies. Based on retrieval augmentation, the process involves document embedding, vector retrieval, and context construction for optimal QA results. We experiment with different combinations of text segmentation techniques and similarity functions, and analyze their impacts on QA performances. Results show that the model with a small chunk size of 100 without any overlap of the chunks achieves the best result and outperforms the models based on semantic segmentation using sentences. We discuss related QA examples and offer insight into how model performances are improved within the two-stage framework.

* 6 pages, 4 tables

Via

Access Paper or Ask Questions

Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Oct 01, 2024

Qingyang Li, Weimao Ke

Figure 1 for Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Figure 2 for Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Figure 3 for Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Figure 4 for Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Abstract:This paper examines the pivotal role of dropout techniques in mitigating overfitting in language model training. It conducts a comprehensive investigation into the influence of variable dropout rates on both individual layers and residual connections within the context of language modeling. Our study conducts training of a decoder implementation on the classic Tiny Shakespeare data to examine the effects of the adjustments on training efficiency and validation error. Results not only confirm the benefits of dropout for regularization and residuals for convergence, but also reveal their interesting interactions. There exists an important trade-off between the depth of residual connections and the dropout on these connections for optimal deep neural network convergence and generalization.

* 5 pages, 4 figures

Via

Access Paper or Ask Questions