Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jinyuan Wang

Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning

Oct 23, 2023

Jinyuan Wang, Junlong Li, Hai Zhao

Figure 1 for Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning

Figure 2 for Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning

Figure 3 for Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning

Figure 4 for Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning

Abstract:In open-domain question-answering (ODQA), most existing questions require single-hop reasoning on commonsense. To further extend this task, we officially introduce open-domain multi-hop reasoning (ODMR) by answering multi-hop questions with explicit reasoning steps in open-domain setting. Recently, large language models (LLMs) have found significant utility in facilitating ODQA without external corpus. Furthermore, chain-of-thought (CoT) prompting boosts the reasoning capability of LLMs to a greater extent with manual or automated paradigms. However, existing automated methods lack of quality assurance, while manual approaches suffer from limited scalability and poor diversity, hindering the capabilities of LLMs. In this paper, we propose Self-prompted Chain-of-Thought (SP-CoT), an automated framework to mass-produce high quality CoTs of LLMs, by LLMs and for LLMs. SP-CoT introduces an automated generation pipeline of high quality ODMR datasets, an adaptive sampler for in-context CoT selection and self-prompted inference via in-context learning. Extensive experiments on four multi-hop question-answering benchmarks show that our proposed SP-CoT not only significantly surpasses the previous SOTA methods on large-scale (175B) LLMs, but also nearly doubles the zero-shot performance of small-scale (13B) LLMs. Further analysis reveals the remarkable capability of SP-CoT to elicit direct and concise intermediate reasoning steps by recalling $\sim$50\% of intermediate answers on MuSiQue-Ans dataset.

* Accepted by Findings of EMNLP2023

Via

Access Paper or Ask Questions

CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

Sep 11, 2023

Jinyuan Wang, Hai Zhao, Zhong Wang, Zeyang Zhu, Jinhao Xie, Yong Yu, Yongjian Fei, Yue Huang, Dawei Cheng

Figure 1 for CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

Figure 2 for CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

Figure 3 for CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

Figure 4 for CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

Abstract:In recent years, great advances in pre-trained language models (PLMs) have sparked considerable research focus and achieved promising performance on the approach of dense passage retrieval, which aims at retrieving relative passages from massive corpus with given questions. However, most of existing datasets mainly benchmark the models with factoid queries of general commonsense, while specialised fields such as finance and economics remain unexplored due to the deficiency of large-scale and high-quality datasets with expert annotations. In this work, we propose a new task, policy retrieval, by introducing the Chinese Stock Policy Retrieval Dataset (CSPRD), which provides 700+ prospectus passages labeled by experienced experts with relevant articles from 10k+ entries in our collected Chinese policy corpus. Experiments on lexical, embedding and fine-tuned bi-encoder models show the effectiveness of our proposed CSPRD yet also suggests ample potential for improvement. Our best performing baseline achieves 56.1% MRR@10, 28.5% NDCG@10, 37.5% Recall@10 and 80.6% Precision@10 on dev set.

Via

Access Paper or Ask Questions

Representation Decoupling for Open-Domain Passage Retrieval

Oct 14, 2021

Bohong Wu, Zhuosheng Zhang, Jinyuan Wang, Hai Zhao

Figure 1 for Representation Decoupling for Open-Domain Passage Retrieval

Figure 2 for Representation Decoupling for Open-Domain Passage Retrieval

Figure 3 for Representation Decoupling for Open-Domain Passage Retrieval

Figure 4 for Representation Decoupling for Open-Domain Passage Retrieval

Abstract:Training dense passage representations via contrastive learning (CL) has been shown effective for Open-Domain Passage Retrieval (ODPR). Recent studies mainly focus on optimizing this CL framework by improving the sampling strategy or extra pretraining. Different from previous studies, this work devotes itself to investigating the influence of conflicts in the widely used CL strategy in ODPR, motivated by our observation that a passage can be organized by multiple semantically different sentences, thus modeling such a passage as a unified dense vector is not optimal. We call such conflicts Contrastive Conflicts. In this work, we propose to solve it with a representation decoupling method, by decoupling the passage representations into contextual sentence-level ones, and design specific CL strategies to mediate these conflicts. Experiments on widely used datasets including Natural Questions, Trivia QA, and SQuAD verify the effectiveness of our method, especially on the dataset where the conflicting problem is severe. Our method also presents good transferability across the datasets, which further supports our idea of mediating Contrastive Conflicts.

Via

Access Paper or Ask Questions