Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shushu Wang

Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Oct 23, 2023

Libo Zhao, Kai Fan, Wei Luo, Jing Wu, Shushu Wang, Ziqian Zeng, Zhongqiang Huang

Figure 1 for Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Figure 2 for Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Figure 3 for Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Figure 4 for Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Abstract:Simultaneous machine translation (SiMT) requires a robust read/write policy in conjunction with a high-quality translation model. Traditional methods rely on either a fixed wait-$k$ policy coupled with a standalone wait-$k$ translation model, or an adaptive policy jointly trained with the translation model. In this study, we propose a more flexible approach by decoupling the adaptive policy model from the translation model. Our motivation stems from the observation that a standalone multi-path wait-$k$ model performs competitively with adaptive policies utilized in state-of-the-art SiMT approaches. Specifically, we introduce DaP, a divergence-based adaptive policy, that makes read/write decisions for any translation model based on the potential divergence in translation distributions resulting from future information. DaP extends a frozen wait-$k$ model with lightweight parameters, and is both memory and computation efficient. Experimental results across various benchmarks demonstrate that our approach offers an improved trade-off between translation accuracy and latency, outperforming strong baselines.

* Accept to EMNLP 2023 main conference. 17 pages, 12 figures, 5 tables

Via

Access Paper or Ask Questions

A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Nov 11, 2021

Jianyun Zou, Min Yang, Lichao Zhang, Yechen Xu, Qifan Pan, Fengqing Jiang, Ran Qin, Shushu Wang, Yifan He, Songfang Huang(+1 more)

Figure 1 for A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Figure 2 for A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Figure 3 for A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Figure 4 for A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Abstract:Complex Knowledge Base Question Answering is a popular area of research in the past decade. Recent public datasets have led to encouraging results in this field, but are mostly limited to English and only involve a small number of question types and relations, hindering research in more realistic settings and in languages other than English. In addition, few state-of-the-art KBQA models are trained on Wikidata, one of the most popular real-world knowledge bases. We propose CLC-QuAD, the first large scale complex Chinese semantic parsing dataset over Wikidata to address these challenges. Together with the dataset, we present a text-to-SPARQL baseline model, which can effectively answer multi-type complex questions, such as factual questions, dual intent questions, boolean questions, and counting questions, with Wikidata as the background knowledge. We finally analyze the performance of SOTA KBQA models on this dataset and identify the challenges facing Chinese KBQA.

* 8 pages

Via

Access Paper or Ask Questions