Abstract: This paper presents an empirical exploration of non-transitivity in perfect-information games, focusing on Xiangqi, a traditional Chinese board game comparable in game-tree complexity to chess and shogi. By analyzing over 10,000 records of human Xiangqi play, we show that the game's strategic structure contains both transitive and non-transitive elements. To address non-transitivity, we introduce the JiangJun algorithm, a novel combination of Monte-Carlo Tree Search (MCTS) and Policy Space Response Oracles (PSRO) designed to approximate a Nash equilibrium. We evaluate the algorithm empirically through a WeChat mini program, where it reaches Master level with a 99.41\% win rate against human players. The algorithm's effectiveness in overcoming non-transitivity is confirmed by multiple metrics, including relative population performance and visualization results. Our project site is available at \url{https://sites.google.com/view/jiangjun-site/}.
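The PSRO component can be illustrated with a short sketch: a population of policies is grown by repeatedly solving the empirical meta-game and training a best response against the resulting meta-strategy. This is a minimal sketch under our own assumptions: the `train_best_response` and `evaluate` hooks stand in for JiangJun's MCTS-based oracle and match simulation, and fictitious play stands in for a full meta-game solver.

\begin{verbatim}
import numpy as np

def approx_nash_fictitious_play(payoff, iters=2000):
    """Approximate the Nash meta-strategy of a two-player zero-sum
    meta-game with fictitious play on the empirical payoff matrix."""
    n, m = payoff.shape
    row_counts, col_counts = np.zeros(n), np.zeros(m)
    row_counts[0] = col_counts[0] = 1.0
    for _ in range(iters):
        # Each player best-responds to the opponent's empirical mixture.
        row_counts[np.argmax(payoff @ (col_counts / col_counts.sum()))] += 1
        col_counts[np.argmin((row_counts / row_counts.sum()) @ payoff)] += 1
    return row_counts / row_counts.sum(), col_counts / col_counts.sum()

def psro_loop(initial_policy, train_best_response, evaluate, generations=5):
    """Minimal PSRO skeleton: grow a policy population, solve the
    meta-game, and train a best response to the meta-strategy.
    `train_best_response` and `evaluate` are placeholder hooks."""
    population = [initial_policy]
    for _ in range(generations):
        payoff = np.array([[evaluate(p, q) for q in population]
                           for p in population])
        meta, _ = approx_nash_fictitious_play(payoff)
        population.append(train_best_response(meta, population))
    # Meta-strategy over the final population.
    payoff = np.array([[evaluate(p, q) for q in population]
                       for p in population])
    meta, _ = approx_nash_fictitious_play(payoff)
    return population, meta
\end{verbatim}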
Abstract: In this article, we propose a paraphrase generation technique that preserves the key phrases of source sentences during paraphrasing. We also develop a model called TAGPA based on this technique, which has multiple pre-configured or trainable key phrase detectors and a paraphrase generator. The paraphrase generator aims to keep the key phrases while increasing the diversity of the paraphrased sentences. The key phrases can be entities provided by the user, such as company names, people's names, and domain-specific terminology, or can be learned from a given dataset.
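As a minimal illustration of key-phrase preservation (a generic masking approach, not TAGPA's trainable detectors or generator), user-provided phrases can be protected with placeholder tokens before paraphrasing and restored afterwards; `paraphrase_model` below is a hypothetical stand-in for any seq2seq paraphraser.

\begin{verbatim}
def protect_key_phrases(sentence, key_phrases):
    """Replace user-provided key phrases with placeholders so a downstream
    paraphraser cannot rewrite them (generic sketch, not TAGPA itself)."""
    mapping = {}
    for i, phrase in enumerate(sorted(key_phrases, key=len, reverse=True)):
        token = f"<KP{i}>"
        if phrase in sentence:
            sentence = sentence.replace(phrase, token)
            mapping[token] = phrase
    return sentence, mapping

def restore_key_phrases(paraphrase, mapping):
    for token, phrase in mapping.items():
        paraphrase = paraphrase.replace(token, phrase)
    return paraphrase

masked, mapping = protect_key_phrases(
    "Acme Corp signed a contract with Jane Doe in Berlin.",
    ["Acme Corp", "Jane Doe"])
# restored = restore_key_phrases(paraphrase_model(masked), mapping)
\end{verbatim}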
Abstract: Pre-trained language models have achieved state-of-the-art results on various natural language processing tasks. Most of them are based on the Transformer architecture, which distinguishes tokens only by their position index in the input sequence. However, sentence index and paragraph index are also important for indicating a token's position within a document. We hypothesize that a text encoder with richer positional information can generate better contextual representations. To verify this, we propose a segment-aware BERT (SegaBERT) that replaces the token position embedding of the Transformer with a combination of paragraph-index, sentence-index, and token-index embeddings. We pre-train SegaBERT on the masked language modeling task of BERT without any auxiliary tasks. Experimental results show that our pre-trained model outperforms the original BERT model on various NLP tasks.
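A minimal PyTorch sketch of the segment-aware position embedding described above, in which paragraph-, sentence-, and token-index embeddings are summed; the vocabulary sizes and hidden size are illustrative assumptions, not SegaBERT's actual configuration.

\begin{verbatim}
import torch.nn as nn

class SegmentAwarePositionEmbedding(nn.Module):
    """Sum paragraph-, sentence-, and token-index embeddings in place of
    a single token position embedding (sizes are illustrative)."""
    def __init__(self, hidden_size=768, max_paragraphs=64,
                 max_sentences=128, max_tokens=512):
        super().__init__()
        self.paragraph_emb = nn.Embedding(max_paragraphs, hidden_size)
        self.sentence_emb = nn.Embedding(max_sentences, hidden_size)
        self.token_emb = nn.Embedding(max_tokens, hidden_size)

    def forward(self, paragraph_ids, sentence_ids, token_ids):
        # Each *_ids tensor has shape (batch, seq_len) and gives the
        # paragraph, sentence, and token index of every input position.
        return (self.paragraph_emb(paragraph_ids)
                + self.sentence_emb(sentence_ids)
                + self.token_emb(token_ids))
\end{verbatim}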
Abstract: The semantics of a text is conveyed not only by what is read, but also by what is not read. In this article, we study how implicit "not read" signals, such as end-of-paragraph (EOP) and end-of-sequence (EOS) markers, affect the quality of text generation. Transformer-based pretrained language models (LMs) have demonstrated the ability to generate long continuations of good quality, giving us a platform to demonstrate for the first time that paragraph layouts and text endings are also important components of human writing. Specifically, we find that pretrained LMs can generate better continuations by learning to generate the end-of-paragraph (EOP) marker during the fine-tuning stage. Experimental results on English story generation show that EOP leads to higher BLEU scores and lower EOS perplexity. To further investigate the relationship between text endings and EOP, we conduct experiments on a self-collected Chinese essay dataset with Chinese-GPT2, a character-level LM pre-trained without paragraph breaks or EOS markers. Experimental results show that Chinese-GPT2 generates better essay endings when given paragraph information. Experiments on both English stories and Chinese essays demonstrate that learning to end paragraphs benefits continuation generation with pretrained LMs.
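A minimal sketch of the fine-tuning-time preprocessing implied above: explicit end-of-paragraph and end-of-sequence markers are inserted so the LM learns to generate them. The token strings are assumptions, not the exact special tokens used in the paper.

\begin{verbatim}
EOP_TOKEN = "<EOP>"   # illustrative special-token strings
EOS_TOKEN = "<EOS>"

def add_layout_tokens(paragraphs):
    """Mark paragraph and sequence endings so a pretrained LM can learn
    to generate them during fine-tuning."""
    body = f" {EOP_TOKEN} ".join(p.strip() for p in paragraphs)
    return f"{body} {EOP_TOKEN} {EOS_TOKEN}"

print(add_layout_tokens(["Once upon a time, a story began.",
                         "The end came quickly."]))
\end{verbatim}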
Abstract: Techniques for automatically extracting important content elements from business documents such as contracts, statements, and filings have the potential to make business operations more efficient. This problem can be formulated as a sequence labeling task, and we demonstrate the adaptation of BERT to two types of business documents: regulatory filings and property lease agreements. Some aspects of this problem make it easier than "standard" information extraction tasks and others make it more difficult, but on balance we find that modest amounts of annotated data (fewer than 100 documents) are sufficient to achieve reasonable accuracy. We integrate our models into an end-to-end cloud platform that provides both an easy-to-use annotation interface and an inference interface that allows users to upload documents and inspect model outputs.
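The sequence-labeling formulation can be sketched with the Hugging Face transformers API; the BIO label set and example sentence below are illustrative assumptions, and the checkpoint will only produce meaningful predictions after fine-tuning on the annotated documents.

\begin{verbatim}
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

# Illustrative BIO labels for two content-element types.
labels = ["O", "B-PARTY", "I-PARTY", "B-TERM", "I-TERM"]

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=len(labels))

text = "This lease between Acme Corp and John Smith runs for 24 months."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits          # (1, seq_len, num_labels)
predicted = [labels[i] for i in logits.argmax(-1)[0].tolist()]
\end{verbatim}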
Abstract: Recently, a simple combination of passage retrieval using off-the-shelf IR techniques and a BERT reader was found to be very effective for question answering directly on Wikipedia, yielding a large improvement over the previous state of the art on a standard benchmark dataset. In this paper, we present a data augmentation technique using distant supervision that exploits both positive and negative examples. We apply a stage-wise approach to fine-tuning BERT on multiple datasets, starting with data that is "furthest" from the test data and ending with the "closest". Experimental results show large gains in effectiveness over previous approaches on English QA datasets, and we establish new baselines on two recent Chinese QA datasets.
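A sketch of the two ingredients described above: distantly supervised labeling that turns retrieved passages into positive and negative examples, and stage-wise fine-tuning over datasets ordered from "furthest" to "closest". This is a simplification under our own assumptions; field names and the `fine_tune` hook are placeholders, not the paper's implementation.

\begin{verbatim}
def distant_supervision_examples(question, answer, passages):
    """Split retrieved passages into positive examples (answer string
    present, with its span) and negative examples (answer absent)."""
    positives, negatives = [], []
    for passage in passages:
        start = passage.find(answer)
        if start >= 0:
            positives.append({"question": question, "passage": passage,
                              "answer": answer, "answer_start": start})
        else:
            negatives.append({"question": question, "passage": passage,
                              "answer": ""})   # "no answer" target
    return positives, negatives

def stage_wise_fine_tune(model, datasets_far_to_close, fine_tune):
    """Fine-tune sequentially on datasets ordered from 'furthest' to
    'closest' to the test data; `fine_tune` is a placeholder hook."""
    for dataset in datasets_far_to_close:
        model = fine_tune(model, dataset)
    return model
\end{verbatim}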
Abstract: We demonstrate an end-to-end question answering system that integrates BERT with the open-source Anserini information retrieval toolkit. In contrast to most question answering and reading comprehension models today, which operate over small amounts of input text, our system integrates best practices from IR with a BERT-based reader to identify answers from a large corpus of Wikipedia articles in an end-to-end fashion. We report large improvements over previous results on a standard benchmark test collection, showing that fine-tuning pretrained BERT with SQuAD is sufficient to achieve high accuracy in identifying answer spans.
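The retriever-reader pipeline can be sketched as follows; `search` stands in for an Anserini/Pyserini BM25 searcher returning passage texts (an assumption, not the toolkit's exact API), and the answer is simply the highest-scoring reader span, whereas the full system combines retriever and reader scores.

\begin{verbatim}
from transformers import pipeline

# SQuAD-fine-tuned BERT reader, as described in the abstract.
reader = pipeline(
    "question-answering",
    model="bert-large-uncased-whole-word-masking-finetuned-squad")

def answer(question, search, k=10):
    """Retrieve k passages with `search`, run the reader on each, and
    return the answer span with the highest reader score."""
    candidates = []
    for passage in search(question, k):
        pred = reader(question=question, context=passage)
        candidates.append((pred["score"], pred["answer"]))
    return max(candidates)[1]
\end{verbatim}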
Abstract: Neural conversational models tend to produce generic or safe responses in different contexts, e.g., replying \textit{"Of course"} to narrative statements or \textit{"I don't know"} to questions. In this paper, we propose an end-to-end approach to avoid this problem in neural generative models. Additional memory mechanisms are introduced into standard sequence-to-sequence (seq2seq) models so that context can be considered while generating sentences. Three seq2seq models, which memorize a fixed-size contextual vector from the hidden input, from the hidden input and output, and through a gated contextual attention structure, respectively, are trained and tested on a dataset of labeled question-answering pairs in Chinese. The model with contextual attention outperforms the others, including state-of-the-art seq2seq models, on a perplexity test. The novel contextual model generates diverse and robust responses and is able to carry out conversations on a wide range of topics appropriately.
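As an illustration of a gated contextual mechanism, the sketch below mixes a fixed-size contextual memory vector into the decoder state through a learned gate; this is a generic formulation under our own assumptions, not the exact architecture of the three models above.

\begin{verbatim}
import torch
import torch.nn as nn

class GatedContextualMemory(nn.Module):
    """Blend a fixed-size contextual memory vector into the decoder state
    with a learned sigmoid gate (illustrative, not the paper's model)."""
    def __init__(self, hidden_size):
        super().__init__()
        self.gate = nn.Linear(2 * hidden_size, hidden_size)

    def forward(self, decoder_state, context_vector):
        # decoder_state, context_vector: (batch, hidden_size)
        g = torch.sigmoid(self.gate(
            torch.cat([decoder_state, context_vector], dim=-1)))
        return g * decoder_state + (1 - g) * context_vector
\end{verbatim}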