Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shuzi Niu

Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in Tencent

Feb 23, 2023

Xuanji Xiao, Huaqiang Dai, Qian Dong, Shuzi Niu, Yuzhen Liu, Pei Liu

Abstract:Despite recommender systems play a key role in network content platforms, mining the user's interests is still a significant challenge. Existing works predict the user interest by utilizing user behaviors, i.e., clicks, views, etc., but current solutions are ineffective when users perform unsettled activities. The latter ones involve new users, which have few activities of any kind, and sparse users who have low-frequency behaviors. We uniformly describe both these user-types as "cold users", which are very common but often neglected in network content platforms. To address this issue, we enhance the representation of the user interest by combining his social interest, e.g., friendship, following bloggers, interest groups, etc., with the activity behaviors. Thus, in this work, we present a novel algorithm entitled SocialNet, which adopts a two-stage method to progressively extract the coarse-grained and fine-grained social interest. Our technique then concatenates SocialNet's output with the original user representation to get the final user representation that combines behavior interests and social interests. Offline experiments on Tencent video's recommender system demonstrate the superiority over the baseline behavior-based model. The online experiment also shows a significant performance improvement in clicks and view time in the real-world recommendation system. The source code is available at https://github.com/Social4Rec/SocialNet.

Via

Access Paper or Ask Questions

Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

Apr 25, 2022

Qian Dong, Yiding Liu, Suqi Cheng, Shuaiqiang Wang, Zhicong Cheng, Shuzi Niu, Dawei Yin

Figure 1 for Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

Figure 2 for Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

Figure 3 for Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

Figure 4 for Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

Abstract:Passage re-ranking is to obtain a permutation over the candidate passage set from retrieval stage. Re-rankers have been boomed by Pre-trained Language Models (PLMs) due to their overwhelming advantages in natural language understanding. However, existing PLM based re-rankers may easily suffer from vocabulary mismatch and lack of domain specific knowledge. To alleviate these problems, explicit knowledge contained in knowledge graph is carefully introduced in our work. Specifically, we employ the existing knowledge graph which is incomplete and noisy, and first apply it in passage re-ranking task. To leverage a reliable knowledge, we propose a novel knowledge graph distillation method and obtain a knowledge meta graph as the bridge between query and passage. To align both kinds of embedding in the latent space, we employ PLM as text encoder and graph neural network over knowledge meta graph as knowledge encoder. Besides, a novel knowledge injector is designed for the dynamic interaction between text and knowledge encoder. Experimental results demonstrate the effectiveness of our method especially in queries requiring in-depth domain knowledge.

Via

Access Paper or Ask Questions

Improving Variational Encoder-Decoders in Dialogue Generation

Feb 06, 2018

Xiaoyu Shen, Hui Su, Shuzi Niu, Vera Demberg

Figure 1 for Improving Variational Encoder-Decoders in Dialogue Generation

Figure 2 for Improving Variational Encoder-Decoders in Dialogue Generation

Figure 3 for Improving Variational Encoder-Decoders in Dialogue Generation

Figure 4 for Improving Variational Encoder-Decoders in Dialogue Generation

Abstract:Variational encoder-decoders (VEDs) have shown promising results in dialogue generation. However, the latent variable distributions are usually approximated by a much simpler model than the powerful RNN structure used for encoding and decoding, yielding the KL-vanishing problem and inconsistent training objective. In this paper, we separate the training step into two phases: The first phase learns to autoencode discrete texts into continuous embeddings, from which the second phase learns to generalize latent representations by reconstructing the encoded embedding. In this case, latent variables are sampled by transforming Gaussian noise through multi-layer perceptrons and are trained with a separate VED model, which has the potential of realizing a much more flexible distribution. We compare our model with current popular models and the experiment demonstrates substantial improvement in both metric-based and human evaluations.

* Accepted by AAAI2018

Via

Access Paper or Ask Questions

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Oct 11, 2017

Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, Shuzi Niu

Figure 1 for DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Figure 2 for DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Figure 3 for DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Figure 4 for DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

Abstract:We develop a high-quality multi-turn dialog dataset, DailyDialog, which is intriguing in several aspects. The language is human-written and less noisy. The dialogues in the dataset reflect our daily communication way and cover various topics about our daily life. We also manually label the developed dataset with communication intention and emotion information. Then, we evaluate existing approaches on DailyDialog dataset and hope it benefit the research field of dialog systems.

* accepted by IJCNLP 2017

Via

Access Paper or Ask Questions

A Conditional Variational Framework for Dialog Generation

Jul 06, 2017

Xiaoyu Shen, Hui Su, Yanran Li, Wenjie Li, Shuzi Niu, Yang Zhao, Akiko Aizawa, Guoping Long

Figure 1 for A Conditional Variational Framework for Dialog Generation

Figure 2 for A Conditional Variational Framework for Dialog Generation

Figure 3 for A Conditional Variational Framework for Dialog Generation

Figure 4 for A Conditional Variational Framework for Dialog Generation

Abstract:Deep latent variable models have been shown to facilitate the response generation for open-domain dialog systems. However, these latent variables are highly randomized, leading to uncontrollable generated responses. In this paper, we propose a framework allowing conditional response generation based on specific attributes. These attributes can be either manually assigned or automatically detected. Moreover, the dialog states for both speakers are modeled separately in order to reflect personal features. We validate this framework on two different scenarios, where the attribute refers to genericness and sentiment states respectively. The experiment result testified the potential of our model, where meaningful responses can be generated in accordance with the specified attributes.

* Accepted by ACL2017

Via

Access Paper or Ask Questions

Stochastic Rank Aggregation

Sep 26, 2013

Shuzi Niu, Yanyan Lan, Jiafeng Guo, Xueqi Cheng

Figure 1 for Stochastic Rank Aggregation

Figure 2 for Stochastic Rank Aggregation

Figure 3 for Stochastic Rank Aggregation

Figure 4 for Stochastic Rank Aggregation

Abstract:This paper addresses the problem of rank aggregation, which aims to find a consensus ranking among multiple ranking inputs. Traditional rank aggregation methods are deterministic, and can be categorized into explicit and implicit methods depending on whether rank information is explicitly or implicitly utilized. Surprisingly, experimental results on real data sets show that explicit rank aggregation methods would not work as well as implicit methods, although rank information is critical for the task. Our analysis indicates that the major reason might be the unreliable rank information from incomplete ranking inputs. To solve this problem, we propose to incorporate uncertainty into rank aggregation and tackle the problem in both unsupervised and supervised scenario. We call this novel framework {stochastic rank aggregation} (St.Agg for short). Specifically, we introduce a prior distribution on ranks, and transform the ranking functions or objectives in traditional explicit methods to their expectations over this distribution. Our experiments on benchmark data sets show that the proposed St.Agg outperforms the baselines in both unsupervised and supervised scenarios.

* Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

Via

Access Paper or Ask Questions