Wenjie Zheng

Towards Explainable Multimodal Depression Recognition for Clinical Interviews

Jan 27, 2025

Wenjie Zheng, Qiming Xie, Zengzhi Wang, Jianfei Yu, Rui Xia

Abstract:Multimodal depression recognition for clinical interviews (MDRC) has recently attracted considerable attention. Existing MDRC studies mainly focus on improving task performance and have made significant progress. However, for clinical applications, model transparency is critical, and previous works ignore the interpretability of the decision-making process. To address this issue, we propose the Explainable Multimodal Depression Recognition for Clinical Interviews (EMDRC) task, which aims to provide evidence for depression recognition by summarizing symptoms and uncovering underlying causes. Given an interviewer-participant interaction scenario, the goal of EMDRC is to produce a structured summary of the participant's symptoms based on the eight-item Patient Health Questionnaire depression scale (PHQ-8) and to predict their depression severity. To tackle the EMDRC task, we construct a new dataset based on an existing MDRC dataset. Moreover, we utilize the PHQ-8 and propose a PHQ-aware multimodal multi-task learning framework, which captures utterance-level symptom-related semantic information to help generate the dialogue-level summary. Experimental results on our annotated dataset demonstrate the superiority of our proposed methods over baseline systems on the EMDRC task.

* 21 pages
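
For readers unfamiliar with the PHQ-8 named in the abstract, the following is a minimal sketch of how a structured per-item summary could map to a severity label. It is not the paper's framework. It assumes the usual PHQ-8 convention of eight items scored 0 to 3 (total 0 to 24) and the commonly cited severity bands at 5, 10, 15 and 20; the item labels and function names are hypothetical.

```python
from typing import Dict

# Hypothetical short labels for the eight PHQ-8 items (wording is illustrative).
PHQ8_ITEMS = [
    "interest", "mood", "sleep", "energy",
    "appetite", "self_worth", "concentration", "psychomotor",
]

# Commonly cited severity bands on the 0-24 total (an assumption, not from the paper).
SEVERITY_BANDS = [
    (20, "severe"),
    (15, "moderately severe"),
    (10, "moderate"),
    (5, "mild"),
    (0, "none/minimal"),
]


def phq8_total(item_scores: Dict[str, int]) -> int:
    """Sum the eight item scores after checking each lies in 0..3."""
    for name in PHQ8_ITEMS:
        if not 0 <= item_scores[name] <= 3:
            raise ValueError(f"score for {name!r} must be between 0 and 3")
    return sum(item_scores[name] for name in PHQ8_ITEMS)


def phq8_severity(total: int) -> str:
    """Map a PHQ-8 total to the first band whose lower bound it reaches."""
    if not 0 <= total <= 24:
        raise ValueError("PHQ-8 total must be between 0 and 24")
    for lower_bound, label in SEVERITY_BANDS:
        if total >= lower_bound:
            return label
    raise AssertionError("unreachable: the last band starts at 0")
```

A per-item dictionary of this kind is one plausible shape for a structured symptom summary; the dataset's actual annotation schema may differ.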

Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation

Dec 12, 2024

Lianrui Mu, Xingze Zhou, Wenjie Zheng, Jiangnan Ye, Xiaoyu Liang, Yuchen Yang, Jianhong Bai, Jiedong Zhuang, Haoji Hu

Abstract:Landmark-guided character animation generation is an important field. Generating character animations with facial features consistent with a reference image remains a significant challenge in conditional video generation, especially when complex motions such as dancing are involved. Existing methods often fail to maintain facial feature consistency due to mismatches between the facial landmarks extracted from source videos and the target facial features in the reference image. To address this problem, we propose a facial landmark transformation method based on the 3D Morphable Model (3DMM). We obtain transformed landmarks that align with the target facial features by reconstructing 3D faces from the source landmarks and adjusting the 3DMM parameters to match the reference image. Our method improves the facial consistency between the generated videos and the reference images, effectively alleviating the facial feature mismatch problem.

Hyperbolic Hierarchical Knowledge Graph Embeddings for Link Prediction in Low Dimensions

Apr 28, 2022

Wenjie Zheng, Wenxue Wang, Fulan Qian, Shu Zhao, Yanping Zhang

Abstract:Knowledge graph embeddings (KGE) have been validated as powerful methods for inferring missing links in knowledge graphs (KGs) since they map entities into Euclidean space and treat relations as transformations of entities. Currently, some Euclidean KGE methods model the semantic hierarchies prevalent in KGs and improve link prediction performance. For hierarchical data, hyperbolic space has shown the promise of high fidelity and low memory consumption as an embedding space compared with traditional Euclidean space; however, existing hyperbolic KGE methods neglect to model these semantic hierarchies. To address this issue, we propose a novel KGE model -- hyperbolic hierarchical KGE (HypHKGE). To be specific, we first design attention-based learnable curvatures for hyperbolic space to preserve rich semantic hierarchies. Moreover, we define hyperbolic hierarchical transformations based on the theory of hyperbolic geometry, which utilize the preserved hierarchies to infer links. Experiments show that HypHKGE can effectively model semantic hierarchies in hyperbolic space and outperforms the state-of-the-art hyperbolic methods, especially in low dimensions.
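
To make the role of curvature concrete, here is a minimal sketch of the distance in a Poincaré ball of curvature -c, one standard model of hyperbolic space used by hyperbolic embedding methods. It is not HypHKGE itself: the abstract does not specify the model's scoring function, and the curvature here is a fixed argument rather than a learned, attention-based quantity. The function names are illustrative.

```python
import math
from typing import List, Sequence


def _dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(p * q for p, q in zip(a, b))


def mobius_add(x: Sequence[float], y: Sequence[float], c: float) -> List[float]:
    """Moebius addition of x and y in the Poincare ball of curvature -c."""
    xy, x2, y2 = _dot(x, y), _dot(x, x), _dot(y, y)
    denom = 1 + 2 * c * xy + c * c * x2 * y2
    coef_x = 1 + 2 * c * xy + c * y2
    coef_y = 1 - c * x2
    return [(coef_x * xi + coef_y * yi) / denom for xi, yi in zip(x, y)]


def poincare_distance(x: Sequence[float], y: Sequence[float], c: float = 1.0) -> float:
    """Geodesic distance: (2 / sqrt(c)) * artanh(sqrt(c) * ||(-x) (+) y||)."""
    diff = mobius_add([-xi for xi in x], y, c)
    norm = math.sqrt(_dot(diff, diff))
    # Clamp to stay inside the domain of atanh despite rounding error.
    arg = min(math.sqrt(c) * norm, 1 - 1e-12)
    return 2 / math.sqrt(c) * math.atanh(arg)
```

Points must lie strictly inside the ball (squared norm below 1/c). Distances grow rapidly near the boundary, which is the property that lets hyperbolic space represent tree-like hierarchies in few dimensions.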

Total Variation Regularization for Compartmental Epidemic Models with Time-varying Dynamics

Apr 01, 2020

Wenjie Zheng

Abstract:Traditional methods for inferring compartmental epidemic models with time-varying dynamics can only capture continuous changes in the dynamics. However, many changes are discontinuous due to sudden interventions, such as city lockdowns and the opening of field hospitals. To model the discontinuities, this study introduces the tool of total variation regularization, which regulates the temporal changes of the dynamic parameters, such as the transmission rate. To recover the ground-truth dynamics, this study designs a novel yet straightforward optimization algorithm, dubbed iterative Nelder-Mead, which repeatedly applies the Nelder-Mead algorithm. Experiments on simulated data show that the proposed approach can qualitatively reproduce the discontinuities of the underlying dynamics. To extend this research to real data as well as to help researchers worldwide fight against COVID-19, the author releases his research platform as an open-source package.

* 13 pages, 7 figures
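
The following sketch illustrates why a total variation penalty suits discontinuous dynamics. It compares the penalty on a transmission-rate sequence with one abrupt jump against a gradual ramp between the same endpoints. The sequences, the weight `lam`, and the function names are illustrative assumptions; this is not the released package, and the iterative Nelder-Mead optimizer is not reproduced here.

```python
from typing import Sequence


def total_variation(beta: Sequence[float]) -> float:
    """Sum of absolute changes between consecutive time steps."""
    return sum(abs(b1 - b0) for b0, b1 in zip(beta, beta[1:]))


def squared_variation(beta: Sequence[float]) -> float:
    """Sum of squared changes, a smoothness penalty shown for contrast."""
    return sum((b1 - b0) ** 2 for b0, b1 in zip(beta, beta[1:]))


def regularized_loss(data_fit: float, beta: Sequence[float], lam: float) -> float:
    """Data-fit term plus a weighted total variation penalty on beta."""
    return data_fit + lam * total_variation(beta)


# A sudden drop (e.g. a lockdown) versus a gradual decline of the same size.
jump = [0.4] * 5 + [0.1] * 5
ramp = [0.4 - 0.3 * t / 9 for t in range(10)]
```

Both sequences have the same total variation (the overall drop of 0.3), so the penalty does not favor the ramp over the jump. The squared penalty, by contrast, is nine times larger for the jump, which is why smoothness penalties tend to blur abrupt changes.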

$hv$-Block Cross Validation is not a BIBD: a Note on the Paper by Jeff Racine

Oct 20, 2019

Wenjie Zheng

Abstract:This note corrects a mistake in the paper "Consistent cross-validatory model-selection for dependent data: $hv$-block cross-validation" by Racine (2000). In his paper, he implied that the $hv$-block cross-validation proposed therein is consistent in the sense of Shao (1993). This intuition relied on the speculation that $hv$-block is a balanced incomplete block design (BIBD). This note demonstrates that this is not the case, and thus the theoretical consistency of $hv$-block remains an open question. In addition, I provide a Python program that counts the number of occurrences of each sample and each pair of samples.

* Technical report. 5 pages, 1 figure
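
The counting argument mentioned in the abstract can be sketched as follows. This is not the author's program. It assumes one common reading of $hv$-block cross-validation: for each center `i`, the validation block is the `2v + 1` samples around `i`, a further `h` samples on each side are discarded, and the rest form the training set. Restricting centers to those whose validation block fits entirely inside the series, and counting over validation blocks, are also assumptions.

```python
from collections import Counter
from itertools import combinations
from typing import Iterable, Iterator, List, Sequence, Tuple


def hv_block_splits(n: int, h: int, v: int) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train, valid) index lists for each admissible center i."""
    for i in range(v, n - v):
        valid = list(range(i - v, i + v + 1))
        # Training samples lie more than v + h positions away from the center.
        train = [j for j in range(n) if abs(j - i) > v + h]
        yield train, valid


def count_occurrences(blocks: Iterable[Sequence[int]]) -> Tuple[Counter, Counter]:
    """Count how often each sample and each unordered pair appears in the blocks."""
    singles: Counter = Counter()
    pairs: Counter = Counter()
    for block in blocks:
        singles.update(block)
        pairs.update(combinations(sorted(block), 2))
    return singles, pairs


def is_balanced(singles: Counter, n: int) -> bool:
    """A BIBD requires every sample to appear in the same number of blocks."""
    return len({singles[j] for j in range(n)}) == 1
```

With `n=7, h=1, v=1`, the end samples fall in one validation block each while the middle samples fall in three, so the design is already unbalanced at the level of single samples under these assumptions.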

A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

May 11, 2018

Wenjie Zheng, Aurélien Bellet, Patrick Gallinari

Abstract:We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank-Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted from a deep neural network.
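
As a rough single-machine illustration of the step the abstract describes, the sketch below shows a Frank-Wolfe update over a trace norm ball of radius `mu`, where the linear minimization step reduces to finding the top singular vector pair of the gradient and is approximated by power iterations. It is not DFW-Trace: nothing here is distributed, the `2 / (t + 2)` step size is the textbook default rather than the paper's choice, and all names are illustrative.

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]


def _matvec(a: Matrix, x: List[float]) -> List[float]:
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]


def _rmatvec(a: Matrix, y: List[float]) -> List[float]:
    """Compute A^T y without forming the transpose."""
    return [sum(a[i][j] * y[i] for i in range(len(a))) for j in range(len(a[0]))]


def _normalize(x: List[float]) -> List[float]:
    norm = math.sqrt(sum(xi * xi for xi in x))
    return [xi / norm for xi in x]


def top_singular_pair(g: Matrix, iters: int = 50) -> Tuple[List[float], List[float]]:
    """Approximate the leading singular vectors of g by power iteration on g^T g."""
    # Fixed start vector for reproducibility; a random start is the usual choice,
    # since a start orthogonal to the leading direction would not converge to it.
    v = _normalize([1.0] * len(g[0]))
    for _ in range(iters):
        v = _normalize(_rmatvec(g, _matvec(g, v)))
    u = _normalize(_matvec(g, v))
    return u, v


def trace_norm_lmo(g: Matrix, mu: float, iters: int = 50) -> Matrix:
    """Minimizer of <S, g> over the trace norm ball: the rank-one matrix -mu * u v^T."""
    u, v = top_singular_pair(g, iters)
    return [[-mu * ui * vj for vj in v] for ui in u]


def frank_wolfe_step(w: Matrix, g: Matrix, mu: float, t: int) -> Matrix:
    """One update w <- (1 - gamma) * w + gamma * s with gamma = 2 / (t + 2)."""
    gamma = 2.0 / (t + 2.0)
    s = trace_norm_lmo(g, mu)
    return [
        [(1 - gamma) * wij + gamma * sij for wij, sij in zip(w_row, s_row)]
        for w_row, s_row in zip(w, s)
    ]
```

Because each update adds a rank-one matrix, the iterate can be stored and communicated as a list of vector pairs rather than a dense matrix, which is the low-rank structure the abstract refers to.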

Toward a Better Understanding of Leaderboard

Jun 07, 2017

Wenjie Zheng

Abstract:The leaderboard in machine learning competitions is a tool to show the performance of various participants and to compare them. However, the leaderboard quickly becomes inaccurate due to hacking or overfitting. This article gives two pieces of advice to prevent easy hacking or overfitting. By following this advice, we reach the conclusion that something like the Ladder leaderboard introduced in [blum2015ladder] is inevitable. With this understanding, we naturally simplify Ladder by eliminating its redundant computation and explain how to choose its parameter and interpret it. We also prove that the sample complexity is cubic in the desired precision of the leaderboard.

* 9 pages, 3 figures
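
For context, the update rule of the original Ladder mechanism can be sketched as below: a participant's displayed score changes only when a new submission beats the previous best by more than a step size, and the displayed value is rounded to a multiple of that step size. This reflects the Ladder rule as commonly described, not the simplified variant proposed in this article; the class and parameter names are illustrative.

```python
import math


class Ladder:
    """Tracks the best displayed loss for one participant."""

    def __init__(self, step_size: float) -> None:
        if step_size <= 0:
            raise ValueError("step_size must be positive")
        self.step_size = step_size
        self.best = math.inf  # no submission yet

    def submit(self, loss: float) -> float:
        """Update the displayed loss only on a clear improvement, then return it."""
        if loss < self.best - self.step_size:
            # Round to the nearest multiple of step_size so small differences stay hidden.
            self.best = round(loss / self.step_size) * self.step_size
        return self.best
```

Submissions that improve by less than `step_size` leave the displayed score unchanged, which limits how much information about the holdout set repeated submissions can leak.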

Two Differentially Private Rating Collection Mechanisms for Recommender Systems

Apr 28, 2016

Wenjie Zheng

Abstract:We design two mechanisms for recommender systems to collect user ratings. One is a modified Laplace mechanism, and the other is a randomized response mechanism. We prove that both are differentially private and preserve data utility.
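
The two families of mechanisms named in the abstract can be illustrated with their textbook forms. This sketch is not the paper's construction: the abstract does not say how the Laplace mechanism is modified, so the plain version is shown, and the binary like/dislike rating, the rating range, and all function names are assumptions.

```python
import math
import random
from typing import Sequence


def truth_probability(epsilon: float) -> float:
    """Probability of reporting the true bit under epsilon-DP randomized response."""
    return math.exp(epsilon) / (1.0 + math.exp(epsilon))


def randomized_response(bit: int, epsilon: float, rng: random.Random) -> int:
    """Report the true bit with probability e^eps / (1 + e^eps), else flip it."""
    return bit if rng.random() < truth_probability(epsilon) else 1 - bit


def debiased_mean(reports: Sequence[int], epsilon: float) -> float:
    """Unbiased estimate of the true fraction of ones from noisy reports."""
    p = truth_probability(epsilon)
    observed = sum(reports) / len(reports)
    return (observed - (1.0 - p)) / (2.0 * p - 1.0)


def laplace_rating(rating: float, low: float, high: float,
                   epsilon: float, rng: random.Random) -> float:
    """Add Laplace noise with scale (high - low) / epsilon to a single rating."""
    scale = (high - low) / epsilon
    # The difference of two i.i.d. exponentials with mean `scale` is Laplace(0, scale).
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return rating + noise
```

In both cases each user perturbs their own rating before sending it, and the collector recovers aggregate statistics, for example with `debiased_mean`, without seeing any individual's true value.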
