Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haopeng Chen

Heterogeneous Subgraph Network with Prompt Learning for Interpretable Depression Detection on Social Media

Jul 12, 2024

Chen Chen, Mingwei Li, Fenghuan Li, Haopeng Chen, Yuankun Lin

Figure 1 for Heterogeneous Subgraph Network with Prompt Learning for Interpretable Depression Detection on Social Media

Figure 2 for Heterogeneous Subgraph Network with Prompt Learning for Interpretable Depression Detection on Social Media

Figure 3 for Heterogeneous Subgraph Network with Prompt Learning for Interpretable Depression Detection on Social Media

Figure 4 for Heterogeneous Subgraph Network with Prompt Learning for Interpretable Depression Detection on Social Media

Abstract:Massive social media data can reflect people's authentic thoughts, emotions, communication, etc., and therefore can be analyzed for early detection of mental health problems such as depression. Existing works about early depression detection on social media lacked interpretability and neglected the heterogeneity of social media data. Furthermore, they overlooked the global interaction among users. To address these issues, we develop a novel method that leverages a Heterogeneous Subgraph Network with Prompt Learning(HSNPL) and contrastive learning mechanisms. Specifically, prompt learning is employed to map users' implicit psychological symbols with excellent interpretability while deep semantic and diverse behavioral features are incorporated by a heterogeneous information network. Then, the heterogeneous graph network with a dual attention mechanism is constructed to model the relationships among heterogeneous social information at the feature level. Furthermore, the heterogeneous subgraph network integrating subgraph attention and self-supervised contrastive learning is developed to explore complicated interactions among users and groups at the user level. Extensive experimental results demonstrate that our proposed method significantly outperforms state-of-the-art methods for depression detection on social media.

Via

Access Paper or Ask Questions

Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

Feb 27, 2024

Zhenhong Zhou, Jiuyang Xiang, Haopeng Chen, Quan Liu, Zherui Li, Sen Su

Figure 1 for Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

Figure 2 for Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

Figure 3 for Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

Figure 4 for Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue

Abstract:Large Language Models (LLMs) have been demonstrated to generate illegal or unethical responses, particularly when subjected to "jailbreak." Research on jailbreak has highlighted the safety issues of LLMs. However, prior studies have predominantly focused on single-turn dialogue, ignoring the potential complexities and risks presented by multi-turn dialogue, a crucial mode through which humans derive information from LLMs. In this paper, we argue that humans could exploit multi-turn dialogue to induce LLMs into generating harmful information. LLMs may not intend to reject cautionary or borderline unsafe queries, even if each turn is closely served for one malicious purpose in a multi-turn dialogue. Therefore, by decomposing an unsafe query into several sub-queries for multi-turn dialogue, we induced LLMs to answer harmful sub-questions incrementally, culminating in an overall harmful response. Our experiments, conducted across a wide range of LLMs, indicate current inadequacies in the safety mechanisms of LLMs in multi-turn dialogue. Our findings expose vulnerabilities of LLMs in complex scenarios involving multi-turn dialogue, presenting new challenges for the safety of LLMs.

* working in progress 23pages, 18 figures

Via

Access Paper or Ask Questions

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Dec 15, 2023

Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han(+3 more)

Figure 1 for Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Figure 2 for Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Figure 3 for Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Figure 4 for Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Abstract:Amphion is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. Amphion offers a unique feature: visualizations of classic models or architectures. We believe that these visualizations are beneficial for junior researchers and engineers who wish to gain a better understanding of the model. The North-Star objective of Amphion is to offer a platform for studying the conversion of any inputs into general audio. Amphion is designed to support individual generation tasks. In addition to the specific generation tasks, Amphion also includes several vocoders and evaluation metrics. A vocoder is an important module for producing high-quality audio signals, while evaluation metrics are critical for ensuring consistent metrics in generation tasks. In this paper, we provide a high-level overview of Amphion.

* GitHub: https://github.com/open-mmlab/Amphion

Via

Access Paper or Ask Questions

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion

Oct 17, 2023

Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu

Abstract:Singing voice conversion (SVC) is a technique to enable an arbitrary singer to sing an arbitrary song. To achieve that, it is important to obtain speaker-agnostic representations from source audio, which is a challenging task. A common solution is to extract content-based features (e.g., PPGs) from a pretrained acoustic model. However, the choices for acoustic models are vast and varied. It is yet to be explored what characteristics of content features from different acoustic models are, and whether integrating multiple content features can help each other. Motivated by that, this study investigates three distinct content features, sourcing from WeNet, Whisper, and ContentVec, respectively. We explore their complementary roles in intelligibility, prosody, and conversion similarity for SVC. By integrating the multiple content features with a diffusion-based SVC model, our SVC system achieves superior conversion performance on both objective and subjective evaluation in comparison to a single source of content features. Our demo page and code can be available https://www.zhangxueyao.com/data/MultipleContentsSVC/index.html.

Via

Access Paper or Ask Questions

Edge-Featured Graph Attention Network

Jan 19, 2021

Jun Chen, Haopeng Chen

Figure 1 for Edge-Featured Graph Attention Network

Figure 2 for Edge-Featured Graph Attention Network

Figure 3 for Edge-Featured Graph Attention Network

Figure 4 for Edge-Featured Graph Attention Network

Abstract:Lots of neural network architectures have been proposed to deal with learning tasks on graph-structured data. However, most of these models concentrate on only node features during the learning process. The edge features, which usually play a similarly important role as the nodes, are often ignored or simplified by these models. In this paper, we present edge-featured graph attention networks, namely EGATs, to extend the use of graph neural networks to those tasks learning on graphs with both node and edge features. These models can be regarded as extensions of graph attention networks (GATs). By reforming the model structure and the learning process, the new models can accept node and edge features as inputs, incorporate the edge information into feature representations, and iterate both node and edge features in a parallel but mutual way. The results demonstrate that our work is highly competitive against other node classification approaches, and can be well applied in edge-featured graph learning tasks.

Via

Access Paper or Ask Questions