Yongjie Wang

An Intelligent and Privacy-Preserving Digital Twin Model for Aging-in-Place

Apr 04, 2025

Yongjie Wang, Jonathan Cyril Leung, Ming Chen, Zhiwei Zeng, Benny Toh Hsiang Tan, Yang Qiu, Zhiqi Shen

Abstract: The population of older adults is steadily increasing, with a strong preference for aging-in-place rather than moving to care facilities. Consequently, supporting this growing demographic has become a significant global challenge. However, facilitating successful aging-in-place is challenging, requiring consideration of multiple factors such as data privacy, health status monitoring, and living environments to improve health outcomes. In this paper, we propose an unobtrusive sensor system designed for installation in older adults' homes. Using data from the sensors, our system constructs a digital twin, a virtual representation of events and activities that occurred in the home. The system uses neural network models and decision rules to capture residents' activities and living environments. This digital twin enables continuous health monitoring by providing actionable insights into residents' well-being. Our system is designed to be low-cost and privacy-preserving, with the aim of providing green and safe monitoring for the health of older adults. We have successfully deployed our system in two homes over a time period of two months, and our findings demonstrate the feasibility and effectiveness of digital twin technology in supporting independent living for older adults. This study highlights that our system could revolutionize elder care by enabling personalized interventions, such as lifestyle adjustments, medical treatments, or modifications to the residential environment, to enhance health outcomes.

* Accepted to IEEE TENSYMP 2025

A Survey on Natural Language Counterfactual Generation

Jul 04, 2024

Yongjie Wang, Xiaoqi Qiu, Yu Yue, Xu Guo, Zhiwei Zeng, Yuhong Feng, Zhiqi Shen

Abstract: Natural Language Counterfactual generation aims to minimally modify a given text such that the modified text will be classified into a different class. The generated counterfactuals provide insight into the reasoning behind a model's predictions by highlighting which words significantly influence the outcomes. Additionally, they can be used to detect model fairness issues or augment the training data to enhance the model's robustness. A substantial amount of research has been conducted to generate counterfactuals for various NLP tasks, employing different models and methodologies. With the rapid growth of studies in this field, a systematic review is crucial to guide future researchers and developers. To bridge this gap, this survey comprehensively overviews textual counterfactual generation methods, particularly those based on Large Language Models. We propose a new taxonomy that categorizes the generation methods into four groups and systematically summarize the metrics for evaluating the generation quality. Finally, we discuss ongoing research challenges and outline promising directions for future work.

* A survey paper

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning

Jun 09, 2024

Xiaoqi Qiu, Yongjie Wang, Xu Guo, Zhiwei Zeng, Yue Yu, Yuhong Feng, Chunyan Miao

Abstract: Counterfactually Augmented Data (CAD) involves creating new data samples by applying minimal yet sufficient modifications to flip the label of existing data samples to other classes. Training with CAD enhances model robustness against spurious features that happen to correlate with labels by spreading the causal relationships across different classes. Yet, recent research reveals that training with CAD may lead models to overly focus on modified features while ignoring other important contextual information, inadvertently introducing biases that may impair performance on out-of-distribution (OOD) datasets. To mitigate this issue, we employ contrastive learning to promote global feature alignment in addition to learning counterfactual clues. We theoretically prove that contrastive loss can encourage models to leverage a broader range of features beyond those modified ones. Comprehensive experiments on two human-edited CAD datasets demonstrate that our proposed method outperforms the state-of-the-art on OOD datasets.

* Accepted by ACL 2024 main conference

Gradient based Feature Attribution in Explainable AI: A Technical Review

Mar 15, 2024

Yongjie Wang, Tong Zhang, Xu Guo, Zhiqi Shen

Abstract: The surge in black-box AI models has prompted the need to explain their internal mechanisms and justify their reliability, especially in high-stakes applications such as healthcare and autonomous driving. Due to the lack of a rigorous definition of explainable AI (XAI), a plethora of research related to explainability, interpretability, and transparency has been developed to explain and analyze models from various perspectives. Consequently, with such an extensive list of papers, it becomes challenging to have a comprehensive overview of XAI research from all aspects. Considering the popularity of neural networks in AI research, we narrow our focus to a specific area of XAI research: gradient based explanations, which can be directly adopted for neural network models. In this review, we systematically explore gradient based explanation methods to date and introduce a novel taxonomy to categorize them into four distinct classes. Then, we present the essential technical details in chronological order and underscore the evolution of algorithms. Next, we introduce both human and quantitative evaluations to measure algorithm performance. More importantly, we demonstrate the general challenges in XAI and specific challenges in gradient based explanations. We hope that this survey can help researchers understand state-of-the-art methods and their corresponding disadvantages, which could spark their interest in addressing these issues in future work.
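
As a rough illustration of what a gradient based explanation computes, the sketch below applies gradient-times-input attribution to a tiny logistic model. This is one well-known method chosen here for illustration; the abstract does not name it, and the model, weights, and function names are assumptions, not taken from the review.

```python
import math


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def gradient_times_input(w, b, x):
    """Attribute the output p = sigmoid(w.x + b) to each input feature.

    For this model the gradient is analytic: dp/dx_i = p * (1 - p) * w_i.
    The attribution of feature i is that gradient multiplied by x_i.
    """
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
    grad = [p * (1.0 - p) * wi for wi in w]
    return [g * xi for g, xi in zip(grad, x)]


# Hypothetical weights and input, for illustration only.
w, b = [3.0, -2.0, 0.0], 0.5
x = [1.0, 1.0, 5.0]
attr = gradient_times_input(w, b, x)
print(attr)
```

The first feature receives a positive attribution, the second a negative one, and the third gets zero because its weight is zero, even though its input value is the largest.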

Flexible and Robust Counterfactual Explanations with Minimal Satisfiable Perturbations

Sep 09, 2023

Yongjie Wang, Hangwei Qian, Yongjie Liu, Wei Guo, Chunyan Miao

Abstract: Counterfactual explanations (CFEs) exemplify how to minimally modify a feature vector to achieve a different prediction for an instance. CFEs can enhance informational fairness and trustworthiness, and provide suggestions for users who receive adverse predictions. However, recent research has shown that multiple CFEs can be offered for the same instance or instances with slight differences. Multiple CFEs provide flexible choices and cover diverse desiderata for user selection. However, individual fairness and model reliability will be damaged if unstable CFEs with different costs are returned. Existing methods fail to exploit flexibility and address the concerns of non-robustness simultaneously. To address these issues, we propose a conceptually simple yet effective solution named Counterfactual Explanations with Minimal Satisfiable Perturbations (CEMSP). Specifically, CEMSP constrains changing values of abnormal features with the help of their semantically meaningful normal ranges. For efficiency, we model the problem as a Boolean satisfiability problem to modify as few features as possible. Additionally, CEMSP is a general framework and can easily accommodate more practical requirements, e.g., causality and actionability. We conduct comprehensive experiments on both synthetic and real-world datasets to demonstrate that, compared to existing methods, our method provides more robust explanations while preserving flexibility.

* Accepted by CIKM 2023
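
To make the general notion of a counterfactual explanation concrete, here is a minimal sketch for a linear classifier, where the closest point across the decision boundary has a closed form. It is not CEMSP (it uses no normal ranges and no satisfiability solver); the classifier, weights, and function names are hypothetical.

```python
def score(w, b, x):
    """Linear score w.x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def predict(w, b, x):
    """Return class 1 if the score is positive, else class 0."""
    return 1 if score(w, b, x) > 0 else 0


def closest_counterfactual(w, b, x, margin=1e-3):
    """Smallest L2 change to x that crosses the boundary w.x + b = 0.

    The point is projected onto the boundary along w, then pushed slightly
    past it (by `margin`) so that the predicted class actually flips.
    Assumes x does not already lie exactly on the boundary.
    """
    norm_sq = sum(wi * wi for wi in w)
    step = (1.0 + margin) * score(w, b, x) / norm_sq
    return [xi - step * wi for wi, xi in zip(w, x)]


# Hypothetical classifier and instance, for illustration only.
w, b = [2.0, -1.0], -1.0
x = [0.0, 1.0]
x_cf = closest_counterfactual(w, b, x)
print(predict(w, b, x), predict(w, b, x_cf), x_cf)
```

Because this closed form always returns the single nearest point, it offers none of the flexibility across multiple CFEs that the abstract discusses.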

Explaining Language Models' Predictions with High-Impact Concepts

May 03, 2023

Ruochen Zhao, Shafiq Joty, Yongjie Wang, Tan Wang

Abstract: The emergence of large-scale pretrained language models has posed unprecedented challenges in deriving explanations of why the model has made some predictions. Stemming from the compositional nature of languages, spurious correlations have further undermined the trustworthiness of NLP systems, leading to unreliable model explanations that are merely correlated with the output predictions. To encourage fairness and transparency, there exists an urgent demand for reliable explanations that allow users to consistently understand the model's behavior. In this work, we propose a complete framework for extending concept-based interpretability methods to NLP. Specifically, we propose a post-hoc interpretability method for extracting predictive high-level features (concepts) from the pretrained model's hidden layer activations. We optimize for features whose existence causes the output predictions to change substantially, i.e., features that generate a high impact. Moreover, we devise several evaluation metrics that can be universally applied. Extensive experiments on real and synthetic tasks demonstrate that our method achieves superior results on predictive impact, usability, and faithfulness compared to the baselines.

On the Use of BERT for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation

May 21, 2022

Yongjie Wang, Chuan Wang, Ruobing Li, Hui Lin

Abstract: In recent years, pre-trained models have become dominant in most natural language processing (NLP) tasks. However, in the area of Automated Essay Scoring (AES), pre-trained models such as BERT have not been properly used to outperform other deep learning models such as LSTM. In this paper, we introduce a novel multi-scale essay representation for BERT that can be jointly learned. We also employ multiple losses and transfer learning from out-of-domain essays to further improve the performance. Experimental results show that our approach derives much benefit from joint learning of multi-scale essay representation and obtains almost the state-of-the-art result among all deep learning models in the ASAP task. Our multi-scale essay representation also generalizes well to the CommonLit Readability Prize data set, which suggests that the novel text representation proposed in this paper may be a new and effective choice for long-text tasks.

* Accepted to NAACL 2022 as a long paper

DualCF: Efficient Model Extraction Attack from Counterfactual Explanations

May 13, 2022

Yongjie Wang, Hangwei Qian, Chunyan Miao

Abstract: Cloud service providers have launched Machine-Learning-as-a-Service (MLaaS) platforms to allow users to access large-scale cloud-based models via APIs. In addition to prediction outputs, these APIs can also provide other information in a more human-understandable way, such as counterfactual explanations (CF). However, such extra information inevitably causes the cloud models to be more vulnerable to extraction attacks which aim to steal the internal functionality of models in the cloud. Due to the black-box nature of cloud models, however, a vast number of queries are inevitably required by existing attack strategies before the substitute model achieves high fidelity. In this paper, we propose a novel, simple yet efficient querying strategy to greatly enhance the querying efficiency to steal a classification model. This is motivated by our observation that current querying strategies suffer from a decision boundary shift issue induced by taking far-distant queries and close-to-boundary CFs into substitute model training. We then propose the DualCF strategy to circumvent the above issues, which is achieved by taking not only CF but also counterfactual explanation of CF (CCF) as pairs of training samples for the substitute model. Extensive and comprehensive experimental evaluations are conducted on both synthetic and real-world datasets. The experimental results favorably illustrate that DualCF can produce a high-fidelity model with fewer queries efficiently and effectively.

* in Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22), June 21-24, 2022, Seoul, Republic of Korea

Removing Stripes, Scratches, and Curtaining with Non-Recoverable Compressed Sensing

Jan 23, 2019

Jonathan Schwartz, Yi Jiang, Yongjie Wang, Anthony Aiello, Pallab Bhattacharya, Hui Yuan, Zetian Mi, Nabil Bassim, Robert Hovden

Abstract: Highly-directional image artifacts such as ion mill curtaining, mechanical scratches, or image striping from beam instability degrade the interpretability of micrographs. These unwanted, aperiodic features extend the image along a primary direction and occupy a small wedge of information in Fourier space. Deleting this wedge of data replaces stripes, scratches, or curtaining with more complex streaking and blurring artifacts, known within the tomography community as missing wedge artifacts. Here, we overcome this problem by recovering the missing region using total variation minimization, which leverages image-sparsity-based reconstruction techniques (colloquially referred to as compressed sensing) to reliably restore images corrupted by stripe-like features. Our approach removes beam instability, ion mill curtaining, mechanical scratches, or any stripe features and remains robust at low signal-to-noise. The success of this approach is achieved by exploiting compressed sensing's inability to recover directional structures that are highly localized and missing in Fourier space.

* 15 pages, 5 figures
