Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xia Chen

ScreenAudit: Detecting Screen Reader Accessibility Errors in Mobile Apps Using Large Language Models

Apr 02, 2025

Mingyuan Zhong, Ruolin Chen, Xia Chen, James Fogarty, Jacob O. Wobbrock

Abstract:Many mobile apps are inaccessible, thereby excluding people from their potential benefits. Existing rule-based accessibility checkers aim to mitigate these failures by identifying errors early during development but are constrained in the types of errors they can detect. We present ScreenAudit, an LLM-powered system designed to traverse mobile app screens, extract metadata and transcripts, and identify screen reader accessibility errors overlooked by existing checkers. We recruited six accessibility experts including one screen reader user to evaluate ScreenAudit's reports across 14 unique app screens. Our findings indicate that ScreenAudit achieves an average coverage of 69.2%, compared to only 31.3% with a widely-used accessibility checker. Expert feedback indicated that ScreenAudit delivered higher-quality feedback and addressed more aspects of screen reader accessibility compared to existing checkers, and that ScreenAudit would benefit app developers in real-world settings.

* CHI 2025

Via

Access Paper or Ask Questions

Fuzzy Information Entropy and Region Biased Matrix Factorization for Web Service QoS Prediction

Jan 07, 2025

Guoxing Tang, Yugen Du, Xia Chen, Yingwei Luo, Benchi Ma

Figure 1 for Fuzzy Information Entropy and Region Biased Matrix Factorization for Web Service QoS Prediction

Figure 2 for Fuzzy Information Entropy and Region Biased Matrix Factorization for Web Service QoS Prediction

Figure 3 for Fuzzy Information Entropy and Region Biased Matrix Factorization for Web Service QoS Prediction

Abstract:Nowadays, there are many similar services available on the internet, making Quality of Service (QoS) a key concern for users. Since collecting QoS values for all services through user invocations is impractical, predicting QoS values is a more feasible approach. Matrix factorization is considered an effective prediction method. However, most existing matrix factorization algorithms focus on capturing global similarities between users and services, overlooking the local similarities between users and their similar neighbors, as well as the non-interactive effects between users and services. This paper proposes a matrix factorization approach based on user information entropy and region bias, which utilizes a similarity measurement method based on fuzzy information entropy to identify similar neighbors of users. Simultaneously, it integrates the region bias between each user and service linearly into matrix factorization to capture the non-interactive features between users and services. This method demonstrates improved predictive performance in more realistic and complex network environments. Additionally, numerous experiments are conducted on real-world QoS datasets. The experimental results show that the proposed method outperforms some of the state-of-the-art methods in the field at matrix densities ranging from 5% to 20%.

Via

Access Paper or Ask Questions

RAHN: A Reputation Based Hourglass Network for Web Service QoS Prediction

Jan 06, 2025

Xia Chen, Yugen Du, Guoxing Tang, Yingwei Luo, Benchi Ma

Abstract:As the homogenization of Web services becomes more and more common, the difficulty of service recommendation is gradually increasing. How to predict Quality of Service (QoS) more efficiently and accurately becomes an important challenge for service recommendation. Considering the excellent role of reputation and deep learning (DL) techniques in the field of QoS prediction, we propose a reputation and DL based QoS prediction network, RAHN, which contains the Reputation Calculation Module (RCM), the Latent Feature Extraction Module (LFEM), and the QoS Prediction Hourglass Network (QPHN). RCM obtains the user reputation and the service reputation by using a clustering algorithm and a Logit model. LFEM extracts latent features from known information to form an initial latent feature vector. QPHN aggregates latent feature vectors with different scales by using Attention Mechanism, and can be stacked multiple times to obtain the final latent feature vector for prediction. We evaluate RAHN on a real QoS dataset. The experimental results show that the Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) of RAHN are smaller than the six baseline methods.

* 4 pages,3 figures

Via

Access Paper or Ask Questions

Using causal inference to avoid fallouts in data-driven parametric analysis: a case study in the architecture, engineering, and construction industry

Sep 11, 2023

Xia Chen, Ruiji Sun, Ueli Saluz, Stefano Schiavon, Philipp Geyer

Abstract:The decision-making process in real-world implementations has been affected by a growing reliance on data-driven models. We investigated the synergetic pattern between the data-driven methods, empirical domain knowledge, and first-principles simulations. We showed the potential risk of biased results when using data-driven models without causal analysis. Using a case study assessing the implication of several design solutions on the energy consumption of a building, we proved the necessity of causal analysis during the data-driven modeling process. We concluded that: (a) Data-driven models' accuracy assessment or domain knowledge screening may not rule out biased and spurious results; (b) Data-driven models' feature selection should involve careful consideration of causal relationships, especially colliders; (c) Causal analysis results can be used as an aid to first-principles simulation design and parameter checking to avoid cognitive biases. We proved the benefits of causal analysis when applied to data-driven models in building engineering.

* 16 pages,6 figures

Via

Access Paper or Ask Questions

Pathway toward prior knowledge-integrated machine learning in engineering

Jul 10, 2023

Xia Chen, Philipp Geyer

Abstract:Despite the digitalization trend and data volume surge, first-principles models (also known as logic-driven, physics-based, rule-based, or knowledge-based models) and data-driven approaches have existed in parallel, mirroring the ongoing AI debate on symbolism versus connectionism. Research for process development to integrate both sides to transfer and utilize domain knowledge in the data-driven process is rare. This study emphasizes efforts and prevailing trends to integrate multidisciplinary domain professions into machine acknowledgeable, data-driven processes in a two-fold organization: examining information uncertainty sources in knowledge representation and exploring knowledge decomposition with a three-tier knowledge-integrated machine learning paradigm. This approach balances holist and reductionist perspectives in the engineering domain.

* 8 pages, 4 figures

Via

Access Paper or Ask Questions

Application of attention-based Siamese composite neural network in medical image recognition

Apr 20, 2023

Zihao Huang, Xia Chen, Yue Wang, Weixing Xin, Xingtong Lin, Huizhen Li, Haowen Chen, Yizhen Lao

Abstract:Medical image recognition often faces the problem of insufficient data in practical applications. Image recognition and processing under few-shot conditions will produce overfitting, low recognition accuracy, low reliability and insufficient robustness. It is often the case that the difference of characteristics is subtle, and the recognition is affected by perspectives, background, occlusion and other factors, which increases the difficulty of recognition. Furthermore, in fine-grained images, the few-shot problem leads to insufficient useful feature information in the images. Considering the characteristics of few-shot and fine-grained image recognition, this study has established a recognition model based on attention and Siamese neural network. Aiming at the problem of few-shot samples, a Siamese neural network suitable for classification model is proposed. The Attention-Based neural network is used as the main network to improve the classification effect. Covid- 19 lung samples have been selected for testing the model. The results show that the less the number of image samples are, the more obvious the advantage shows than the ordinary neural network.

* This preprint is currently under consideration at Pattern Recognition Letters

Via

Access Paper or Ask Questions

A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array

Apr 19, 2022

Xia Chen, Guanlan Zhang, Michael Yu Wang, Hongyu Yu

Figure 1 for A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array

Figure 2 for A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array

Figure 3 for A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array

Figure 4 for A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array

Abstract:Vision-based tactile sensors have been widely studied in the robotics field for high spatial resolution and compatibility with machine learning algorithms. However, the currently employed sensor's imaging system is bulky limiting its further application. Here we present a micro lens array (MLA) based vison system to achieve a low thickness format of the sensor package with high tactile sensing performance. Multiple micromachined micro lens units cover the whole elastic touching layer and provide a stitched clear tactile image, enabling high spatial resolution with a thin thickness of 5 mm. The thermal reflow and soft lithography method ensure the uniform spherical profile and smooth surface of micro lens. Both optical and mechanical characterization demonstrated the sensor's stable imaging and excellent tactile sensing, enabling precise 3D tactile information, such as displacement mapping and force distribution with an ultra compact-thin structure.

* 7 pages, 5 figures

Via

Access Paper or Ask Questions

Explainable AI for engineering design: A unified approach of systems engineering and component-based deep learning

Aug 30, 2021

Philipp Geyer, Manav Mahan Singh, Xia Chen

Figure 1 for Explainable AI for engineering design: A unified approach of systems engineering and component-based deep learning

Figure 2 for Explainable AI for engineering design: A unified approach of systems engineering and component-based deep learning

Figure 3 for Explainable AI for engineering design: A unified approach of systems engineering and component-based deep learning

Figure 4 for Explainable AI for engineering design: A unified approach of systems engineering and component-based deep learning

Abstract:Data-driven models created by machine learning gain in importance in all fields of design and engineering. They have high potential to assists decision-makers in creating novel artefacts with a better performance and sustainability. However, limited generalization and the black-box nature of these models induce limited explainability and reusability. These drawbacks provide significant barriers retarding adoption in engineering design. To overcome this situation, we propose a component-based approach to create partial component models by machine learning (ML). This component-based approach aligns deep learning to systems engineering (SE). By means of the example of energy efficient building design, we first demonstrate generalization of the component-based method by accurately predicting the performance of designs with random structure different from training data. Second, we illustrate explainability by local sampling, sensitivity information and rules derived from low-depth decision trees and by evaluating this information from an engineering design perspective. The key for explainability is that activations at interfaces between the components are interpretable engineering quantities. In this way, the hierarchical component system forms a deep neural network (DNN) that directly integrates information for engineering explainability. The large range of possible configurations in composing components allows the examination of novel unseen design cases with understandable data-driven models. The matching of parameter ranges of components by similar probability distribution produces reusable, well-generalizing, and trustworthy models. The approach adapts the model structure to engineering methods of systems engineering and domain knowledge.

* 19 pages

Via

Access Paper or Ask Questions

PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection

Dec 17, 2020

Xia Chen, Jianren Wang, David Held, Martial Hebert

Figure 1 for PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection

Figure 2 for PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection

Figure 3 for PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection

Figure 4 for PanoNet3D: Combining Semantic and Geometric Understanding for LiDARPoint Cloud Detection

Abstract:Visual data in autonomous driving perception, such as camera image and LiDAR point cloud, can be interpreted as a mixture of two aspects: semantic feature and geometric structure. Semantics come from the appearance and context of objects to the sensor, while geometric structure is the actual 3D shape of point clouds. Most detectors on LiDAR point clouds focus only on analyzing the geometric structure of objects in real 3D space. Unlike previous works, we propose to learn both semantic feature and geometric structure via a unified multi-view framework. Our method exploits the nature of LiDAR scans -- 2D range images, and applies well-studied 2D convolutions to extract semantic features. By fusing semantic and geometric features, our method outperforms state-of-the-art approaches in all categories by a large margin. The methodology of combining semantic and geometric features provides a unique perspective of looking at the problems in real-world 3D point cloud detection.

* 3DV2020

Via

Access Paper or Ask Questions

PanoNet: Real-time Panoptic Segmentation through Position-Sensitive Feature Embedding

Aug 01, 2020

Xia Chen, Jianren Wang, Martial Hebert

Figure 1 for PanoNet: Real-time Panoptic Segmentation through Position-Sensitive Feature Embedding

Figure 2 for PanoNet: Real-time Panoptic Segmentation through Position-Sensitive Feature Embedding

Figure 3 for PanoNet: Real-time Panoptic Segmentation through Position-Sensitive Feature Embedding

Figure 4 for PanoNet: Real-time Panoptic Segmentation through Position-Sensitive Feature Embedding

Abstract:We propose a simple, fast, and flexible framework to generate simultaneously semantic and instance masks for panoptic segmentation. Our method, called PanoNet, incorporates a clean and natural structure design that tackles the problem purely as a segmentation task without the time-consuming detection process. We also introduce position-sensitive embedding for instance grouping by accounting for both object's appearance and its spatial location. Overall, PanoNet yields high panoptic quality results of high-resolution Cityscapes images in real-time, significantly faster than all other methods with comparable performance. Our approach well satisfies the practical speed and memory requirement for many applications like autonomous driving and augmented reality.

Via

Access Paper or Ask Questions