Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qiwei Han

C-PATH: Conversational Patient Assistance and Triage in Healthcare System

Jun 07, 2025

Qi Shi, Qiwei Han, Cláudia Soares

Abstract:Navigating healthcare systems can be complex and overwhelming, creating barriers for patients seeking timely and appropriate medical attention. In this paper, we introduce C-PATH (Conversational Patient Assistance and Triage in Healthcare), a novel conversational AI system powered by large language models (LLMs) designed to assist patients in recognizing symptoms and recommending appropriate medical departments through natural, multi-turn dialogues. C-PATH is fine-tuned on medical knowledge, dialogue data, and clinical summaries using a multi-stage pipeline built on the LLaMA3 architecture. A core contribution of this work is a GPT-based data augmentation framework that transforms structured clinical knowledge from DDXPlus into lay-person-friendly conversations, allowing alignment with patient communication norms. We also implement a scalable conversation history management strategy to ensure long-range coherence. Evaluation with GPTScore demonstrates strong performance across dimensions such as clarity, informativeness, and recommendation accuracy. Quantitative benchmarks show that C-PATH achieves superior performance in GPT-rewritten conversational datasets, significantly outperforming domain-specific baselines. C-PATH represents a step forward in the development of user-centric, accessible, and accurate AI tools for digital health assistance and triage.

* Accepted in IEEE ICDH 2025, 10 pages, 8 figures, 5 tables

Via

Access Paper or Ask Questions

Optimizing Luxury Vehicle Dealership Networks: A Graph Neural Network Approach to Site Selection

Aug 25, 2024

Luca Silvano Carocci, Qiwei Han

Abstract:This study presents a novel application of Graph Neural Networks (GNNs) to optimize dealership network planning for a luxury car manufacturer in the U.S. By conducting a comprehensive literature review on dealership location determinants, the study identifies 65 county-level explanatory variables, augmented by two additional measures of regional interconnectedness derived from social and mobility data. An ablation study involving 34 variable combinations and ten state-of-the-art GNN operators reveals key insights into the predictive power of various variables, particularly highlighting the significance of competition, demographic factors, and mobility patterns in influencing dealership location decisions. The analysis pinpoints seven specific counties as promising targets for network expansion. This research not only illustrates the effectiveness of GNNs in solving complex geospatial decision-making problems but also provides actionable recommendations and valuable methodological insights for industry practitioners.

* 10 pages, 4 figures, 6 tables

Via

Access Paper or Ask Questions

AiGen-FoodReview: A Multimodal Dataset of Machine-Generated Restaurant Reviews and Images on Social Media

Jan 16, 2024

Alessandro Gambetti, Qiwei Han

Abstract:Online reviews in the form of user-generated content (UGC) significantly impact consumer decision-making. However, the pervasive issue of not only human fake content but also machine-generated content challenges UGC's reliability. Recent advances in Large Language Models (LLMs) may pave the way to fabricate indistinguishable fake generated content at a much lower cost. Leveraging OpenAI's GPT-4-Turbo and DALL-E-2 models, we craft AiGen-FoodReview, a multi-modal dataset of 20,144 restaurant review-image pairs divided into authentic and machine-generated. We explore unimodal and multimodal detection models, achieving 99.80% multimodal accuracy with FLAVA. We use attributes from readability and photographic theories to score reviews and images, respectively, demonstrating their utility as hand-crafted features in scalable and interpretable detection models, with comparable performance. The paper contributes by open-sourcing the dataset and releasing fake review detectors, recommending its use in unimodal and multimodal fake review detection tasks, and evaluating linguistic and visual features in synthetic versus authentic data.

Via

Access Paper or Ask Questions

Dissecting Medical Referral Mechanisms in Health Services: Role of Physician Professional Networks

Dec 04, 2023

Regina de Brito Duarte, Qiwei Han, Claudia Soares

Figure 1 for Dissecting Medical Referral Mechanisms in Health Services: Role of Physician Professional Networks

Figure 2 for Dissecting Medical Referral Mechanisms in Health Services: Role of Physician Professional Networks

Figure 3 for Dissecting Medical Referral Mechanisms in Health Services: Role of Physician Professional Networks

Figure 4 for Dissecting Medical Referral Mechanisms in Health Services: Role of Physician Professional Networks

Abstract:Medical referrals between primary care physicians (PC) and specialist care (SC) physicians profoundly impact patient care regarding quality, satisfaction, and cost. This paper investigates the influence of professional networks among medical doctors on referring patients from PC to SC. Using five-year consultation data from a Portuguese private health provider, we conducted exploratory data analysis and constructed both professional and referral networks among physicians. We then apply Graph Neural Network (GNN) models to learn latent representations of the referral network. Our analysis supports the hypothesis that doctors' professional social connections can predict medical referrals, potentially enhancing collaboration within organizations and improving healthcare services. This research contributes to dissecting the underlying mechanisms in primary-specialty referrals, thereby providing valuable insights for enhancing patient care and effective healthcare management.

* 27 pages, 9 figures, 2 tables

Via

Access Paper or Ask Questions

Extreme Multilabel Classification for Specialist Doctor Recommendation with Implicit Feedback and Limited Patient Metadata

Aug 21, 2023

Filipa Valdeira, Stevo Racković, Valeria Danalachi, Qiwei Han, Cláudia Soares

Abstract:Recommendation Systems (RS) are often used to address the issue of medical doctor referrals. However, these systems require access to patient feedback and medical records, which may not always be available in real-world scenarios. Our research focuses on medical referrals and aims to predict recommendations in different specialties of physicians for both new patients and those with a consultation history. We use Extreme Multilabel Classification (XML), commonly employed in text-based classification tasks, to encode available features and explore different scenarios. While its potential for recommendation tasks has often been suggested, this has not been thoroughly explored in the literature. Motivated by the doctor referral case, we show how to recast a traditional recommender setting into a multilabel classification problem that current XML methods can solve. Further, we propose a unified model leveraging patient history across different specialties. Compared to state-of-the-art RS using the same features, our approach consistently improves standard recommendation metrics up to approximately $10\%$ for patients with a previous consultation history. For new patients, XML proves better at exploiting available features, outperforming the benchmark in favorable scenarios, with particular emphasis on recall metrics. Thus, our approach brings us one step closer to creating more effective and personalized doctor referral systems. Additionally, it highlights XML as a promising alternative to current hybrid or content-based RS, while identifying key aspects to take into account when using XML for recommendation tasks.

Via

Access Paper or Ask Questions

Beyond Rankings: Exploring the Impact of SERP Features on Organic Click-through Rates

May 31, 2023

Erik Fubel, Niclas Michael Groll, Patrick Gundlach, Qiwei Han, Maximilian Kaiser

Abstract:Search Engine Result Pages (SERPs) serve as the digital gateways to the vast expanse of the internet. Past decades have witnessed a surge in research primarily centered on the influence of website ranking on these pages, to determine the click-through rate (CTR). However, during this period, the landscape of SERPs has undergone a dramatic evolution: SERP features, encompassing elements such as knowledge panels, media galleries, FAQs, and more, have emerged as an increasingly prominent facet of these result pages. Our study examines the crucial role of these features, revealing them to be not merely aesthetic components, but strongly influence CTR and the associated behavior of internet users. We demonstrate how these features can significantly modulate web traffic, either amplifying or attenuating it. We dissect these intricate interaction effects leveraging a unique dataset of 67,000 keywords and their respective Google SERPs, spanning over 40 distinct US-based e-commerce domains, generating over 6 million clicks from 24 million views. This cross-website dataset, unprecedented in its scope, enables us to assess the impact of 24 different SERP features on organic CTR. Through an ablation study modeling CTR, we illustrate the incremental predictive power these features hold.

* submitted IEEE DSAA conference, 14 pages, 5 figures, 2 tables

Via

Access Paper or Ask Questions

Interpretable Deep Learning for Forecasting Online Advertising Costs: Insights from the Competitive Bidding Landscape

Feb 11, 2023

Fynn Oldenburg, Qiwei Han, Maximilian Kaiser

Abstract:As advertisers increasingly shift their budgets toward digital advertising, forecasting advertising costs is essential for making budget plans to optimize marketing campaign returns. In this paper, we perform a comprehensive study using a variety of time-series forecasting methods to predict daily average cost-per-click (CPC) in the online advertising market. We show that forecasting advertising costs would benefit from multivariate models using covariates from competitors' CPC development identified through time-series clustering. We further interpret the results by analyzing feature importance and temporal attention. Finally, we show that our approach has several advantages over models that individual advertisers might build based solely on their collected data.

* Acceptd at AAAI 2023 Web for Advertising Workshop, 12 pages, 8 figures, 4 tables

Via

Access Paper or Ask Questions

Combat AI With AI: Counteract Machine-Generated Fake Restaurant Reviews on Social Media

Feb 10, 2023

Alessandro Gambetti, Qiwei Han

Abstract:Recent advances in generative models such as GPT may be used to fabricate indistinguishable fake customer reviews at a much lower cost, thus posing challenges for social media platforms to detect these machine-generated fake reviews. We propose to leverage the high-quality elite restaurant reviews verified by Yelp to generate fake reviews from the OpenAI GPT review creator and ultimately fine-tune a GPT output detector to predict fake reviews that significantly outperforms existing solutions. We further apply the model to predict non-elite reviews and identify the patterns across several dimensions, such as review, user and restaurant characteristics, and writing style. We show that social media platforms are continuously challenged by machine-generated fake reviews, although they may implement detection systems to filter out suspicious reviews.

* Paper submitted to KDD2023. 8 pages, 5 figures

Via

Access Paper or Ask Questions

Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Nov 16, 2021

Max Würfel, Qiwei Han, Maximilian Kaiser

Figure 1 for Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Figure 2 for Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Figure 3 for Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Figure 4 for Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Abstract:Online advertising revenues account for an increasing share of publishers' revenue streams, especially for small and medium-sized publishers who depend on the advertisement networks of tech companies such as Google and Facebook. Thus publishers may benefit significantly from accurate online advertising revenue forecasts to better manage their website monetization strategies. However, publishers who only have access to their own revenue data lack a holistic view of the total ad market of publishers, which in turn limits their ability to generate insights into their own future online advertising revenues. To address this business issue, we leverage a proprietary database encompassing Google Adsense revenues from a large collection of publishers in diverse areas. We adopt the Temporal Fusion Transformer (TFT) model, a novel attention-based architecture to predict publishers' advertising revenues. We leverage multiple covariates, including not only the publisher's own characteristics but also other publishers' advertising revenues. Our prediction results outperform several benchmark deep-learning time-series forecast models over multiple time horizons. Moreover, we interpret the results by analyzing variable importance weights to identify significant features and self-attention weights to reveal persistent temporal patterns.

* 2021 IEEE International Conference on Big Data (Big Data), 10 pages, 3 figures and 1 table

Via

Access Paper or Ask Questions

Incorporating Domain Knowledge into Health Recommender Systems using Hyperbolic Embeddings

Jun 14, 2021

Joel Peito, Qiwei Han

Figure 1 for Incorporating Domain Knowledge into Health Recommender Systems using Hyperbolic Embeddings

Figure 2 for Incorporating Domain Knowledge into Health Recommender Systems using Hyperbolic Embeddings

Figure 3 for Incorporating Domain Knowledge into Health Recommender Systems using Hyperbolic Embeddings

Abstract:In contrast to many other domains, recommender systems in health services may benefit particularly from the incorporation of health domain knowledge, as it helps to provide meaningful and personalised recommendations catering to the individual's health needs. With recent advances in representation learning enabling the hierarchical embedding of health knowledge into the hyperbolic Poincare space, this work proposes a content-based recommender system for patient-doctor matchmaking in primary care based on patients' health profiles, enriched by pre-trained Poincare embeddings of the ICD-9 codes through transfer learning. The proposed model outperforms its conventional counterpart in terms of recommendation accuracy and has several important business implications for improving the patient-doctor relationship.

* 12 pages, 3 figures, accepted at the 2020 International Conference on Complex Networks and Their Applications

Via

Access Paper or Ask Questions