Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tiange Wang

Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

Aug 16, 2021

Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu(+7 more)

Figure 1 for Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

Figure 2 for Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

Figure 3 for Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

Figure 4 for Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

Abstract:Diabetes prediction is an important data science application in the social healthcare domain. There exist two main challenges in the diabetes prediction task: data heterogeneity since demographic and metabolic data are of different types, data insufficiency since the number of diabetes cases in a single medical center is usually limited. To tackle the above challenges, we employ gradient boosting decision trees (GBDT) to handle data heterogeneity and introduce multi-task learning (MTL) to solve data insufficiency. To this end, Task-wise Split Gradient Boosting Trees (TSGB) is proposed for the multi-center diabetes prediction task. Specifically, we firstly introduce task gain to evaluate each task separately during tree construction, with a theoretical analysis of GBDT's learning objective. Secondly, we reveal a problem when directly applying GBDT in MTL, i.e., the negative task gain problem. Finally, we propose a novel split method for GBDT in MTL based on the task gain statistics, named task-wise split, as an alternative to standard feature-wise split to overcome the mentioned negative task gain problem. Extensive experiments on a large-scale real-world diabetes dataset and a commonly used benchmark dataset demonstrate TSGB achieves superior performance against several state-of-the-art methods. Detailed case studies further support our analysis of negative task gain problems and provide insightful findings. The proposed TSGB method has been deployed as an online diabetes risk assessment software for early diagnosis.

* 11 pages (2 pages of supplementary), 10 figures, 7 tables. Accepted by ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

Via

Access Paper or Ask Questions

PSTN: Periodic Spatial-temporal Deep Neural Network for Traffic Condition Prediction

Aug 05, 2021

Tiange Wang, Zijun Zhang, Kwok-Leung Tsui

Figure 1 for PSTN: Periodic Spatial-temporal Deep Neural Network for Traffic Condition Prediction

Figure 2 for PSTN: Periodic Spatial-temporal Deep Neural Network for Traffic Condition Prediction

Figure 3 for PSTN: Periodic Spatial-temporal Deep Neural Network for Traffic Condition Prediction

Figure 4 for PSTN: Periodic Spatial-temporal Deep Neural Network for Traffic Condition Prediction

Abstract:Accurate forecasting of traffic conditions is critical for improving safety, stability, and efficiency of a city transportation system. In reality, it is challenging to produce accurate traffic forecasts due to the complex and dynamic spatiotemporal correlations. Most existing works only consider partial characteristics and features of traffic data, and result in unsatisfactory performances on modeling and forecasting. In this paper, we propose a periodic spatial-temporal deep neural network (PSTN) with three pivotal modules to improve the forecasting performance of traffic conditions through a novel integration of three types of information. First, the historical traffic information is folded and fed into a module consisting of a graph convolutional network and a temporal convolutional network. Second, the recent traffic information together with the historical output passes through the second module consisting of a graph convolutional network and a gated recurrent unit framework. Finally, a multi-layer perceptron is applied to process the auxiliary road attributes and output the final predictions. Experimental results on two publicly accessible real-world urban traffic data sets show that the proposed PSTN outperforms the state-of-the-art benchmarks by significant margins for short-term traffic conditions forecasting

Via

Access Paper or Ask Questions

Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Aug 05, 2021

Tiange Wang, Zijun Zhang, Fangfang Yang, Kwok-Leung Tsui

Figure 1 for Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Figure 2 for Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Figure 3 for Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Figure 4 for Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Abstract:Automatic detection of rail track and its fasteners via using continuously collected railway images is important to maintenance as it can significantly improve maintenance efficiency and better ensure system safety. Dominant computer vision-based detection models typically rely on convolutional neural networks that utilize local image features and cumbersome prior settings to generate candidate boxes. In this paper, we propose a deep convolutional transformer network based method to detect multi-class rail components including the rail, clip, and bolt. We effectively synergize advantages of the convolutional structure on extracting latent features from raw images as well as advantages of transformers on selectively determining valuable latent features to achieve an efficient and accurate performance on rail component detections. Our proposed method simplifies the detection pipeline by eliminating the need of prior settings, such as anchor box, aspect ratio, default coordinates, and post-processing, such as the threshold for non-maximum suppression; as well as allows users to trade off the quality and complexity of the detector with limited training data. Results of a comprehensive computational study show that our proposed method outperforms a set of existing state-of-art approaches with large margins

Via

Access Paper or Ask Questions

Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method

Aug 05, 2021

Tiange Wang, Zijun Zhang, Fangfang Yang, Kwok-Leung Tsui

Figure 1 for Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method

Figure 2 for Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method

Figure 3 for Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method

Figure 4 for Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method

Abstract:Automated inspection and detection of foreign objects on railways is important for rail transportation safety as it helps prevent potential accidents and trains derailment. Most existing vision-based approaches focus on the detection of frontal intrusion objects with prior labels, such as categories and locations of the objects. In reality, foreign objects with unknown categories can appear anytime on railway tracks. In this paper, we develop a semi-supervised convolutional autoencoder based framework that only requires railway track images without prior knowledge on the foreign objects in the training process. It consists of three different modules, a bottleneck feature generator as encoder, a photographic image generator as decoder, and a reconstruction discriminator developed via adversarial learning. In the proposed framework, the problem of detecting the presence, location, and shape of foreign objects is addressed by comparing the input and reconstructed images as well as setting thresholds based on reconstruction errors. The proposed method is evaluated through comprehensive studies under different performance criteria. The results show that the proposed method outperforms some well-known benchmarking methods. The proposed framework is useful for data analytics via the train Internet-of-Things (IoT) systems

Via

Access Paper or Ask Questions