Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yanan Liu

Simultaneous Polysomnography and Cardiotocography Reveal Temporal Correlation Between Maternal Obstructive Sleep Apnea and Fetal Hypoxia

Apr 17, 2025

Jingyu Wang, Donglin Xie, Jingying Ma, Yunliang Sun, Linyan Zhang, Rui Bai, Zelin Tu, Liyue Xu, Jun Wei, Jingjing Yang(+10 more)

Abstract:Background: Obstructive sleep apnea syndrome (OSAS) during pregnancy is common and can negatively affect fetal outcomes. However, studies on the immediate effects of maternal hypoxia on fetal heart rate (FHR) changes are lacking. Methods: We used time-synchronized polysomnography (PSG) and cardiotocography (CTG) data from two cohorts to analyze the correlation between maternal hypoxia and FHR changes (accelerations or decelerations). Maternal hypoxic event characteristics were analyzed using generalized linear modeling (GLM) to assess their associations with different FHR changes. Results: A total of 118 pregnant women participated. FHR changes were significantly associated with maternal hypoxia, primarily characterized by accelerations. A longer hypoxic duration correlated with more significant FHR accelerations (P < 0.05), while prolonged hypoxia and greater SpO2 drop were linked to FHR decelerations (P < 0.05). Both cohorts showed a transient increase in FHR during maternal hypoxia, which returned to baseline after the event resolved. Conclusion: Maternal hypoxia significantly affects FHR, suggesting that maternal OSAS may contribute to fetal hypoxia. These findings highlight the importance of maternal-fetal interactions and provide insights for future interventions.

Via

Access Paper or Ask Questions

Egocentric Hand-object Interaction Detection

Nov 16, 2022

Yao Lu, Yanan Liu

Abstract:In this paper, we propose a method to jointly determine the status of hand-object interaction. This is crucial for egocentric human activity understanding and interaction. From a computer vision perspective, we believe that determining whether a hand is interacting with an object depends on whether there is an interactive hand pose and whether the hand is touching the object. Thus, we extract the hand pose, hand-object masks to jointly determine the interaction status. In order to solve the problem of hand pose estimation due to in-hand object occlusion, we use a multi-cam system to capture hand pose data from multiple perspectives. We evaluate and compare our method with the most recent work from Shan et al. \cite{Shan20} on selected images from EPIC-KITCHENS \cite{damen2018scaling} dataset and achieve $89\%$ accuracy on HOI (hand-object interaction) detection which is comparative to Shan's ($92\%$). However, for real-time performance, our method can run over $\textbf{30}$ FPS which is much more efficient than Shan's ($\textbf{1}\sim\textbf{2}$ FPS). A demo can be found from https://www.youtube.com/watch?v=XVj3zBuynmQ

* arXiv admin note: substantial text overlap with arXiv:2109.14734

Via

Access Paper or Ask Questions

On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

Feb 02, 2022

Yanan Liu, Laurie Bose, Yao Lu, Piotr Dudek, Walterio Mayol-Cuevas

Figure 1 for On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

Figure 2 for On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

Figure 3 for On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

Figure 4 for On-Sensor Binarized Fully Convolutional Neural Network with A Pixel Processor Array

Abstract:This work presents a method to implement fully convolutional neural networks (FCNs) on Pixel Processor Array (PPA) sensors, and demonstrates coarse segmentation and object localisation tasks. We design and train binarized FCN for both binary weights and activations using batchnorm, group convolution, and learnable threshold for binarization, producing networks small enough to be embedded on the focal plane of the PPA, with limited local memory resources, and using parallel elementary add/subtract, shifting, and bit operations only. We demonstrate the first implementation of an FCN on a PPA device, performing three convolution layers entirely in the pixel-level processors. We use this architecture to demonstrate inference generating heat maps for object segmentation and localisation at over 280 FPS using the SCAMP-5 PPA vision chip.

Via

Access Paper or Ask Questions

Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Dec 24, 2021

Jiahe Fan, Mohammud J. Bocus, Brett Hosking, Rigen Wu, Yanan Liu, Sergey Vityazev, Rui Fan

Figure 1 for Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Figure 2 for Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Figure 3 for Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Figure 4 for Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

Abstract:This paper presents a novel pothole detection approach based on single-modal semantic segmentation. It first extracts visual features from input images using a convolutional neural network. A channel attention module then reweighs the channel features to enhance the consistency of different feature maps. Subsequently, we employ an atrous spatial pyramid pooling module (comprising of atrous convolutions in series, with progressive rates of dilation) to integrate the spatial context information. This helps better distinguish between potholes and undamaged road areas. Finally, the feature maps in the adjacent layers are fused using our proposed multi-scale feature fusion module. This further reduces the semantic gap between different feature channel layers. Extensive experiments were carried out on the Pothole-600 dataset to demonstrate the effectiveness of our proposed method. The quantitative comparisons suggest that our method achieves the state-of-the-art (SoTA) performance on both RGB images and transformed disparity images, outperforming three SoTA single-modal semantic segmentation networks.

* 2021 IEEE International Conference on Autonomous Systems (ICAS)

Via

Access Paper or Ask Questions

Fully-simulated Integration of Scamp5d Vision System and Robot Simulator

Oct 12, 2021

Wen Fan, Yanan Liu, Yifan Xing

Figure 1 for Fully-simulated Integration of Scamp5d Vision System and Robot Simulator

Figure 2 for Fully-simulated Integration of Scamp5d Vision System and Robot Simulator

Figure 3 for Fully-simulated Integration of Scamp5d Vision System and Robot Simulator

Figure 4 for Fully-simulated Integration of Scamp5d Vision System and Robot Simulator

Abstract:This paper proposed a fully-simulated environment by integrating an on-sensor visual computing device, SCAMP, and CoppeliaSim robot simulator via interface and remote API. Within this platform, a mobile robot obstacle avoidance and target navigation with pre-set barriers is exploited with on-sensor visual computing, where images are captured in a robot simulator and processed by an on-sensor processing server after being transferred. We made our developed platform and associated algorithms for mobile robot navigation available online.

* 6 pages, 13 figures

Via

Access Paper or Ask Questions

Bringing A Robot Simulator to the SCAMP Vision System

May 21, 2021

Yanan Liu, Jianing Chen, Laurie Bose, Piotr Dudek, Walterio Mayol-Cuevas

Figure 1 for Bringing A Robot Simulator to the SCAMP Vision System

Figure 2 for Bringing A Robot Simulator to the SCAMP Vision System

Figure 3 for Bringing A Robot Simulator to the SCAMP Vision System

Figure 4 for Bringing A Robot Simulator to the SCAMP Vision System

Abstract:This work develops and demonstrates the integration of the SCAMP-5d vision system into the CoppeliaSim robot simulator, creating a semi-simulated environment. By configuring a camera in the simulator and setting up communication with the SCAMP python host through remote API, sensor images from the simulator can be transferred to the SCAMP vision sensor, where on-sensor image processing such as CNN inference can be performed. SCAMP output is then fed back into CoppeliaSim. This proposed platform integration enables rapid prototyping validations of SCAMP algorithms for robotic systems. We demonstrate a car localisation and tracking task using this proposed semi-simulated platform, with a CNN inference on SCAMP to command the motion of a robot. We made this platform available online.

Via

Access Paper or Ask Questions

Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping

Sep 29, 2020

M Usman Maqbool Bhutta, Manohar Kuse, Rui Fan, Yanan Liu, Ming Liu

Figure 1 for Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping

Figure 2 for Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping

Figure 3 for Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping

Figure 4 for Loop-box: Multi-Agent Direct SLAM Triggered by Single Loop Closure for Large-Scale Mapping

Abstract:In this paper, we present a multi-agent framework for real-time large-scale 3D reconstruction applications. In SLAM, researchers usually build and update a 3D map after applying non-linear pose graph optimization techniques. Moreover, many multi-agent systems are prevalently using odometry information from additional sensors. These methods generally involve intensive computer vision algorithms and are tightly coupled with various sensors. We develop a generic method for the keychallenging scenarios in multi-agent 3D mapping based on different camera systems. The proposed framework performs actively in terms of localizing each agent after the first loop closure between them. It is shown that the proposed system only uses monocular cameras to yield real-time multi-agent large-scale localization and 3D global mapping. Based on the initial matching, our system can calculate the optimal scale difference between multiple 3D maps and then estimate an accurate relative pose transformation for large-scale global mapping.

* IEEE Transactions on Cybernetics, 2020
* Material related to this work is available at https://usmanmaqbool.github.io/loop-box

Via

Access Paper or Ask Questions

Agile Reactive Navigation for A Non-Holonomic Mobile Robot Using A Pixel Processor Array

Sep 27, 2020

Yanan Liu, Laurie Bose, Colin Greatwood, Jianing Chen, Rui Fan, Thomas Richardson, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas

Figure 1 for Agile Reactive Navigation for A Non-Holonomic Mobile Robot Using A Pixel Processor Array

Figure 2 for Agile Reactive Navigation for A Non-Holonomic Mobile Robot Using A Pixel Processor Array

Figure 3 for Agile Reactive Navigation for A Non-Holonomic Mobile Robot Using A Pixel Processor Array

Figure 4 for Agile Reactive Navigation for A Non-Holonomic Mobile Robot Using A Pixel Processor Array

Abstract:This paper presents an agile reactive navigation strategy for driving a non-holonomic ground vehicle around a preset course of gates in a cluttered environment using a low-cost processor array sensor. This enables machine vision tasks to be performed directly upon the sensor's image plane, rather than using a separate general-purpose computer. We demonstrate a small ground vehicle running through or avoiding multiple gates at high speed using minimal computational resources. To achieve this, target tracking algorithms are developed for the Pixel Processing Array and captured images are then processed directly on the vision sensor acquiring target information for controlling the ground vehicle. The algorithm can run at up to 2000 fps outdoors and 200fps at indoor illumination levels. Conducting image processing at the sensor level avoids the bottleneck of image transfer encountered in conventional sensors. The real-time performance of on-board image processing and robustness is validated through experiments. Experimental results demonstrate that the algorithm's ability to enable a ground vehicle to navigate at an average speed of 2.20 m/s for passing through multiple gates and 3.88 m/s for a 'slalom' task in an environment featuring significant visual clutter.

* 7 pages

Via

Access Paper or Ask Questions

Representation Learning for Classical Planning from Partially Observed Traces

Jul 19, 2019

Zhanhao Xiao, Hai Wan, Hankui Hankz Zhuo, Jinxia Lin, Yanan Liu

Figure 1 for Representation Learning for Classical Planning from Partially Observed Traces

Figure 2 for Representation Learning for Classical Planning from Partially Observed Traces

Figure 3 for Representation Learning for Classical Planning from Partially Observed Traces

Figure 4 for Representation Learning for Classical Planning from Partially Observed Traces

Abstract:Specifying a complete domain model is time-consuming, which has been a bottleneck of AI planning technique application in many real-world scenarios. Most classical domain-model learning approaches output a domain model in the form of the declarative planning language, such as STRIPS or PDDL, and solve new planning instances by invoking an existing planner. However, planning in such a representation is sensitive to the accuracy of the learned domain model which probably cannot be used to solve real planning problems. In this paper, to represent domain models in a vectorization representation way, we propose a novel framework based on graph neural network (GNN) integrating model-free learning and model-based planning, called LP-GNN. By embedding propositions and actions in a graph, the latent relationship between them is explored to form a domain-specific heuristics. We evaluate our approach on five classical planning domains, comparing with the classical domain-model learner ARMS. The experimental results show that the domain models learned by our approach are much more effective on solving real planning problems.

* 11 pages, 6 figures

Via

Access Paper or Ask Questions

Mobile Robot Localisation and Navigation Using LEGO NXT and Ultrasonic Sensor

Oct 20, 2018

Yanan Liu, Rui Fan, Bin Yu, M. Junaid Bocus, Ming Liu, Hepeng Ni, Jiahe Fan, Shixin Mao

Figure 1 for Mobile Robot Localisation and Navigation Using LEGO NXT and Ultrasonic Sensor

Figure 2 for Mobile Robot Localisation and Navigation Using LEGO NXT and Ultrasonic Sensor

Figure 3 for Mobile Robot Localisation and Navigation Using LEGO NXT and Ultrasonic Sensor

Figure 4 for Mobile Robot Localisation and Navigation Using LEGO NXT and Ultrasonic Sensor

Abstract:Mobile robots are becoming increasingly important both for individuals and industries. Mobile robotic technology is not only utilised by experts in this field but is also very popular among amateurs. However, implementing a mobile robot to perform tasks autonomously can be expensive because of the need for various types of sensors and the high price of robot platforms. Hence, in this paper we present a mobile robot localisation and navigation system which uses a LEGO ultrasonic sensor in an indoor map based on the LEGO MINDSTORM NXT. This provides an affordable and ready-to-use option for most robot fans. In this paper, an effective method is proposed to extract useful information from the distorted readings collected by the ultrasonic sensor. Then, the particle filter is used to localise the robot. After robot's position is estimated, a sampling-based path planning method is proposed for the robot navigation. This method reduces the robot accumulative motion error by minimising robot turning times and covering distances. The robot localisation and navigation algorithms are implemented in MATLAB. Simulation results show an average accuracy between 1 and 3 cm for three different indoor map locations. Furthermore, experiments performed in a real setup show the effectiveness of the proposed methods.

* 6 pages, 9 figures

Via

Access Paper or Ask Questions