Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhuojun Chen

Automatic Extraction of Time-windowed ROS Computation Graphs from ROS Bag Files

May 25, 2023

Zhuojun Chen, Michel Albonico, Ivano Malvolta

Abstract:Robotic systems react to different environmental stimuli, potentially resulting in the dynamic reconfiguration of the software controlling such systems. One effect of such dynamism is the reconfiguration of the software architecture reconfiguration of the system at runtime. Such reconfigurations might severely impact the runtime properties of robotic systems, e.g., in terms of performance and energy efficiency. The ROS \emph{rosbag} package enables developers to record and store timestamped data related to the execution of robotic missions, implicitly containing relevant information about the architecture of the monitored system during its execution. In this study, we discuss about our approach for statically extracting (time-windowed) architectural information from ROS bag files. The proposed approach can support the robotics community in better discussing and reasoning the software architecture (and its runtime reconfigurations) of ROS-based systems. We evaluate our approach against hundreds of ROS bag files systematically mined from 4,434 public GitHub repositories.

* Robot Software Architectures Workshop@ICRA (2023)
* 2 pages, workshop paper

Via

Access Paper or Ask Questions

Pyramid Scale Network for Crowd Counting

Jul 11, 2020

Junhao Cheng, Zhuojun Chen, XinYu Zhang, Yizhou Li, Xiaoyuan Jing

Figure 1 for Pyramid Scale Network for Crowd Counting

Figure 2 for Pyramid Scale Network for Crowd Counting

Figure 3 for Pyramid Scale Network for Crowd Counting

Figure 4 for Pyramid Scale Network for Crowd Counting

Abstract:Crowd counting is a challenging task in computer vision due to serious occlusions, complex background and large scale variations, etc. Multi-column architecture is widely adopted to overcome these challenges, yielding state-of-the-art performance in many public benchmarks. However, there still are two issues in such design: scale limitation and feature similarity. Further performance improvements are thus restricted. In this paper, we propose a novel crowd counting framework called Pyramid Scale Network (PSNet) to explicitly address these issues. Specifically, for scale limitation, we adopt three Pyramid Scale Module (PSM) to efficiently capture multi-scale features, which integrate a message passing mechanism and an attention mechanism into multi-column architecture. Moreover, for feature similarity, a Differential loss is introduced to make the features learned by each column in PSM appropriately different from each other. To the best of our knowledge, PSNet is the first work to explicitly address scale limitation and feature similarity in multi-column design. Extensive experiments on five benchmark datasets demonstrate the effectiveness of the proposed innovations as well as the superior performance over the state-of-the-art. Our code is publicly available at: https://github.com/JunhaoCheng/Pyramid_Scale_Network

Via

Access Paper or Ask Questions

RankPose: Learning Generalised Feature with Rank Supervision for Head Pose Estimation

May 22, 2020

Donggen Dai, Wangkit Wong, Zhuojun Chen

Figure 1 for RankPose: Learning Generalised Feature with Rank Supervision for Head Pose Estimation

Figure 2 for RankPose: Learning Generalised Feature with Rank Supervision for Head Pose Estimation

Figure 3 for RankPose: Learning Generalised Feature with Rank Supervision for Head Pose Estimation

Figure 4 for RankPose: Learning Generalised Feature with Rank Supervision for Head Pose Estimation

Abstract:We address the challenging problem of RGB image-based head pose estimation. We first reformulate head pose representation learning to constrain it to a bounded space. Head pose represented as vector projection or vector angles shows helpful to improving performance. Further, a ranking loss combined with MSE regression loss is proposed. The ranking loss supervises a neural network with paired samples of the same person and penalises incorrect ordering of pose prediction. Analysis on this new loss function suggests it contributes to a better local feature extractor, where features are generalised to Abstract Landmarks which are pose-related features instead of pose-irrelevant information such as identity, age, and lighting. Extensive experiments show that our method significantly outperforms the current state-of-the-art schemes on public datasets: AFLW2000 and BIWI. Our model achieves significant improvements over previous SOTA MAE on AFLW2000 and BIWI from 4.50 to 3.66 and from 4.0 to 3.71 respectively. Source code will be made available at: https://github.com/seathiefwang/RankHeadPose.

Via

Access Paper or Ask Questions

Deep Density-aware Count Regressor

Aug 09, 2019

Zhuojun Chen, Junhao Cheng, Yuchen Yuan, Dongping Liao, Yizhou Li, Jiancheng Lv

Figure 1 for Deep Density-aware Count Regressor

Figure 2 for Deep Density-aware Count Regressor

Figure 3 for Deep Density-aware Count Regressor

Figure 4 for Deep Density-aware Count Regressor

Abstract:We seek to improve crowd counting as we perceive limits of currently prevalent density map estimation approach on both prediction accuracy and time efficiency. We show that a CNN regressing a global count trained with density map supervision can make more accurate prediction. We introduce multilayer gradient fusion for training a densityaware global count regressor. More specifically, on training stage, a backbone network receives gradients from multiple branches to learn the density information, whereas those branches are to be detached to accelerate inference. By taking advantages of such method, our model improves benchmark results on public datasets and exhibits itself to be a new solution to crowd counting problem in practice.

Via

Access Paper or Ask Questions