Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xinke Wang

Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement

Jun 07, 2024

Wei Qian, Qi Li, Kun Li, Xinke Wang, Xiao Sun, Meng Wang, Dan Guo

Figure 1 for Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement

Figure 2 for Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement

Figure 3 for Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement

Figure 4 for Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement

Abstract:This paper briefly introduces the solutions developed by our team, HFUT-VUT, for Track 1 of self-supervised heart rate measurement in the 3rd Vision-based Remote Physiological Signal Sensing (RePSS) Challenge hosted at IJCAI 2024. The goal is to develop a self-supervised learning algorithm for heart rate (HR) estimation using unlabeled facial videos. To tackle this task, we present two self-supervised HR estimation solutions that integrate spatial-temporal modeling and contrastive learning, respectively. Specifically, we first propose a non-end-to-end self-supervised HR measurement framework based on spatial-temporal modeling, which can effectively capture subtle rPPG clues and leverage the inherent bandwidth and periodicity characteristics of rPPG to constrain the model. Meanwhile, we employ an excellent end-to-end solution based on contrastive learning, aiming to generalize across different scenarios from complementary perspectives. Finally, we combine the strengths of the above solutions through an ensemble strategy to generate the final predictions, leading to a more accurate HR estimation. As a result, our solutions achieved a remarkable RMSE score of 8.85277 on the test dataset, securing \textbf{2nd place} in Track 1 of the challenge.

Via

Access Paper or Ask Questions

Mutilmodal Feature Extraction and Attention-based Fusion for Emotion Estimation in Videos

Mar 18, 2023

Tao Shu, Xinke Wang, Ruotong Wang, Chuang Chen, Yixin Zhang, Xiao Sun

Figure 1 for Mutilmodal Feature Extraction and Attention-based Fusion for Emotion Estimation in Videos

Figure 2 for Mutilmodal Feature Extraction and Attention-based Fusion for Emotion Estimation in Videos

Figure 3 for Mutilmodal Feature Extraction and Attention-based Fusion for Emotion Estimation in Videos

Abstract:The continuous improvement of human-computer interaction technology makes it possible to compute emotions. In this paper, we introduce our submission to the CVPR 2023 Competition on Affective Behavior Analysis in-the-wild (ABAW). Sentiment analysis in human-computer interaction should, as far as possible Start with multiple dimensions, fill in the single imperfect emotion channel, and finally determine the emotion tendency by fitting multiple results. Therefore, We exploited multimodal features extracted from video of different lengths from the competition dataset, including audio, pose and images. Well-informed emotion representations drive us to propose a Attention-based multimodal framework for emotion estimation. Our system achieves the performance of 0.361 on the validation dataset. The code is available at [https://github.com/xkwangcn/ABAW-5th-RT-IAI].

* 5 pages, 1 figures

Via

Access Paper or Ask Questions