Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sejoon Lim

Emotion Recognition Using Transformers with Masked Learning

Mar 23, 2024

Seongjae Min, Junseok Yang, Sangjun Lim, Junyong Lee, Sangwon Lee, Sejoon Lim

Figure 1 for Emotion Recognition Using Transformers with Masked Learning

Figure 2 for Emotion Recognition Using Transformers with Masked Learning

Abstract:In recent years, deep learning has achieved innovative advancements in various fields, including the analysis of human emotions and behaviors. Initiatives such as the Affective Behavior Analysis in-the-wild (ABAW) competition have been particularly instrumental in driving research in this area by providing diverse and challenging datasets that enable precise evaluation of complex emotional states. This study leverages the Vision Transformer (ViT) and Transformer models to focus on the estimation of Valence-Arousal (VA), which signifies the positivity and intensity of emotions, recognition of various facial expressions, and detection of Action Units (AU) representing fundamental muscle movements. This approach transcends traditional Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) based methods, proposing a new Transformer-based framework that maximizes the understanding of temporal and spatial features. The core contributions of this research include the introduction of a learning technique through random frame masking and the application of Focal loss adapted for imbalanced data, enhancing the accuracy and applicability of emotion and behavior analysis in real-world settings. This approach is expected to contribute to the advancement of emotional computing and deep learning methodologies.

Via

Access Paper or Ask Questions

PU-MFA : Point Cloud Up-sampling via Multi-scale Features Attention

Aug 22, 2022

Hyungjun Lee, Sejoon Lim

Figure 1 for PU-MFA : Point Cloud Up-sampling via Multi-scale Features Attention

Figure 2 for PU-MFA : Point Cloud Up-sampling via Multi-scale Features Attention

Figure 3 for PU-MFA : Point Cloud Up-sampling via Multi-scale Features Attention

Figure 4 for PU-MFA : Point Cloud Up-sampling via Multi-scale Features Attention

Abstract:Recently, research using point clouds has been increasing with the development of 3D scanner technology. According to this trend, the demand for high-quality point clouds is increasing, but there is still a problem with the high cost of obtaining high-quality point clouds. Therefore, with the recent remarkable development of deep learning, point cloud up-sampling research, which uses deep learning to generate high-quality point clouds from low-quality point clouds, is one of the fields attracting considerable attention. This paper proposes a new point cloud up-sampling method called Point cloud Up-sampling via Multi-scale Features Attention (PU-MFA). Inspired by previous studies that reported good performance using the multi-scale features or attention mechanisms, PU-MFA merges the two through a U-Net structure. In addition, PU-MFA adaptively uses multi-scale features to refine the global features effectively. The performance of PU-MFA was compared with other state-of-the-art methods through various experiments using the PU-GAN dataset, which is a synthetic point cloud dataset, and the KITTI dataset, which is the real-scanned point cloud dataset. In various experimental results, PU-MFA showed superior performance in quantitative and qualitative evaluation compared to other state-of-the-art methods, proving the effectiveness of the proposed method. The attention map of PU-MFA was also visualized to show the effect of multi-scale features.

* 11 pages, 9 figures, 2 tables

Via

Access Paper or Ask Questions

BYEL : Bootstrap on Your Emotion Latent

Jul 20, 2022

Hyungjun Lee, Hwangyu Lim, Sejoon Lim

Figure 1 for BYEL : Bootstrap on Your Emotion Latent

Figure 2 for BYEL : Bootstrap on Your Emotion Latent

Figure 3 for BYEL : Bootstrap on Your Emotion Latent

Figure 4 for BYEL : Bootstrap on Your Emotion Latent

Abstract:According to the problem of dataset construction cost for training in deep learning and the development of generative models, more and more researches are being conducted to train with synthetic data and to inference using real data. We propose emotion aware Self-Supervised Learning using ABAW's Learning Synthetic Data (LSD) dataset. We pre-train our method to LSD dataset as a self-supervised learning and then use the same LSD dataset to do downstream training on the emotion classification task as a supervised learning. As a result, a higher result(0.63) than baseline(0.5) was obtained.

* ABAW4th competition

Via

Access Paper or Ask Questions

Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator

Mar 24, 2022

Euiseok Jeong, Geesung Oh, Sejoon Lim

Figure 1 for Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator

Figure 2 for Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator

Abstract:Due to the collection of big data and the development of deep learning, research to predict human emotions in the wild is being actively conducted. We designed a multi-task model using ABAW dataset to predict valence-arousal, expression, and action unit through audio data and face images at in real world. We trained model from the incomplete label by applying the knowledge distillation technique. The teacher model was trained as a supervised learning method, and the student model was trained by using the output of the teacher model as a soft label. As a result we achieved 2.40 in Multi Task Learning task validation dataset.

Via

Access Paper or Ask Questions

Causal affect prediction model using a facial image sequence

Jul 08, 2021

Geesung Oh, Euiseok Jeong, Sejoon Lim

Figure 1 for Causal affect prediction model using a facial image sequence

Figure 2 for Causal affect prediction model using a facial image sequence

Figure 3 for Causal affect prediction model using a facial image sequence

Abstract:Among human affective behavior research, facial expression recognition research is improving in performance along with the development of deep learning. However, for improved performance, not only past images but also future images should be used along with corresponding facial images, but there are obstacles to the application of this technique to real-time environments. In this paper, we propose the causal affect prediction network (CAPNet), which uses only past facial images to predict corresponding affective valence and arousal. We train CAPNet to learn causal inference between past images and corresponding affective valence and arousal through supervised learning by pairing the sequence of past images with the current label using the Aff-Wild2 dataset. We show through experiments that the well-trained CAPNet outperforms the baseline of the second challenge of the Affective Behavior Analysis in-the-wild (ABAW2) Competition by predicting affective valence and arousal only with past facial images one-third of a second earlier. Therefore, in real-time application, CAPNet can reliably predict affective valence and arousal only with past data.

Via

Access Paper or Ask Questions