Mengmeng Tian

Research on Mask Wearing Detection of Natural Population Based on Improved YOLOv4

Aug 24, 2022

Xuecheng Wu, Mengmeng Tian, Lanhang Zhai

Abstract: The recent domestic COVID-19 epidemic has been serious, yet in some public places people either do not wear masks or wear them incorrectly, so staff must promptly remind and supervise them. Because this work is both important and laborious, automated mask-wearing detection in public places is needed. This paper proposes a new mask-wearing detection method based on an improved YOLOv4. First, we add a Coordinate Attention module to the backbone to coordinate feature fusion and representation. Second, we make a series of structural improvements to the network to enhance model performance and robustness. Third, we apply the K-means clustering algorithm to make the nine anchor boxes better suited to our NPMD dataset. Experimental results show that the improved YOLOv4 exceeds the baseline by 4.06% AP at a comparable speed of 64.37 FPS.
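The anchor-fitting step can be illustrated with a minimal sketch. The abstract only states that K-means is used to obtain nine anchor boxes; the `1 - IoU` distance (implemented here as picking the highest-IoU centroid), the mean-based centroid update, and the names `iou_wh` and `kmeans_anchors` are assumptions borrowed from common YOLO practice, not details confirmed by the paper.

```python
import numpy as np

def iou_wh(boxes, centroids):
    """IoU between (width, height) pairs, treating every box as if it
    shared the same top-left corner. Returns shape (n_boxes, n_centroids)."""
    inter_w = np.minimum(boxes[:, None, 0], centroids[None, :, 0])
    inter_h = np.minimum(boxes[:, None, 1], centroids[None, :, 1])
    inter = inter_w * inter_h
    area_b = boxes[:, 0] * boxes[:, 1]
    area_c = centroids[:, 0] * centroids[:, 1]
    return inter / (area_b[:, None] + area_c[None, :] - inter)

def kmeans_anchors(boxes, k=9, iters=100, seed=0):
    """Cluster ground-truth (width, height) pairs into k anchors.
    Assumption: each box is assigned to the centroid with the highest IoU,
    which is equivalent to minimising the distance 1 - IoU."""
    boxes = np.asarray(boxes, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct boxes in the dataset.
    centroids = boxes[rng.choice(len(boxes), size=k, replace=False)].copy()
    for _ in range(iters):
        assign = np.argmax(iou_wh(boxes, centroids), axis=1)
        new = centroids.copy()
        for j in range(k):
            members = boxes[assign == j]
            if len(members):  # keep the old centroid if a cluster is empty
                new[j] = members.mean(axis=0)
        if np.allclose(new, centroids):
            break
        centroids = new
    # Sort by area so anchors can be split across detection scales.
    return centroids[np.argsort(centroids[:, 0] * centroids[:, 1])]

# Synthetic stand-in for the dataset's box sizes (the NPMD data is not used here).
demo_boxes = np.random.default_rng(1).uniform(10, 200, size=(500, 2))
demo_anchors = kmeans_anchors(demo_boxes, k=9)
```

In a real pipeline the `demo_boxes` array would be replaced by the labelled box widths and heights, scaled to the network input resolution.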

* 4 pages, 1 figure

ICANet: A Method of Short Video Emotion Recognition Driven by Multimodal Data

Aug 24, 2022

Xuecheng Wu, Mengmeng Tian, Lanhang Zhai

Abstract: With the rapid development of artificial intelligence and short videos, emotion recognition in short videos has become one of the most important research topics in human-computer interaction. At present, most emotion recognition methods are still limited to a single modality. However, in daily life people often disguise their real emotions, so the accuracy of single-modality emotion recognition is relatively poor. Moreover, similar emotions are not easy to distinguish. We therefore propose a new approach, ICANet, which performs multimodal short-video emotion recognition using three modalities (audio, video, and optical flow), compensating for the limitations of a single modality and improving the accuracy of emotion recognition in short videos. ICANet achieves an accuracy of 80.77% on the IEMOCAP benchmark, exceeding the SOTA methods by 15.89%.

* 4 pages, 5 figures

A Contract Theory based Incentive Mechanism for Federated Learning

Aug 12, 2021

Mengmeng Tian, Yuxin Chen, Yuan Liu, Zehui Xiong, Cyril Leung, Chunyan Miao

Abstract: Federated learning (FL) is a privacy-preserving machine learning paradigm in which distributed clients collaboratively train a model. To accomplish an FL task, the task publisher pays financial incentives to the FL server, and the FL server offloads the task to the contributing FL clients. Designing proper incentives for the FL clients is challenging because the clients train the task privately. This paper proposes a contract theory based FL task training model that minimizes the incentive budget subject to the clients being individually rational (IR) and incentive compatible (IC) in each FL training round. We design a two-dimensional contract model by formally defining two private types of clients, namely data quality and computation effort. To effectively aggregate the trained models, a contract-based aggregator is proposed. We analyze the feasible and optimal contract solutions to the proposed contract model. Experimental results show that the generalization accuracy of the FL tasks can be improved by the proposed incentive mechanism where contract-based aggregation is applied.
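The IR and IC conditions can be made concrete with a small sketch. It is deliberately simplified and is not the paper's model: it uses a single scalar client type rather than the two-dimensional type described above, and the cost function `effort / client_type` along with the names `ContractItem`, `utility`, and `is_feasible` are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ContractItem:
    effort: float  # computation effort the client is asked to supply
    reward: float  # payment the client receives for supplying it

def utility(client_type: float, item: ContractItem) -> float:
    # Hypothetical cost model (an assumption, not from the paper):
    # higher-type clients bear a lower cost per unit of effort.
    return item.reward - item.effort / client_type

def is_feasible(types, menu, tol=1e-9):
    """Return (ir_ok, ic_ok) for a menu where menu[i] is designed for types[i].

    IR: each client gets non-negative utility from its own item.
    IC: no client prefers an item designed for another type.
    """
    ir_ok = all(utility(t, menu[i]) >= -tol for i, t in enumerate(types))
    ic_ok = all(
        utility(t, menu[i]) >= utility(t, other) - tol
        for i, t in enumerate(types)
        for other in menu
    )
    return ir_ok, ic_ok

types = [1.0, 2.0]
# Feasible: each type weakly prefers its own item and earns at least zero.
good_menu = [ContractItem(1.0, 1.0), ContractItem(3.0, 2.0)]
# Infeasible: the second item pays too little for the effort it demands.
bad_menu = [ContractItem(1.0, 1.0), ContractItem(3.0, 1.2)]

print(is_feasible(types, good_menu))  # (True, True)
print(is_feasible(types, bad_menu))   # (False, False)
```

A budget-minimizing designer would search over menus that pass both checks and pick the one with the lowest total reward; that optimization step is omitted here.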

* 7 pages, 2 figures, International Workshop on Federated and Transfer Learning for Data Sparsity and Confidentiality in Conjunction with IJCAI 2021 (FTL-IJCAI'21), Best Student Paper Award
