Picture for Zhaoqing Zhu

Zhaoqing Zhu

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Add code
Aug 21, 2025
Viaarxiv icon

Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding

Add code
Nov 12, 2024
Figure 1 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 2 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 3 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 4 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Viaarxiv icon

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Add code
Jul 17, 2024
Viaarxiv icon

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Add code
Apr 08, 2024
Viaarxiv icon

CLIPER: A Unified Vision-Language Framework for In-the-Wild Facial Expression Recognition

Add code
Mar 01, 2023
Viaarxiv icon

Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild

Add code
Aug 19, 2022
Figure 1 for Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Figure 2 for Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Figure 3 for Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Figure 4 for Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Viaarxiv icon

NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition

Add code
Jun 10, 2022
Figure 1 for NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Figure 2 for NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Figure 3 for NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Figure 4 for NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
Viaarxiv icon

AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition

Add code
May 24, 2022
Figure 1 for AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition
Figure 2 for AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition
Figure 3 for AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition
Figure 4 for AFNet-M: Adaptive Fusion Network with Masks for 2D+3D Facial Expression Recognition
Viaarxiv icon

MMNet: Muscle motion-guided network for micro-expression recognition

Add code
Jan 14, 2022
Figure 1 for MMNet: Muscle motion-guided network for micro-expression recognition
Figure 2 for MMNet: Muscle motion-guided network for micro-expression recognition
Figure 3 for MMNet: Muscle motion-guided network for micro-expression recognition
Figure 4 for MMNet: Muscle motion-guided network for micro-expression recognition
Viaarxiv icon

MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition

Add code
Sep 20, 2021
Figure 1 for MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Figure 2 for MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Figure 3 for MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Figure 4 for MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Viaarxiv icon