Picture for Zitong Yu

Zitong Yu

AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection

Add code
Mar 30, 2025
Viaarxiv icon

AdaMHF: Adaptive Multimodal Hierarchical Fusion for Survival Prediction

Add code
Mar 27, 2025
Viaarxiv icon

TC-GS: Tri-plane based compression for 3D Gaussian Splatting

Add code
Mar 26, 2025
Viaarxiv icon

MoEdit: On Learning Quantity Perception for Multi-object Image Editing

Add code
Mar 13, 2025
Viaarxiv icon

BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models

Add code
Mar 13, 2025
Viaarxiv icon

CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement

Add code
Feb 19, 2025
Viaarxiv icon

Semi-rPPG: Semi-Supervised Remote Physiological Measurement with Curriculum Pseudo-Labeling

Add code
Feb 06, 2025
Figure 1 for Semi-rPPG: Semi-Supervised Remote Physiological Measurement with Curriculum Pseudo-Labeling
Figure 2 for Semi-rPPG: Semi-Supervised Remote Physiological Measurement with Curriculum Pseudo-Labeling
Figure 3 for Semi-rPPG: Semi-Supervised Remote Physiological Measurement with Curriculum Pseudo-Labeling
Figure 4 for Semi-rPPG: Semi-Supervised Remote Physiological Measurement with Curriculum Pseudo-Labeling
Viaarxiv icon

Kronecker Mask and Interpretive Prompts are Language-Action Video Learners

Add code
Feb 05, 2025
Viaarxiv icon

Distilled Transformers with Locally Enhanced Global Representations for Face Forgery Detection

Add code
Dec 28, 2024
Viaarxiv icon

BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing

Add code
Dec 24, 2024
Viaarxiv icon