Yang Zheng

Design of an Expression Recognition Solution Employing the Global Channel-Spatial Attention Mechanism

Mar 15, 2025

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Mar 14, 2025

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Mar 13, 2025

Interactive Multimodal Fusion with Temporal Modeling

Mar 13, 2025

GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

Mar 13, 2025

Road Traffic Sign Recognition method using Siamese network Combining Efficient-CNN based Encoder

Feb 21, 2025

PLPP: Prompt Learning with Perplexity Is Self-Distillation for Vision-Language Models

Dec 18, 2024

AIpparel: A Large Multimodal Generative Model for Digital Garments

Dec 05, 2024

CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization

Sep 11, 2024

RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models

Aug 27, 2024