Zhe Sun

Learning Long Short-Term Intention within Human Daily Behaviors

Apr 10, 2025
Via arXiv

Body Discovery of Embodied AI

Mar 25, 2025
Via arXiv

Bidirectional Prototype-Reward co-Evolution for Test-Time Adaptation of Vision-Language Models

Mar 12, 2025
Via arXiv

VQ-Flow: Taming Normalizing Flows for Multi-Class Anomaly Detection via Hierarchical Vector Quantization

Sep 02, 2024
Via arXiv

ComKD-CLIP: Comprehensive Knowledge Distillation for Contrastive Language-Image Pre-traning Model

Aug 08, 2024
Via arXiv

SentenceVAE: Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context

Aug 07, 2024
Via arXiv

Static and multivariate-temporal attentive fusion transformer for readmission risk prediction

Jul 15, 2024
Via arXiv

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

Apr 15, 2024
Via arXiv

StreakNet-Arch: An Anti-scattering Network-based Architecture for Underwater Carrier LiDAR-Radar Imaging

Apr 14, 2024
Via arXiv

MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification

Apr 13, 2024
Via arXiv