Picture for Shen Zhao

Shen Zhao

VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation

Add code
Nov 14, 2024
Viaarxiv icon

Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE)

Add code
Sep 06, 2024
Viaarxiv icon

A Population-to-individual Tuning Framework for Adapting Pretrained LM to On-device User Intent Prediction

Add code
Aug 19, 2024
Viaarxiv icon

Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification

Add code
Jun 05, 2024
Viaarxiv icon

VG4D: Vision-Language Model Goes 4D Video Recognition

Add code
Apr 17, 2024
Viaarxiv icon

ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification

Add code
Jan 16, 2024
Viaarxiv icon

Explore Human Parsing Modality for Action Recognition

Add code
Jan 04, 2024
Figure 1 for Explore Human Parsing Modality for Action Recognition
Figure 2 for Explore Human Parsing Modality for Action Recognition
Viaarxiv icon

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Add code
Dec 20, 2023
Viaarxiv icon

Dynamic Compositional Graph Convolutional Network for Efficient Composite Human Motion Prediction

Add code
Nov 23, 2023
Viaarxiv icon

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training

Add code
Aug 22, 2023
Viaarxiv icon