Picture for Xin Zhao

Xin Zhao

JD.com

Can LVLMs Describe Videos like Humans? A Five-in-One Video Annotations Benchmark for Better Human-Machine Comparison

Add code
Oct 20, 2024
Viaarxiv icon

HorGait: Advancing Gait Recognition with Efficient High-Order Spatial Interactions in LiDAR Point Clouds

Add code
Oct 11, 2024
Viaarxiv icon

CipherDM: Secure Three-Party Inference for Diffusion Model Sampling

Add code
Sep 09, 2024
Viaarxiv icon

AResNet-ViT: A Hybrid CNN-Transformer Network for Benign and Malignant Breast Nodule Classification in Ultrasound Images

Add code
Jul 27, 2024
Viaarxiv icon

What Matters in Learning Facts in Language Models? Multifaceted Knowledge Probing with Diverse Multi-Prompt Datasets

Add code
Jun 18, 2024
Viaarxiv icon

Video Coding with Cross-Component Sample Offset

Add code
Jun 03, 2024
Viaarxiv icon

MetaRM: Shifted Distributions Alignment via Meta-Learning

Add code
May 01, 2024
Viaarxiv icon

EulerFormer: Sequential User Behavior Modeling with Complex Vector Attention

Add code
Apr 04, 2024
Viaarxiv icon

UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment

Add code
Mar 25, 2024
Viaarxiv icon

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Add code
Mar 16, 2024
Viaarxiv icon