Picture for Xiaojun Wu

Xiaojun Wu

Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections

Add code
Nov 22, 2024
Figure 1 for Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Figure 2 for Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Figure 3 for Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Figure 4 for Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Viaarxiv icon

Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models

Add code
Nov 09, 2024
Viaarxiv icon

Capture Artifacts via Progressive Disentangling and Purifying Blended Identities for Deepfake Detection

Add code
Oct 15, 2024
Viaarxiv icon

Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation

Add code
Oct 09, 2024
Figure 1 for Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Figure 2 for Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Figure 3 for Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Figure 4 for Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Viaarxiv icon

QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation

Add code
Oct 09, 2024
Figure 1 for QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Figure 2 for QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Figure 3 for QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Figure 4 for QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Viaarxiv icon

RMLR: Extending Multinomial Logistic Regression into General Geometries

Add code
Sep 28, 2024
Viaarxiv icon

Dynamic Subframe Splitting and Spatio-Temporal Motion Entangled Sparse Attention for RGB-E Tracking

Add code
Sep 26, 2024
Viaarxiv icon

S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion

Add code
Jun 03, 2024
Viaarxiv icon

CoMoFusion: Fast and High-quality Fusion of Infrared and Visible Image with Consistency Model

Add code
May 31, 2024
Viaarxiv icon

Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support

Add code
Jan 26, 2024
Viaarxiv icon