Picture for Xiaokang Liu

Xiaokang Liu

RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels

Add code
Jan 19, 2026
Viaarxiv icon

H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos

Add code
Dec 10, 2025
Figure 1 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 2 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 3 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 4 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Viaarxiv icon

DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Add code
Dec 19, 2024
Figure 1 for DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
Figure 2 for DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
Figure 3 for DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
Figure 4 for DiffSim: Taming Diffusion Models for Evaluating Visual Similarity
Viaarxiv icon

Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation

Add code
Dec 08, 2024
Viaarxiv icon

An End-To-End Stuttering Detection Method Based On Conformer And BILSTM

Add code
Nov 14, 2024
Figure 1 for An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
Figure 2 for An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
Figure 3 for An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
Figure 4 for An End-To-End Stuttering Detection Method Based On Conformer And BILSTM
Viaarxiv icon

Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network

Add code
May 07, 2024
Figure 1 for Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network
Figure 2 for Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network
Figure 3 for Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network
Figure 4 for Automatic Assessment of Dysarthria Using Audio-visual Vowel Graph Attention Network
Viaarxiv icon

Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras

Add code
Apr 29, 2024
Figure 1 for Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras
Figure 2 for Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras
Figure 3 for Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras
Figure 4 for Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras
Viaarxiv icon

An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data

Add code
Mar 12, 2024
Figure 1 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 2 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 3 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Figure 4 for An Audio-textual Diffusion Model For Converting Speech Signals Into Ultrasound Tongue Imaging Data
Viaarxiv icon

Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary

Add code
Apr 20, 2023
Viaarxiv icon

Schema Inference for Interpretable Image Classification

Add code
Mar 12, 2023
Viaarxiv icon