
Xiaoyu Liang

Learn Before Represent: Bridging Generative and Contrastive Learning for Domain-Specific LLM Embeddings

Jan 16, 2026
Via arXiv

OpenRoboCare: A Multimodal Multi-Task Expert Demonstration Dataset for Robot Caregiving

Nov 17, 2025
Via arXiv

SAGE: Semantic-Aware Shared Sampling for Efficient Diffusion

Sep 19, 2025
Via arXiv

Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency

Feb 06, 2025
Via arXiv

Dynamic Token Reduction during Generation for Vision Language Models

Jan 24, 2025
Via arXiv

Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation

Dec 12, 2024
Via arXiv

KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server

Oct 10, 2024
Via arXiv

E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Aug 21, 2024
Via arXiv

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

Jul 08, 2024
Via arXiv

CushSense: Soft, Stretchable, and Comfortable Tactile-Sensing Skin for Physical Human-Robot Interaction

May 06, 2024
Via arXiv