Picture for Xi Yang

Xi Yang

SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors

Add code
Mar 20, 2025
Viaarxiv icon

Exploiting Vulnerabilities in Speech Translation Systems through Targeted Adversarial Attacks

Add code
Mar 05, 2025
Viaarxiv icon

CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition

Add code
Feb 26, 2025
Viaarxiv icon

Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors

Add code
Jan 31, 2025
Viaarxiv icon

Clustering Properties of Self-Supervised Learning

Add code
Jan 30, 2025
Viaarxiv icon

Towards Training-Free Open-World Classification with 3D Generative Models

Add code
Jan 29, 2025
Viaarxiv icon

CSHNet: A Novel Information Asymmetric Image Translation Method

Add code
Jan 17, 2025
Figure 1 for CSHNet: A Novel Information Asymmetric Image Translation Method
Figure 2 for CSHNet: A Novel Information Asymmetric Image Translation Method
Figure 3 for CSHNet: A Novel Information Asymmetric Image Translation Method
Figure 4 for CSHNet: A Novel Information Asymmetric Image Translation Method
Viaarxiv icon

Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification

Add code
Dec 28, 2024
Figure 1 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 2 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 3 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 4 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Viaarxiv icon

An archaeological Catalog Collection Method Based on Large Vision-Language Models

Add code
Dec 28, 2024
Figure 1 for An archaeological Catalog Collection Method Based on Large Vision-Language Models
Figure 2 for An archaeological Catalog Collection Method Based on Large Vision-Language Models
Figure 3 for An archaeological Catalog Collection Method Based on Large Vision-Language Models
Figure 4 for An archaeological Catalog Collection Method Based on Large Vision-Language Models
Viaarxiv icon

FFA Sora, video generation as fundus fluorescein angiography simulator

Add code
Dec 23, 2024
Viaarxiv icon