Picture for Wei Zou

Wei Zou

Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion

Add code
Feb 20, 2025
Viaarxiv icon

Extend Adversarial Policy Against Neural Machine Translation via Unknown Token

Add code
Jan 21, 2025
Viaarxiv icon

C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness

Add code
Dec 16, 2024
Figure 1 for C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Figure 2 for C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Figure 3 for C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Figure 4 for C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Viaarxiv icon

Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data

Add code
Dec 03, 2024
Figure 1 for Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
Figure 2 for Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
Figure 3 for Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
Figure 4 for Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data
Viaarxiv icon

PhaGO: Protein function annotation for bacteriophages by integrating the genomic context

Add code
Aug 12, 2024
Viaarxiv icon

Why Not Transform Chat Large Language Models to Non-English?

Add code
May 22, 2024
Figure 1 for Why Not Transform Chat Large Language Models to Non-English?
Figure 2 for Why Not Transform Chat Large Language Models to Non-English?
Figure 3 for Why Not Transform Chat Large Language Models to Non-English?
Figure 4 for Why Not Transform Chat Large Language Models to Non-English?
Viaarxiv icon

Enforcing Paraphrase Generation via Controllable Latent Diffusion

Add code
Apr 13, 2024
Viaarxiv icon

MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models

Add code
Apr 02, 2024
Viaarxiv icon

Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization

Add code
Mar 05, 2024
Viaarxiv icon

Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization

Add code
Mar 05, 2024
Viaarxiv icon