Zhiyun Lu

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights

Oct 06, 2024
Via arXiv

Apple Intelligence Foundation Language Models

Jul 29, 2024
Via arXiv

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Feb 19, 2024
Via arXiv

Instruction-Following Speech Recognition

Sep 18, 2023
Via arXiv

Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness

May 08, 2023
Via arXiv

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Apr 22, 2022
Via arXiv

Unsupervised Data Selection via Discrete Speech Representation for ASR

Apr 05, 2022
Via arXiv

Improving the fusion of acoustic and text representations in RNN-T

Jan 25, 2022
Via arXiv

Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition

Oct 08, 2021
Via arXiv

Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models

Apr 06, 2021
Via arXiv