Picture for Linda Liu

Linda Liu

NowYouSee Me: Context-Aware Automatic Audio Description

Add code
Dec 13, 2024
Viaarxiv icon

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning

Add code
Dec 10, 2024
Viaarxiv icon

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Add code
Nov 05, 2023
Viaarxiv icon

Selective Structured State-Spaces for Long-Form Video Understanding

Add code
Mar 25, 2023
Viaarxiv icon

Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition

Add code
Feb 17, 2022
Figure 1 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 2 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 3 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 4 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Viaarxiv icon

Personalization Strategies for End-to-End Speech Recognition Systems

Add code
Feb 15, 2021
Figure 1 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 2 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 3 for Personalization Strategies for End-to-End Speech Recognition Systems
Figure 4 for Personalization Strategies for End-to-End Speech Recognition Systems
Viaarxiv icon

Domain-aware Neural Language Models for Speech Recognition

Add code
Jan 05, 2021
Figure 1 for Domain-aware Neural Language Models for Speech Recognition
Figure 2 for Domain-aware Neural Language Models for Speech Recognition
Figure 3 for Domain-aware Neural Language Models for Speech Recognition
Figure 4 for Domain-aware Neural Language Models for Speech Recognition
Viaarxiv icon

Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion

Add code
Nov 30, 2020
Figure 1 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 2 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 3 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Figure 4 for Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Viaarxiv icon

Multi-task Language Modeling for Improving Speech Recognition of Rare Words

Add code
Nov 25, 2020
Figure 1 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 2 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 3 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 4 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Viaarxiv icon

Contextual Language Model Adaptation for Conversational Agents

Add code
Jul 31, 2018
Figure 1 for Contextual Language Model Adaptation for Conversational Agents
Figure 2 for Contextual Language Model Adaptation for Conversational Agents
Figure 3 for Contextual Language Model Adaptation for Conversational Agents
Figure 4 for Contextual Language Model Adaptation for Conversational Agents
Viaarxiv icon