Picture for Vineet Garg

Vineet Garg

Oggi

Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

Add code
Nov 04, 2024
Viaarxiv icon

Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness

Add code
Jun 12, 2024
Viaarxiv icon

Streaming Anchor Loss: Augmenting Supervision with Temporal Significance

Add code
Oct 09, 2023
Viaarxiv icon

Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study

Add code
Sep 27, 2023
Viaarxiv icon

Leveraging Large Language Models for Exploiting ASR Uncertainty

Add code
Sep 12, 2023
Viaarxiv icon

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Add code
Mar 30, 2022
Figure 1 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 2 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 3 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 4 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Viaarxiv icon

Streaming on-device detection of device directed speech from voice and touch-based invocation

Add code
Oct 09, 2021
Figure 1 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 2 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 3 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 4 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Viaarxiv icon

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

Add code
May 14, 2021
Figure 1 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 2 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 3 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 4 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Viaarxiv icon

Progressive Voice Trigger Detection: Accuracy vs Latency

Add code
Oct 29, 2020
Figure 1 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 2 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 3 for Progressive Voice Trigger Detection: Accuracy vs Latency
Figure 4 for Progressive Voice Trigger Detection: Accuracy vs Latency
Viaarxiv icon

Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering

Add code
Aug 05, 2020
Figure 1 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 2 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 3 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Figure 4 for Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Viaarxiv icon