Picture for Pranay Dighe

Pranay Dighe

Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models

Add code
Nov 04, 2024
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon

Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features

Add code
Oct 23, 2023
Viaarxiv icon

Leveraging Large Language Models for Exploiting ASR Uncertainty

Add code
Sep 12, 2023
Viaarxiv icon

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Add code
Mar 30, 2022
Figure 1 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 2 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 3 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Figure 4 for Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Viaarxiv icon

Streaming on-device detection of device directed speech from voice and touch-based invocation

Add code
Oct 09, 2021
Figure 1 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 2 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 3 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Figure 4 for Streaming on-device detection of device directed speech from voice and touch-based invocation
Viaarxiv icon

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

Add code
May 14, 2021
Figure 1 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 2 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 3 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 4 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Viaarxiv icon

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Add code
Oct 20, 2020
Figure 1 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 2 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 3 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Figure 4 for Knowledge Transfer for Efficient On-device False Trigger Mitigation
Viaarxiv icon

Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation

Add code
Aug 18, 2020
Figure 1 for Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation
Figure 2 for Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation
Figure 3 for Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation
Figure 4 for Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation
Viaarxiv icon

Lattice-based Improvements for Voice Triggering Using Graph Neural Networks

Add code
Jan 25, 2020
Figure 1 for Lattice-based Improvements for Voice Triggering Using Graph Neural Networks
Figure 2 for Lattice-based Improvements for Voice Triggering Using Graph Neural Networks
Figure 3 for Lattice-based Improvements for Voice Triggering Using Graph Neural Networks
Figure 4 for Lattice-based Improvements for Voice Triggering Using Graph Neural Networks
Viaarxiv icon