Phil Woodland

SkillAggregation: Reference-free LLM-Dependent Aggregation

Oct 14, 2024
Via arXiv

MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events

Sep 25, 2024
Via arXiv

CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models

May 22, 2024
Via arXiv

Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator

May 30, 2023
Via arXiv

Combination of Deep Speaker Embeddings for Diarisation

Oct 22, 2020
Via arXiv

Speaker diarisation using 2D self-attentive combination of embeddings

Feb 08, 2019
Via arXiv