Hung-Yi Lee

Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

Jun 16, 2024

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Feb 22, 2024

Examining Forgetting in Continual Pre-training of Aligned Large Language Models

Jan 06, 2024

Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

Jan 30, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Dec 20, 2022

The Ability of Self-Supervised Speech Models for Audio Representations

Sep 28, 2022

On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting

Apr 01, 2022

Partially Fake Audio Detection by Self-attention-based Fake Span Discovery

Feb 15, 2022

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations

Oct 12, 2021

Analyzing the Robustness of Unsupervised Speech Recognition

Oct 12, 2021