Chin-Lun Fu

Exploring Efficient-tuning Methods in Self-supervised Speech Models

Oct 10, 2022

Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen Li, Hung-yi Lee

Abstract: In this study, we explore efficient tuning methods for speech self-supervised learning. Recent studies show that self-supervised learning (SSL) can learn powerful representations for different speech tasks. However, fine-tuning pre-trained models for each downstream task is parameter-inefficient, since SSL models are notoriously large, with millions of parameters. Adapters are lightweight modules commonly used in NLP to solve this problem. In downstream tasks, the parameters of the SSL model are frozen, and only the adapters are trained. Given the lack of studies broadly exploring the effectiveness of adapters for self-supervised speech tasks, we fill this gap by adding various adapter modules to pre-trained speech SSL models. We show that performance parity can be achieved with over 90% parameter reduction, and we discuss the pros and cons of efficient tuning techniques. This is the first comprehensive investigation of various adapter types across speech tasks.
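
The idea of freezing the backbone and training only small adapter modules can be illustrated with a minimal sketch. The code below is not taken from the paper: it assumes a generic bottleneck adapter (down-projection, ReLU, up-projection, residual connection), and the class name `BottleneckAdapter`, the hidden size of 768, the bottleneck size of 32, the 12 layers, and the 95M-parameter backbone are all hypothetical values chosen for illustration.

```python
import random


class BottleneckAdapter:
    """Hypothetical bottleneck adapter: down-project, ReLU, up-project, residual."""

    def __init__(self, hidden_dim, bottleneck_dim, seed=0):
        rng = random.Random(seed)
        # Down-projection weights: hidden_dim x bottleneck_dim.
        self.w_down = [[rng.gauss(0.0, 0.02) for _ in range(bottleneck_dim)]
                       for _ in range(hidden_dim)]
        self.b_down = [0.0] * bottleneck_dim
        # Zero-initialised up-projection, so the adapter starts as the identity map.
        self.w_up = [[0.0] * hidden_dim for _ in range(bottleneck_dim)]
        self.b_up = [0.0] * hidden_dim

    def num_parameters(self):
        d, r = len(self.w_down), len(self.b_down)
        return d * r + r + r * d + d

    def forward(self, h):
        """Apply the adapter to one hidden vector h (a list of floats)."""
        d, r = len(h), len(self.b_down)
        z = [max(0.0, sum(h[i] * self.w_down[i][j] for i in range(d)) + self.b_down[j])
             for j in range(r)]
        delta = [sum(z[j] * self.w_up[j][k] for j in range(r)) + self.b_up[k]
                 for k in range(d)]
        return [h[k] + delta[k] for k in range(d)]


# All sizes below are assumptions for illustration, not figures from the paper.
HIDDEN_DIM = 768
BOTTLENECK_DIM = 32
NUM_LAYERS = 12
BACKBONE_PARAMS = 95_000_000  # frozen backbone; never updated during tuning

adapter = BottleneckAdapter(HIDDEN_DIM, BOTTLENECK_DIM)
trainable = NUM_LAYERS * adapter.num_parameters()  # one adapter per layer
reduction = 1.0 - trainable / BACKBONE_PARAMS
print(f"trainable parameters: {trainable}, reduction vs. full fine-tuning: {reduction:.2%}")
```

Under these assumed sizes, the adapters account for well under 10% of the backbone's parameter count, which shows how a reduction of the magnitude reported in the abstract is plausible. The actual adapter types and numbers studied in the paper may differ.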

* SLT 2022


Learning Facial Liveness Representation for Domain Generalized Face Anti-spoofing

Aug 16, 2022

Zih-Ching Chen, Lin-Hsi Tsao, Chin-Lun Fu, Shang-Fu Chen, Yu-Chiang Frank Wang

Abstract: Face anti-spoofing (FAS) aims to distinguish face spoof attacks from authentic faces, and is typically approached by learning models that perform the associated classification task. In practice, one would expect such models to generalize to FAS in different image domains. Moreover, it is not practical to assume that the type of spoof attack is known in advance. In this paper, we propose a deep learning model for this domain-generalized face anti-spoofing task. In particular, our proposed network disentangles the facial liveness representation from irrelevant ones (i.e., facial content and image domain features). The resulting liveness representation exhibits sufficient domain-invariant properties, and thus can be applied to domain-generalized FAS. We conduct experiments on five benchmark datasets with various settings, and we verify that our model performs favorably against state-of-the-art approaches in identifying novel types of spoof attacks in unseen image domains.

* Accepted to ICME 2022


AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

Apr 30, 2022

Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee

Abstract: Transformer-based pre-trained models with millions of parameters require large storage. Recent approaches tackle this shortcoming by training adapters, but these approaches still require a relatively large number of parameters. In this study, we propose AdapterBias, a surprisingly simple yet effective adapter architecture. AdapterBias adds a token-dependent shift to the hidden output of transformer layers to adapt to downstream tasks, using only a vector and a linear layer. Extensive experiments demonstrate the effectiveness of AdapterBias. They show that the proposed method dramatically reduces the number of trainable parameters compared with previous work, with only a minimal decrease in task performance compared with fine-tuned pre-trained models. We further find that AdapterBias automatically learns to assign larger representation shifts to the tokens related to the task under consideration.
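
The token-dependent shift described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the linear layer maps each token's hidden state to a single scalar weight, which then scales one shared shift vector. The class name `AdapterBiasSketch`, the bias term `b`, and the initialisation are hypothetical.

```python
import random


class AdapterBiasSketch:
    """Hypothetical sketch: shift each token's hidden state by alpha_i * v."""

    def __init__(self, hidden_dim, seed=0):
        rng = random.Random(seed)
        # Shared shift vector v (one per layer, same for every token).
        self.v = [rng.gauss(0.0, 0.02) for _ in range(hidden_dim)]
        # Linear layer mapping a hidden state to a scalar (assumed shape).
        self.w = [rng.gauss(0.0, 0.02) for _ in range(hidden_dim)]
        self.b = 0.0  # assumed bias term of the linear layer

    def token_weights(self, hidden_states):
        """Return one scalar weight alpha_i per token."""
        return [sum(x * w for x, w in zip(h, self.w)) + self.b
                for h in hidden_states]

    def forward(self, hidden_states):
        """Add the token-dependent shift alpha_i * v to each hidden state."""
        alphas = self.token_weights(hidden_states)
        return [[x + a * v for x, v in zip(h, self.v)]
                for h, a in zip(hidden_states, alphas)]

    def num_parameters(self):
        return len(self.v) + len(self.w) + 1


# Usage with hand-set parameters so the arithmetic is easy to follow.
model = AdapterBiasSketch(hidden_dim=3)
model.v = [1.0, 0.0, -1.0]
model.w = [1.0, 1.0, 1.0]
model.b = 0.0
tokens = [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]]
shifted = model.forward(tokens)
```

In this sketch the first token receives a weight of 6 and is shifted by 6 times `v`, while the all-zero token receives a weight of 0 and is left unchanged. Each layer adds only a vector and a small linear layer, which is why the trainable parameter count stays low.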

* The first two authors contributed equally. This paper will be published in Findings of NAACL 2022
