Wenhan Chao

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

Nov 27, 2023

Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, Wenhan Chao, Leibny Paola Garcia

Abstract: In this work, we study the features extracted by English self-supervised learning (SSL) models in cross-lingual contexts and propose a new metric to predict the quality of feature representations. Using automatic speech recognition (ASR) as a downstream task, we analyze the effect of model size, training objectives, and model architecture on the models' performance as feature extractors for a set of typologically diverse corpora. We develop a novel metric, the Phonetic-Syntax Ratio (PSR), to measure the phonetic and syntactic information in the extracted representations using deep generalized canonical correlation analysis. Results show that the contrastive loss in the wav2vec2.0 objective facilitates more effective cross-lingual feature extraction. There is a positive correlation between PSR scores and ASR performance, suggesting that phonetic information extracted by monolingual SSL models can be used for downstream tasks in cross-lingual settings. The proposed metric is an effective indicator of the quality of the representations and can be useful for model selection.

* 12 pages, 5 figures, 4 tables

Interpretable Charge Predictions for Criminal Cases: Learning to Generate Court Views from Fact Descriptions

Feb 23, 2018

Hai Ye, Xin Jiang, Zhunchen Luo, Wenhan Chao

Abstract: In this paper, we propose to study the problem of COURT VIEW GENeration from the fact description in a criminal case. The task aims to improve the interpretability of charge prediction systems and to support automatic legal document generation. We formulate this task as a text-to-text natural language generation (NLG) problem. Sequence-to-sequence (Seq2Seq) models have achieved cutting-edge performance in many NLG tasks. However, because fact descriptions are not sufficiently distinctive, it is hard for a Seq2Seq model to generate charge-discriminative court views. In this work, we explore charge labels to tackle this issue. We propose a label-conditioned Seq2Seq model with attention that decodes court views conditioned on encoded charge labels. Experimental results show the effectiveness of our method.

* To appear in NAACL 2018, Long paper

Jointly Extracting Relations with Class Ties via Effective Deep Ranking

Aug 05, 2017

Hai Ye, Wenhan Chao, Zhunchen Luo, Zhoujun Li

Abstract: Connections between relations in relation extraction, which we call class ties, are common. In the distantly supervised scenario, one entity tuple may have multiple relation facts. Exploiting class ties between the relations of one entity tuple is promising for distantly supervised relation extraction. However, previous models either model this property ineffectively or ignore it. In this work, to leverage class ties effectively, we propose joint relation extraction with a unified model that integrates a convolutional neural network (CNN) with a general pairwise ranking framework, in which three novel ranking loss functions are introduced. Additionally, we present an effective method to relieve the severe class imbalance caused by NR (not relation) during model training. Experiments on a widely used dataset show that leveraging class ties enhances extraction and demonstrate the effectiveness of our model in learning class ties. Our model outperforms the baselines significantly, achieving state-of-the-art performance.

* To appear in ACL 2017
