Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sara Bahaadini

MTLB-STRUCT @PARSEME 2020: Capturing Unseen Multiword Expressions Using Multi-task Learning and Pre-trained Masked Language Models

Nov 04, 2020

Shiva Taslimipoor, Sara Bahaadini, Ekaterina Kochmar

Figure 1 for MTLB-STRUCT @PARSEME 2020: Capturing Unseen Multiword Expressions Using Multi-task Learning and Pre-trained Masked Language Models

Figure 2 for MTLB-STRUCT @PARSEME 2020: Capturing Unseen Multiword Expressions Using Multi-task Learning and Pre-trained Masked Language Models

Figure 3 for MTLB-STRUCT @PARSEME 2020: Capturing Unseen Multiword Expressions Using Multi-task Learning and Pre-trained Masked Language Models

Abstract:This paper describes a semi-supervised system that jointly learns verbal multiword expressions (VMWEs) and dependency parse trees as an auxiliary task. The model benefits from pre-trained multilingual BERT. BERT hidden layers are shared among the two tasks and we introduce an additional linear layer to retrieve VMWE tags. The dependency parse tree prediction is modelled by a linear layer and a bilinear one plus a tree CRF on top of BERT. The system has participated in the open track of the PARSEME shared task 2020 and ranked first in terms of F1-score in identifying unseen VMWEs as well as VMWEs in general, averaged across all 14 languages.

* accepted for publication at MWE-LEX 2020 Workshop at COLING

Via

Access Paper or Ask Questions

Leveraging Semi-Supervised Learning for Fairness using Neural Networks

Dec 31, 2019

Vahid Noroozi, Sara Bahaadini, Samira Sheikhi, Nooshin Mojab, Philip S. Yu

Figure 1 for Leveraging Semi-Supervised Learning for Fairness using Neural Networks

Figure 2 for Leveraging Semi-Supervised Learning for Fairness using Neural Networks

Figure 3 for Leveraging Semi-Supervised Learning for Fairness using Neural Networks

Figure 4 for Leveraging Semi-Supervised Learning for Fairness using Neural Networks

Abstract:There has been a growing concern about the fairness of decision-making systems based on machine learning. The shortage of labeled data has been always a challenging problem facing machine learning based systems. In such scenarios, semi-supervised learning has shown to be an effective way of exploiting unlabeled data to improve upon the performance of model. Notably, unlabeled data do not contain label information which itself can be a significant source of bias in training machine learning systems. This inspired us to tackle the challenge of fairness by formulating the problem in a semi-supervised framework. In this paper, we propose a semi-supervised algorithm using neural networks benefiting from unlabeled data to not just improve the performance but also improve the fairness of the decision-making process. The proposed model, called SSFair, exploits the information in the unlabeled data to mitigate the bias in the training data.

* 6 pages, 5 figures, accepted to ICMLA 2019

Via

Access Paper or Ask Questions

Semi-supervised Deep Representation Learning for Multi-View Problems

Nov 11, 2018

Vahid Noroozi, Sara Bahaadini, Lei Zheng, Sihong Xie, Weixiang Shao, Philip S. Yu

Figure 1 for Semi-supervised Deep Representation Learning for Multi-View Problems

Figure 2 for Semi-supervised Deep Representation Learning for Multi-View Problems

Figure 3 for Semi-supervised Deep Representation Learning for Multi-View Problems

Figure 4 for Semi-supervised Deep Representation Learning for Multi-View Problems

Abstract:While neural networks for learning representation of multi-view data have been previously proposed as one of the state-of-the-art multi-view dimension reduction techniques, how to make the representation discriminative with only a small amount of labeled data is not well-studied. We introduce a semi-supervised neural network model, named Multi-view Discriminative Neural Network (MDNN), for multi-view problems. MDNN finds nonlinear view-specific mappings by projecting samples to a common feature space using multiple coupled deep networks. It is capable of leveraging both labeled and unlabeled data to project multi-view data so that samples from different classes are separated and those from the same class are clustered together. It also uses the inter-view correlation between views to exploit the available information in both the labeled and unlabeled data. Extensive experiments conducted on four datasets demonstrate the effectiveness of the proposed algorithm for multi-view semi-supervised learning.

* Accepted to IEEE Big Data 2018. 9 Pages

Via

Access Paper or Ask Questions

DIRECT: Deep Discriminative Embedding for Clustering of LIGO Data

May 07, 2018

Sara Bahaadini, Vahid Noroozi, Neda Rohani, Scott Coughlin, Michael Zevin, Aggelos K. Katsaggelos

Figure 1 for DIRECT: Deep Discriminative Embedding for Clustering of LIGO Data

Figure 2 for DIRECT: Deep Discriminative Embedding for Clustering of LIGO Data

Figure 3 for DIRECT: Deep Discriminative Embedding for Clustering of LIGO Data

Figure 4 for DIRECT: Deep Discriminative Embedding for Clustering of LIGO Data

Abstract:In this paper, benefiting from the strong ability of deep neural network in estimating non-linear functions, we propose a discriminative embedding function to be used as a feature extractor for clustering tasks. The trained embedding function transfers knowledge from the domain of a labeled set of morphologically-distinct images, known as classes, to a new domain within which new classes can potentially be isolated and identified. Our target application in this paper is the Gravity Spy Project, which is an effort to characterize transient, non-Gaussian noise present in data from the Advanced Laser Interferometer Gravitational-wave Observatory, or LIGO. Accumulating large, labeled sets of noise features and identifying of new classes of noise lead to a better understanding of their origin, which makes their removal from the data and/or detectors possible.

* This work has been accepted to be presented in the 25th IEEE International Conference on Image Processing (ICIP)

Via

Access Paper or Ask Questions

SEVEN: Deep Semi-supervised Verification Networks

Jun 14, 2017

Vahid Noroozi, Lei Zheng, Sara Bahaadini, Sihong Xie, Philip S. Yu

Figure 1 for SEVEN: Deep Semi-supervised Verification Networks

Figure 2 for SEVEN: Deep Semi-supervised Verification Networks

Figure 3 for SEVEN: Deep Semi-supervised Verification Networks

Figure 4 for SEVEN: Deep Semi-supervised Verification Networks

Abstract:Verification determines whether two samples belong to the same class or not, and has important applications such as face and fingerprint verification, where thousands or millions of categories are present but each category has scarce labeled examples, presenting two major challenges for existing deep learning models. We propose a deep semi-supervised model named SEmi-supervised VErification Network (SEVEN) to address these challenges. The model consists of two complementary components. The generative component addresses the lack of supervision within each category by learning general salient structures from a large amount of data across categories. The discriminative component exploits the learned general features to mitigate the lack of supervision within categories, and also directs the generative component to find more informative structures of the whole data manifold. The two components are tied together in SEVEN to allow an end-to-end training of the two components. Extensive experiments on four verification tasks demonstrate that SEVEN significantly outperforms other state-of-the-art deep semi-supervised techniques when labeled data are in short supply. Furthermore, SEVEN is competitive with fully supervised baselines trained with a larger amount of labeled data. It indicates the importance of the generative component in SEVEN.

* 7 pages, 2 figures, accepted to the 2017 International Joint Conference on Artificial Intelligence (IJCAI-17)

Via

Access Paper or Ask Questions

Deep Multi-view Models for Glitch Classification

Apr 28, 2017

Sara Bahaadini, Neda Rohani, Scott Coughlin, Michael Zevin, Vicky Kalogera, Aggelos K Katsaggelos

Figure 1 for Deep Multi-view Models for Glitch Classification

Figure 2 for Deep Multi-view Models for Glitch Classification

Figure 3 for Deep Multi-view Models for Glitch Classification

Figure 4 for Deep Multi-view Models for Glitch Classification

Abstract:Non-cosmic, non-Gaussian disturbances known as "glitches", show up in gravitational-wave data of the Advanced Laser Interferometer Gravitational-wave Observatory, or aLIGO. In this paper, we propose a deep multi-view convolutional neural network to classify glitches automatically. The primary purpose of classifying glitches is to understand their characteristics and origin, which facilitates their removal from the data or from the detector entirely. We visualize glitches as spectrograms and leverage the state-of-the-art image classification techniques in our model. The suggested classifier is a multi-view deep neural network that exploits four different views for classification. The experimental results demonstrate that the proposed model improves the overall accuracy of the classification compared to traditional single view algorithms.

* Accepted to the 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'17)

Via

Access Paper or Ask Questions