Xianghong Fang

Rethinking The Uniformity Metric in Self-Supervised Learning

Mar 01, 2024

Xianghong Fang, Jian Li, Qiang Sun, Benyou Wang

Abstract: Uniformity plays a crucial role in the assessment of learned representations, contributing to a deeper comprehension of self-supervised learning. The seminal work by Wang and Isola (2020) introduced a uniformity metric that quantitatively measures the collapse degree of learned representations. Directly optimizing this metric together with alignment proves to be effective in preventing constant collapse. However, we present both theoretical and empirical evidence revealing that this metric lacks sensitivity to dimensional collapse, highlighting its limitations. To address this limitation and design a more effective uniformity metric, this paper identifies five fundamental properties, some of which the existing uniformity metric fails to meet. We subsequently introduce a novel uniformity metric that satisfies all of these desiderata and exhibits sensitivity to dimensional collapse. When applied as an auxiliary loss in various established self-supervised methods, our proposed uniformity metric consistently enhances their performance in downstream tasks. Our code is available at https://github.com/sunset-clouds/WassersteinUniformityMetric.

* ICLR 2024
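
For readers unfamiliar with the existing uniformity metric the abstract refers to, the following is a minimal sketch of it: the log of the average Gaussian potential over pairs of L2-normalized embeddings. The function name `uniformity_loss`, the temperature `t=2.0`, and the synthetic data are assumptions for illustration. This is not the Wasserstein-based metric proposed in the paper, and it is not taken from the authors' repository.

```python
import numpy as np

def uniformity_loss(z, t=2.0):
    """Log of the mean Gaussian potential over distinct pairs of rows.

    Rows of z are L2-normalized first; lower values mean the points
    are spread more uniformly on the unit hypersphere.
    """
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    # Pairwise squared Euclidean distances, shape (n, n).
    sq_dists = np.sum((z[:, None, :] - z[None, :, :]) ** 2, axis=-1)
    # Keep each unordered pair once (strict upper triangle).
    iu = np.triu_indices(len(z), k=1)
    return float(np.log(np.mean(np.exp(-t * sq_dists[iu]))))

rng = np.random.default_rng(0)

# Constant collapse: every embedding is the same vector (hypothetical data).
collapsed = np.tile(rng.normal(size=(1, 16)), (256, 1))
# Well-spread embeddings: normalized Gaussians are uniform on the sphere.
spread = rng.normal(size=(256, 16))

loss_collapsed = uniformity_loss(collapsed)
loss_spread = uniformity_loss(spread)
print(loss_collapsed, loss_spread)
```

Under constant collapse all pairwise distances are zero, so the metric takes its maximum value of 0, while well-spread embeddings give a clearly negative value. That is the sense in which minimizing it discourages constant collapse.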

Discrete Auto-regressive Variational Attention Models for Text Modeling

Jun 16, 2021

Xianghong Fang, Haoli Bai, Jian Li, Zenglin Xu, Michael Lyu, Irwin King

Abstract: Variational autoencoders (VAEs) have been widely applied for text modeling. In practice, however, they are troubled by two challenges: information underrepresentation and posterior collapse. The former arises because only the last hidden state of the LSTM encoder is transformed into the latent space, which is generally insufficient to summarize the data. The latter is a long-standing problem in the training of VAEs, where the optimization becomes trapped in a disastrous local optimum. In this paper, we propose the Discrete Auto-regressive Variational Attention Model (DAVAM) to address these challenges. Specifically, we introduce an auto-regressive variational attention approach to enrich the latent space by effectively capturing the semantic dependency from the input. We further design a discrete latent space for the variational attention and mathematically show that our model is free from posterior collapse. Extensive experiments on language modeling tasks demonstrate the superiority of DAVAM over several VAE counterparts.

* IJCNN 2021

Discrete Variational Attention Models for Language Generation

Apr 21, 2020

Xianghong Fang, Haoli Bai, Zenglin Xu, Michael Lyu, Irwin King

Abstract: Variational autoencoders have been widely applied for natural language generation; however, there are two long-standing problems: information under-representation and posterior collapse. The former arises from the fact that only the last hidden state from the encoder is transformed to the latent space, which is insufficient to summarize the data. The latter results from the imbalanced scale between the reconstruction loss and the KL divergence in the objective function. To tackle these issues, in this paper we propose the discrete variational attention model, which places a categorical distribution over the attention mechanism, motivated by the discrete nature of language. Our approach is combined with an auto-regressive prior to capture the sequential dependency from observations, which can enhance the latent space for language generation. Moreover, thanks to the property of discreteness, the training of our proposed approach does not suffer from posterior collapse. Furthermore, we carefully analyze the advantages of a discrete latent space over a continuous space with the common Gaussian distribution. Extensive experiments on language generation demonstrate the advantages of our proposed approach over state-of-the-art counterparts.

* 7 pages, 3 figures
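
For reference, the reconstruction and KL terms mentioned in the abstract appear in the standard VAE objective (the evidence lower bound), written below in its textbook form. The symbols $q_\phi$ (encoder), $p_\theta$ (decoder), and $p(z)$ (prior) are generic notation assumed here for illustration; this is not the paper's discrete-attention objective.

```latex
\mathcal{L}(\theta, \phi; x)
  = \mathbb{E}_{q_\phi(z \mid x)}\bigl[\log p_\theta(x \mid z)\bigr]
  - \mathrm{KL}\bigl(q_\phi(z \mid x) \,\|\, p(z)\bigr)
```

Posterior collapse corresponds to the KL term being driven to zero, so that $q_\phi(z \mid x)$ matches the prior and the decoder effectively ignores $z$.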

DART: Domain-Adversarial Residual-Transfer Networks for Unsupervised Cross-Domain Image Classification

Dec 30, 2018

Xianghong Fang, Haoli Bai, Ziyi Guo, Bin Shen, Steven Hoi, Zenglin Xu

Abstract: The accuracy of deep learning (e.g., convolutional neural networks) for an image classification task critically relies on the amount of labeled training data. For an image classification task on a new domain that lacks labeled data but has access to cheaply available unlabeled data, unsupervised domain adaptation is a promising technique to boost performance without incurring extra labeling cost, by assuming that images from different domains share some invariant characteristics. In this paper, we propose a new unsupervised domain adaptation method named Domain-Adversarial Residual-Transfer (DART) learning of Deep Neural Networks to tackle cross-domain image classification tasks. In contrast to the existing unsupervised domain adaptation approaches, the proposed DART not only learns domain-invariant features via adversarial training, but also achieves robust domain-adaptive classification via a residual-transfer strategy, all in an end-to-end training framework. We evaluate the performance of the proposed method for cross-domain image classification tasks on several well-known benchmark data sets, on which our method clearly outperforms the state-of-the-art approaches.
