Abstract: As a prominent challenge in addressing real-world issues within dynamic environments, label shift, the learning setting in which the source (training) and target (testing) label distributions do not match, has recently received increasing attention. Existing label shift methods use unlabeled target samples only to estimate the target label distribution and do not involve them in classifier training, resulting in suboptimal utilization of the available information. One common remedy is to directly blend the source and target distributions when training the target classifier. However, we illustrate the theoretical bias and limitations of direct distribution mixture in the label shift setting. To tackle this crucial yet unexplored issue, we introduce the concept of aligned distribution mixture and establish its theoretical optimality and generalization error bounds. Incorporating insights from generalization theory, we propose a novel label shift framework named Aligned Distribution Mixture (ADM). Within this framework, we enhance four typical label shift methods by modifying their classifier training processes. Furthermore, we propose a one-step approach that incorporates a novel coupling weight estimation strategy; considering the distinctiveness of this one-step approach, we develop an efficient bi-level optimization strategy for it. Experimental results demonstrate the effectiveness of our approaches, as well as their applicability to COVID-19 diagnosis.
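The abstract does not spell out ADM's training procedure, but label shift methods of the kind it enhances typically begin with confusion-matrix-based importance weight estimation from unlabeled target data. Below is a minimal sketch of that classic estimation step (in the spirit of black-box shift estimation), assuming integer class labels, a held-out source set, and a classifier with a scikit-learn-style `predict` method; the function name and interface are hypothetical, not the paper's API.

```python
# Hypothetical sketch: confusion-matrix-based label-shift weight estimation.
# This is NOT the ADM algorithm itself; the abstract does not specify it.
import numpy as np

def estimate_label_shift_weights(clf, X_val, y_val, X_tgt, n_classes):
    """Estimate w[k] = p_tgt(y=k) / p_src(y=k) from unlabeled target data."""
    # C[i, j] = p_src(predict = i, y = j), estimated on held-out source data.
    preds_val = clf.predict(X_val)
    C = np.zeros((n_classes, n_classes))
    for i in range(n_classes):
        for j in range(n_classes):
            C[i, j] = np.mean((preds_val == i) & (y_val == j))
    # mu[i] = p_tgt(predict = i), estimated on the unlabeled target set.
    preds_tgt = clf.predict(X_tgt)
    mu = np.array([np.mean(preds_tgt == i) for i in range(n_classes)])
    # Under label shift, C @ w = mu; solve in the least-squares sense.
    w, *_ = np.linalg.lstsq(C, mu, rcond=None)
    return np.clip(w, 0.0, None)  # importance weights are non-negative
```

The weights could then be fed back into training, e.g. `clf.fit(X_src, y_src, sample_weight=w[y_src])` with a scikit-learn classifier; this retraining step is where the abstract argues target samples should be more fully exploited.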
Abstract: Multi-view subspace clustering has been applied in areas such as image processing and video surveillance, and has attracted increasing attention. Most existing methods learn view-specific self-representation matrices and construct a combined affinity matrix from the multiple views. The affinity construction process is time-consuming, and the combined affinity matrix is not guaranteed to reflect the complete true subspace structure. To overcome these issues, the Latent Complete Row Space Recovery (LCRSR) method is proposed. Concretely, LCRSR is based on the assumption that the multi-view observations are generated from an underlying latent representation, which in turn consists of authentic samples drawn exactly from multiple subspaces. LCRSR recovers the row space of this latent representation, which not only carries complete information from all views but also determines the subspace membership under certain conditions. LCRSR does not involve a graph construction procedure and is solved with an efficient, provably convergent algorithm, making it more scalable to large-scale datasets. The effectiveness and efficiency of LCRSR are validated by clustering various kinds of multi-view data and illustrated on the background subtraction task.
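LCRSR's exact recovery procedure is not given in the abstract, but the role of the latent row space can be illustrated with a toy sketch: stack the views, take a truncated SVD, and cluster the rows of the right singular matrix. This only illustrates why the row space can determine subspace membership for clean data; it is not the LCRSR algorithm, and the function name and data layout are assumptions.

```python
# Toy illustration of the row-space principle (not the LCRSR algorithm):
# for clean data from independent subspaces, the row space of the stacked
# data matrix encodes subspace membership.
import numpy as np
from sklearn.cluster import KMeans

def latent_row_space_clustering(views, n_clusters, rank):
    """views: list of (d_v, n) arrays over the same n samples (assumed layout)."""
    X = np.vstack(views)                         # stack all views feature-wise
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    V = Vt[:rank].T                              # (n, rank): row-space basis of X
    V /= np.linalg.norm(V, axis=1, keepdims=True) + 1e-12  # normalize rows
    return KMeans(n_clusters=n_clusters, n_init=10).fit_predict(V)
```

Clustering the rows of V directly with k-means keeps the sketch free of any affinity-graph construction, consistent with the scalability argument in the abstract.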
Abstract: In real-world applications, not all instances in multi-view data are fully represented. To deal with incomplete multi-view data, traditional multi-view algorithms usually discard the incomplete instances, losing available information. To overcome this loss, Incomplete Multi-view Learning (IML) has become a hot research topic. In this paper, we propose a general IML framework that unifies existing IML methods and offers insight into IML. The framework jointly performs embedding learning and low-rank approximation: it approximates the incomplete data by a set of low-rank matrices and learns a full, common embedding via linear transformation. Several existing IML methods can be unified as special cases of the framework. More interestingly, some linear-transformation-based full-view methods can be adapted to IML directly under the guidance of the framework, bridging the gap between full multi-view learning and IML. Moreover, the framework can guide the development of new algorithms. As an illustration, within the framework we propose a specific method, termed Incomplete Multi-view Learning with Block-Diagonal Representation (IML-BDR). Based on the assumption that the sampled examples have an approximately linear subspace structure, IML-BDR imposes a block-diagonal structural prior when learning the full embedding, which leads to more accurate clustering. A convergent alternating iterative algorithm with the Successive Over-Relaxation (SOR) optimization technique is devised for optimization. Experimental results on various datasets demonstrate the effectiveness of IML-BDR.
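The framework's core idea of joint low-rank approximation and common embedding learning can be sketched as fitting each incomplete view X_v by U_v @ Z on observed entries only, with Z shared across views. The sketch below uses plain gradient descent in place of the paper's SOR-based alternating solver and omits the block-diagonal prior of IML-BDR; all names, the mask convention, and the step size are assumptions for illustration.

```python
# Minimal, hypothetical sketch of the framework's core objective:
#   min sum_v || M_v * (U_v @ Z - X_v) ||_F^2
# where M_v marks observed entries and Z is the full common embedding.
# Gradient descent stands in for the paper's SOR-based solver.
import numpy as np

def incomplete_multiview_embedding(views, masks, k, n_iter=500, lr=1e-2):
    """views: list of (d_v, n) arrays; masks: same shapes, 1 = observed."""
    n = views[0].shape[1]
    rng = np.random.default_rng(0)
    Z = rng.standard_normal((k, n)) * 0.1               # common embedding
    Us = [rng.standard_normal((X.shape[0], k)) * 0.1 for X in views]
    for _ in range(n_iter):
        for v, (X, M) in enumerate(zip(views, masks)):
            R = M * (Us[v] @ Z - X)                     # residual, observed only
            Us[v] = Us[v] - lr * (R @ Z.T)              # gradient step on U_v
            R = M * (Us[v] @ Z - X)                     # refresh residual
            Z = Z - lr * (Us[v].T @ R)                  # gradient step on Z
    return Z                                            # (k, n) full embedding
```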
Abstract: Feature selection and feature transformation, the two main approaches to dimensionality reduction, are usually presented separately. In this paper, a feature selection method is proposed by combining the popular transformation-based dimensionality reduction method Linear Discriminant Analysis (LDA) with sparsity regularization. We impose row sparsity on the transformation matrix of LDA through ${\ell}_{2,1}$-norm regularization to achieve feature selection, so the resulting formulation simultaneously selects the most discriminative features and removes redundant ones. The formulation is further extended to the ${\ell}_{2,p}$-norm regularized case, which is more likely to offer better sparsity when $0<p<1$ and is thus a better approximation to the feature selection problem. An efficient algorithm is developed to solve the ${\ell}_{2,p}$-norm based optimization problem, and it is proved to converge when $0<p\le 2$. Systematic experiments are conducted to understand the behavior of the proposed method. Promising experimental results on various types of real-world datasets demonstrate the effectiveness of our algorithm.
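To see how row sparsity yields feature selection, consider the standard iteratively reweighted solver for an ${\ell}_{2,1}$-regularized least-squares surrogate: each feature corresponds to one row of the transformation matrix W, and features whose row norms shrink toward zero are discarded. This sketch uses a regression surrogate rather than the paper's exact LDA objective and omits the ${\ell}_{2,p}$ extension; the function name and defaults are assumptions.

```python
# Sketch of the common iteratively reweighted scheme for
#   min_W ||X W - Y||_F^2 + lam * sum_i ||w^i||_2   (l2,1 surrogate),
# where w^i is the i-th row of W. Not the paper's exact LDA formulation.
import numpy as np

def l21_feature_selection(X, Y, lam=1.0, n_iter=50, eps=1e-8):
    """X: (n, d) data; Y: (n, c) one-hot labels. Returns feature ranking."""
    d = X.shape[1]
    D = np.eye(d)                                        # reweighting matrix
    for _ in range(n_iter):
        # Closed-form update: W = (X^T X + lam * D)^{-1} X^T Y
        W = np.linalg.solve(X.T @ X + lam * D, X.T @ Y)  # (d, c)
        row_norms = np.linalg.norm(W, axis=1)
        D = np.diag(1.0 / (2.0 * row_norms + eps))       # reweight by row norms
    return np.argsort(-row_norms)                        # features by importance
```

Rows of W with large ${\ell}_2$ norm mark discriminative features, while near-zero rows mark redundant ones, which is exactly the selection mechanism the abstract describes.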