Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Viet Huynh

Clustering-based Meta Bayesian Optimization with Theoretical Guarantee

Mar 08, 2025

Khoa Nguyen, Viet Huynh, Binh Tran, Tri Pham, Tin Huynh, Thin Nguyen

Abstract:Bayesian Optimization (BO) is a well-established method for addressing black-box optimization problems. In many real-world scenarios, optimization often involves multiple functions, emphasizing the importance of leveraging data and learned functions from prior tasks to enhance efficiency in the current task. To expedite convergence to the global optimum, recent studies have introduced meta-learning strategies, collectively referred to as meta-BO, to incorporate knowledge from historical tasks. However, in practical settings, the underlying functions are often heterogeneous, which can adversely affect optimization performance for the current task. Additionally, when the number of historical tasks is large, meta-BO methods face significant scalability challenges. In this work, we propose a scalable and robust meta-BO method designed to address key challenges in heterogeneous and large-scale meta-tasks. Our approach (1) effectively partitions transferred meta-functions into highly homogeneous clusters, (2) learns the geometry-based surrogate prototype that capture the structural patterns within each cluster, and (3) adaptively synthesizes meta-priors during the online phase using statistical distance-based weighting policies. Experimental results on real-world hyperparameter optimization (HPO) tasks, combined with theoretical guarantees, demonstrate the robustness and effectiveness of our method in overcoming these challenges.

* Accepted at PAKDD 2025

Via

Access Paper or Ask Questions

Improved and Efficient Text Adversarial Attacks using Target Information

May 02, 2021

Mahmoud Hossam, Trung Le, He Zhao, Viet Huynh, Dinh Phung

Figure 1 for Improved and Efficient Text Adversarial Attacks using Target Information

Figure 2 for Improved and Efficient Text Adversarial Attacks using Target Information

Figure 3 for Improved and Efficient Text Adversarial Attacks using Target Information

Figure 4 for Improved and Efficient Text Adversarial Attacks using Target Information

Abstract:There has been recently a growing interest in studying adversarial examples on natural language models in the black-box setting. These methods attack natural language classifiers by perturbing certain important words until the classifier label is changed. In order to find these important words, these methods rank all words by importance by querying the target model word by word for each input sentence, resulting in high query inefficiency. A new interesting approach was introduced that addresses this problem through interpretable learning to learn the word ranking instead of previous expensive search. The main advantage of using this approach is that it achieves comparable attack rates to the state-of-the-art methods, yet faster and with fewer queries, where fewer queries are desirable to avoid suspicion towards the attacking agent. Nonetheless, this approach sacrificed the useful information that could be leveraged from the target classifier for that sake of query efficiency. In this paper we study the effect of leveraging the target model outputs and data on both attack rates and average number of queries, and we show that both can be improved, with a limited overhead of additional queries.

* Accepted in the International Conference on Learning Representations (ICLR) workshop on Robust and Reliable Machine Learning in the Real World (RobustML)

Via

Access Paper or Ask Questions

Text Generation with Deep Variational GAN

Apr 27, 2021

Mahmoud Hossam, Trung Le, Michael Papasimeon, Viet Huynh, Dinh Phung

Figure 1 for Text Generation with Deep Variational GAN

Figure 2 for Text Generation with Deep Variational GAN

Figure 3 for Text Generation with Deep Variational GAN

Abstract:Generating realistic sequences is a central task in many machine learning applications. There has been considerable recent progress on building deep generative models for sequence generation tasks. However, the issue of mode-collapsing remains a main issue for the current models. In this paper we propose a GAN-based generic framework to address the problem of mode-collapse in a principled approach. We change the standard GAN objective to maximize a variational lower-bound of the log-likelihood while minimizing the Jensen-Shanon divergence between data and model distributions. We experiment our model with text generation task and show that it can generate realistic text with high diversity.

* Accepted in the Third Workshop on Bayesian Deep Learning (NIPS / NeurIPS 2018)

Via

Access Paper or Ask Questions

Topic Modelling Meets Deep Neural Networks: A Survey

Feb 28, 2021

He Zhao, Dinh Phung, Viet Huynh, Yuan Jin, Lan Du, Wray Buntine

Figure 1 for Topic Modelling Meets Deep Neural Networks: A Survey

Abstract:Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to summarise research developments and discuss open problems and future directions. In this paper, we provide a focused yet comprehensive overview of neural topic models for interested researchers in the AI community, so as to facilitate them to navigate and innovate in this fast-growing research area. To the best of our knowledge, ours is the first review focusing on this specific topic.

* A review on Neural Topic Models

Via

Access Paper or Ask Questions

Neural Sinkhorn Topic Model

Aug 12, 2020

He Zhao, Dinh Phung, Viet Huynh, Trung Le, Wray Buntine

Figure 1 for Neural Sinkhorn Topic Model

Figure 2 for Neural Sinkhorn Topic Model

Figure 3 for Neural Sinkhorn Topic Model

Figure 4 for Neural Sinkhorn Topic Model

Abstract:In this paper, we present a new topic modelling approach via the theory of optimal transport (OT). Specifically, we present a document with two distributions: a distribution over the words (doc-word distribution) and a distribution over the topics (doc-topic distribution). For one document, the doc-word distribution is the observed, sparse, low-level representation of the content, while the doc-topic distribution is the latent, dense, high-level one of the same content. Learning a topic model can then be viewed as a process of minimising the transportation of the semantic information from one distribution to the other. This new viewpoint leads to a novel OT-based topic modelling framework, which enjoys appealing simplicity, effectiveness, and efficiency. Extensive experiments show that our framework significantly outperforms several state-of-the-art models in terms of both topic quality and document representations.

Via

Access Paper or Ask Questions

OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation

May 22, 2020

Mahmoud Hossam, Trung Le, Viet Huynh, Michael Papasimeon, Dinh Phung

Figure 1 for OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation

Figure 2 for OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation

Figure 3 for OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation

Figure 4 for OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation

Abstract:One of the challenging problems in sequence generation tasks is the optimized generation of sequences with specific desired goals. Current sequential generative models mainly generate sequences to closely mimic the training data, without direct optimization of desired goals or properties specific to the task. We introduce OptiGAN, a generative model that incorporates both Generative Adversarial Networks (GAN) and Reinforcement Learning (RL) to optimize desired goal scores using policy gradients. We apply our model to text and real-valued sequence generation, where our model is able to achieve higher desired scores out-performing GAN and RL baselines, while not sacrificing output sample diversity.

* Preprint for accepted conference paper at International Joint Conference on Neural Networks (IJCNN) 2020

Via

Access Paper or Ask Questions

On Scalable Variant of Wasserstein Barycenter

Oct 10, 2019

Tam Le, Viet Huynh, Nhat Ho, Dinh Phung, Makoto Yamada

Figure 1 for On Scalable Variant of Wasserstein Barycenter

Figure 2 for On Scalable Variant of Wasserstein Barycenter

Figure 3 for On Scalable Variant of Wasserstein Barycenter

Figure 4 for On Scalable Variant of Wasserstein Barycenter

Abstract:We study a variant of Wasserstein barycenter problem, which we refer to as \emph{tree-sliced Wasserstein barycenter}, by leveraging the structure of tree metrics for the ground metrics in the formulation of Wasserstein distance. Drawing on the tree structure, we propose efficient algorithms for solving the unconstrained and constrained versions of tree-sliced Wasserstein barycenter. The algorithms have fast computational time and efficient memory usage, especially for high dimensional settings while demonstrating favorable results when the tree metrics are appropriately constructed. Experimental results on large-scale synthetic and real datasets from Wasserstein barycenter for documents with word embedding, multilevel clustering, and scalable Bayes problems show the advantages of tree-sliced Wasserstein barycenter over (Sinkhorn) Wasserstein barycenter.

* Tam Le, Viet Huynh, and Nhat Ho contributed equally to this work

Via

Access Paper or Ask Questions

On Efficient Multilevel Clustering via Wasserstein Distances

Sep 19, 2019

Viet Huynh, Nhat Ho, Nhan Dam, XuanLong Nguyen, Mikhail Yurochkin, Hung Bui, and Dinh Phung

Figure 1 for On Efficient Multilevel Clustering via Wasserstein Distances

Figure 2 for On Efficient Multilevel Clustering via Wasserstein Distances

Figure 3 for On Efficient Multilevel Clustering via Wasserstein Distances

Figure 4 for On Efficient Multilevel Clustering via Wasserstein Distances

Abstract:We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose several variants of this problem, which admit fast optimization algorithms, by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, the experimental results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.

* 42 pages, 8 figures, JMLR submission. arXiv admin note: substantial text overlap with arXiv:1706.03883

Via

Access Paper or Ask Questions

Probabilistic Multilevel Clustering via Composite Transportation Distance

Oct 29, 2018

Nhat Ho, Viet Huynh, Dinh Phung, Michael I. Jordan

Figure 1 for Probabilistic Multilevel Clustering via Composite Transportation Distance

Figure 2 for Probabilistic Multilevel Clustering via Composite Transportation Distance

Figure 3 for Probabilistic Multilevel Clustering via Composite Transportation Distance

Figure 4 for Probabilistic Multilevel Clustering via Composite Transportation Distance

Abstract:We propose a novel probabilistic approach to multilevel clustering problems based on composite transportation distance, which is a variant of transportation distance where the underlying metric is Kullback-Leibler divergence. Our method involves solving a joint optimization problem over spaces of probability measures to simultaneously discover grouping structures within groups and among groups. By exploiting the connection of our method to the problem of finding composite transportation barycenters, we develop fast and efficient optimization algorithms even for potentially large-scale multilevel datasets. Finally, we present experimental results with both synthetic and real data to demonstrate the efficiency and scalability of the proposed approach.

* 25 pages, 3 figures

Via

Access Paper or Ask Questions

Multilevel Clustering via Wasserstein Means

Jun 13, 2017

Nhat Ho, XuanLong Nguyen, Mikhail Yurochkin, Hung Hai Bui, Viet Huynh, Dinh Phung

Figure 1 for Multilevel Clustering via Wasserstein Means

Figure 2 for Multilevel Clustering via Wasserstein Means

Figure 3 for Multilevel Clustering via Wasserstein Means

Figure 4 for Multilevel Clustering via Wasserstein Means

Abstract:We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose a number of variants of this problem, which admit fast optimization algorithms, by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, experiment results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.

* Proceedings of the ICML, 2017

Via

Access Paper or Ask Questions