Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xavier Boucher

Leveraging Pre-trained Models for Failure Analysis Triplets Generation

Oct 31, 2022

Kenneth Ezukwoke, Anis Hoayek, Mireille Batton-Hubert, Xavier Boucher, Pascal Gounet, Jerome Adrian

Figure 1 for Leveraging Pre-trained Models for Failure Analysis Triplets Generation

Figure 2 for Leveraging Pre-trained Models for Failure Analysis Triplets Generation

Figure 3 for Leveraging Pre-trained Models for Failure Analysis Triplets Generation

Figure 4 for Leveraging Pre-trained Models for Failure Analysis Triplets Generation

Abstract:Pre-trained Language Models recently gained traction in the Natural Language Processing (NLP) domain for text summarization, generation and question-answering tasks. This stems from the innovation introduced in Transformer models and their overwhelming performance compared with Recurrent Neural Network Models (Long Short Term Memory (LSTM)). In this paper, we leverage the attention mechanism of pre-trained causal language models such as Transformer model for the downstream task of generating Failure Analysis Triplets (FATs) - a sequence of steps for analyzing defected components in the semiconductor industry. We compare different transformer models for this generative task and observe that Generative Pre-trained Transformer 2 (GPT2) outperformed other transformer model for the failure analysis triplet generation (FATG) task. In particular, we observe that GPT2 (trained on 1.5B parameters) outperforms pre-trained BERT, BART and GPT3 by a large margin on ROUGE. Furthermore, we introduce Levenshstein Sequential Evaluation metric (LESE) for better evaluation of the structured FAT data and show that it compares exactly with human judgment than existing metrics.

* 33 pages, 11 figures, 9 tables

Via

Access Paper or Ask Questions

GCVAE: Generalized-Controllable Variational AutoEncoder

Jun 09, 2022

Kenneth Ezukwoke, Anis Hoayek, Mireille Batton-Hubert, Xavier Boucher

Figure 1 for GCVAE: Generalized-Controllable Variational AutoEncoder

Figure 2 for GCVAE: Generalized-Controllable Variational AutoEncoder

Figure 3 for GCVAE: Generalized-Controllable Variational AutoEncoder

Figure 4 for GCVAE: Generalized-Controllable Variational AutoEncoder

Abstract:Variational autoencoders (VAEs) have recently been used for unsupervised disentanglement learning of complex density distributions. Numerous variants exist to encourage disentanglement in latent space while improving reconstruction. However, none have simultaneously managed the trade-off between attaining extremely low reconstruction error and a high disentanglement score. We present a generalized framework to handle this challenge under constrained optimization and demonstrate that it outperforms state-of-the-art existing models as regards disentanglement while balancing reconstruction. We introduce three controllable Lagrangian hyperparameters to control reconstruction loss, KL divergence loss and correlation measure. We prove that maximizing information in the reconstruction network is equivalent to information maximization during amortized inference under reasonable assumptions and constraint relaxation.

* 17 pages, 7 figures, 2 tables

Via

Access Paper or Ask Questions