Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Sep 27, 2024

Eleni D. Koronaki, Geremy Loachamin Suntaxi, Paris Papavasileiou, Dimitrios G. Giovanis, Martin Kathrein, Andreas G. Boudouvis, Stéphane P. A. Bordas

Figure 1 for Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Figure 2 for Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Figure 3 for Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Figure 4 for Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Share this with someone who'll enjoy it:

Abstract:Important variables of processes are, in many occasions, categorical, i.e. names or labels representing, e.g. categories of inputs, or types of reactors or a sequence of steps. In this work, we use Large Language Models (LLMs) to derive embeddings of such inputs that represent their actual meaning, or reflect the ``distances" between categories, i.e. how similar or dissimilar they are. This is a marked difference from the current standard practice of using binary, or one-hot encoding to replace categorical variables with sequences of ones and zeros. Combined with dimensionality reduction techniques, either linear such as Principal Components Analysis (PCA), or nonlinear such as Uniform Manifold Approximation and Projection (UMAP), the proposed approach leads to a \textit{meaningful}, low-dimensional feature space. The significance of obtaining meaningful embeddings is illustrated in the context of an industrial coating process for cutting tools that includes both numerical and categorical inputs. The proposed approach enables feature importance which is a marked improvement compared to the current state-of-the-art (SotA) in the encoding of categorical variables.

View paper on

Share this with someone who'll enjoy it:

Title:Implementing LLMs in industrial process modeling: Addressing Categorical Variables

Paper and Code