Abstract: In this work, we tested the Triplet Extraction (TE) capabilities of a variety of Large Language Models (LLMs) of different sizes in the Zero- and Few-Shot settings. In detail, we proposed a pipeline that dynamically gathers contextual information from a Knowledge Base (KB), both in the form of context triplets and of (sentence, triplets) pairs serving as examples, and provides it to the LLM through a prompt. The additional context allowed the LLMs to be competitive with all the older, fully trained baselines based on the Bidirectional Long Short-Term Memory (BiLSTM) network architecture. We further conducted a detailed analysis of the quality of the gathered KB context, finding it to be strongly correlated with the final TE performance of the model. In contrast, model size appeared to improve the TE capabilities of the LLMs only logarithmically.
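A minimal sketch of how such a context-augmented prompt could be assembled. The retrieval helpers, the prompt template, and the argument names are illustrative assumptions, not the paper's actual implementation:

```python
# Hypothetical sketch of the KB-augmented prompting pipeline described above.
# retrieve_context_triplets and retrieve_demonstrations stand in for whatever
# KB lookup the paper actually uses (assumed names and signatures).

def build_te_prompt(sentence, kb):
    context_triplets = kb.retrieve_context_triplets(sentence, k=5)  # (head, relation, tail) facts
    demonstrations = kb.retrieve_demonstrations(sentence, k=3)      # (sentence, triplets) pairs

    lines = ["Extract (head, relation, tail) triplets from the sentence."]
    lines.append("Relevant KB facts:")
    lines += [f"  ({h}, {r}, {t})" for h, r, t in context_triplets]
    for ex_sentence, ex_triplets in demonstrations:                 # few-shot examples
        lines.append(f"Sentence: {ex_sentence}")
        lines.append(f"Triplets: {ex_triplets}")
    lines.append(f"Sentence: {sentence}")                           # the actual query
    lines.append("Triplets:")
    return "\n".join(lines)
```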
Abstract: The estimation of probability density functions is a non-trivial task that in recent years has been tackled with machine learning techniques. Successful applications can be obtained using models inspired by the Boltzmann machine (BM) architecture. In this manuscript, the product Jacobi-Theta Boltzmann machine (pJTBM) is introduced as a restricted version of the Riemann-Theta Boltzmann machine (RTBM) with a diagonal hidden sector connection matrix. We show that score matching, based on the Fisher divergence, can be used to fit probability densities with the pJTBM more efficiently than with the original RTBM.
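For reference, the score-matching objective referred to here is the standard one: the Fisher divergence between the data density p and the model density q_theta, which Hyvärinen's integration-by-parts argument reduces to a form free of the intractable normalization constant (standard formulation, not quoted from the manuscript):

```latex
% Fisher divergence between data density p and model density q_\theta
D_F(p \,\|\, q_\theta) = \tfrac{1}{2}\,
  \mathbb{E}_{x \sim p}\!\left[ \left\| \nabla_x \log p(x)
  - \nabla_x \log q_\theta(x) \right\|^2 \right]

% Equivalent score-matching objective (up to an additive constant),
% obtained via integration by parts; \Delta_x is the Laplacian:
J(\theta) = \mathbb{E}_{x \sim p}\!\left[
  \tfrac{1}{2} \left\| \nabla_x \log q_\theta(x) \right\|^2
  + \Delta_x \log q_\theta(x) \right]
```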
Abstract: The probability density function for the visible sector of a Riemann-Theta Boltzmann machine can be taken conditional on a subset of the visible units. We derive that the corresponding conditional density function is given by a reparameterization of the Riemann-Theta Boltzmann machine modelling the original probability density function. Therefore, the conditional densities can be inferred directly from the Riemann-Theta Boltzmann machine.
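This closure-under-conditioning property mirrors the familiar behaviour of the multivariate Gaussian family; the standard Gaussian formulas below are given only as an analogy, not as the RTBM reparameterization itself:

```latex
% Gaussian analogue: partition v = (v_a, v_b) with mean \mu = (\mu_a, \mu_b)
% and covariance blocks \Sigma_{aa}, \Sigma_{ab}, \Sigma_{ba}, \Sigma_{bb};
% conditioning yields another Gaussian with reparameterized mean and covariance:
p(v_a \mid v_b) = \mathcal{N}\!\left(
  \mu_a + \Sigma_{ab}\Sigma_{bb}^{-1}(v_b - \mu_b),\;
  \Sigma_{aa} - \Sigma_{ab}\Sigma_{bb}^{-1}\Sigma_{ba} \right)
```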
Abstract: We show that the visible sector probability density function of the Riemann-Theta Boltzmann machine corresponds to a Gaussian mixture model consisting of an infinite number of component multivariate Gaussians. The weights of the mixture are given by a discrete multivariate Gaussian over the hidden state space. This allows us to sample the visible sector density function in a straightforward manner. Furthermore, we show that the visible sector probability density function possesses an affine transform property, similar to the multivariate Gaussian density.
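The mixture structure suggests a simple two-step ancestral sampler: draw an integer hidden state with probability proportional to its discrete Gaussian weight, then draw the visible vector from that state's Gaussian component. The sketch below assumes a truncation of the infinite integer lattice and generic weight_of/mean_of maps, not the RTBM's actual parameterization:

```python
import itertools
import numpy as np

# Illustrative two-step ancestral sampler for a Gaussian mixture whose
# component weights form a discrete Gaussian over integer hidden states.
# weight_of(h) and mean_of(h) are assumed callables; trunc bounds the lattice.

def sample_visible(weight_of, mean_of, cov, n_hidden, n_samples, trunc=5, seed=0):
    rng = np.random.default_rng(seed)
    # Truncate the (formally infinite) integer lattice of hidden states.
    states = list(itertools.product(range(-trunc, trunc + 1), repeat=n_hidden))
    w = np.array([weight_of(np.array(h)) for h in states])
    w /= w.sum()                                  # normalize the mixture weights
    idx = rng.choice(len(states), size=n_samples, p=w)
    # Draw each visible sample from the Gaussian component of its hidden state.
    return np.stack([rng.multivariate_normal(mean_of(np.array(states[i])), cov)
                     for i in idx])
```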
Abstract: A general Boltzmann machine with continuous visible and discrete, integer-valued hidden states is introduced. Under mild assumptions about the connection matrices, the probability density function of the visible units can be solved for analytically, yielding a novel parametric density function involving a ratio of Riemann-Theta functions. The conditional expectation of a hidden state for given visible states can also be calculated analytically, yielding a derivative of the logarithmic Riemann-Theta function. The conditional expectation can be used as an activation function in a feedforward neural network, thereby increasing the modelling capacity of the network. Both the Boltzmann machine and the derived feedforward neural network can be successfully trained via standard gradient-based and non-gradient-based optimization techniques.
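A minimal numerical sketch of such a theta-based activation for the one-hidden-unit case, computing the conditional expectation directly from truncated Boltzmann weights. The real-exponential convention theta(z) = sum_n exp(-q n^2/2 - n z), the parameter names, and the truncation range are assumptions, not the paper's exact conventions:

```python
import numpy as np

# Theta-style activation: the conditional expectation E[h | z] over integer
# hidden states h with Boltzmann weights w_n = exp(-0.5*q*n^2 - n*z).
# This equals -d/dz log theta(z) for theta(z) = sum_n w_n, i.e. a derivative
# of the logarithmic theta function, as in the abstract (sign convention assumed).

def theta_activation(z, q=1.0, trunc=20):
    z = np.atleast_1d(np.asarray(z, dtype=float))
    n = np.arange(-trunc, trunc + 1)              # truncated integer lattice
    weights = np.exp(-0.5 * q * n**2 - np.outer(z, n))
    return (weights * n).sum(axis=1) / weights.sum(axis=1)
```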