Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Konstantinos Skianis

On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets

May 30, 2025

Giannis Nikolentzos, Konstantinos Skianis

Abstract:The Lipschitz constant of a neural network is connected to several important properties of the network such as its robustness and generalization. It is thus useful in many settings to estimate the Lipschitz constant of a model. Prior work has focused mainly on estimating the Lipschitz constant of multi-layer perceptrons and convolutional neural networks. Here we focus on data modeled as sets or multisets of vectors and on neural networks that can handle such data. These models typically apply some permutation invariant aggregation function, such as the sum, mean or max operator, to the input multisets to produce a single vector for each input sample. In this paper, we investigate whether these aggregation functions are Lipschitz continuous with respect to three distance functions for unordered multisets, and we compute their Lipschitz constants. In the general case, we find that each aggregation function is Lipschitz continuous with respect to only one of the three distance functions. Then, we build on these results to derive upper bounds on the Lipschitz constant of neural networks that can process multisets of vectors, while we also study their stability to perturbations and generalization under distribution shifts. To empirically verify our theoretical analysis, we conduct a series of experiments on datasets from different domains.

Via

Access Paper or Ask Questions

Leveraging LLMs for Translating and Classifying Mental Health Data

Oct 16, 2024

Konstantinos Skianis, A. Seza Doğruöz, John Pavlopoulos

Abstract:Large language models (LLMs) are increasingly used in medical fields. In mental health support, the early identification of linguistic markers associated with mental health conditions can provide valuable support to mental health professionals, and reduce long waiting times for patients. Despite the benefits of LLMs for mental health support, there is limited research on their application in mental health systems for languages other than English. Our study addresses this gap by focusing on the detection of depression severity in Greek through user-generated posts which are automatically translated from English. Our results show that GPT3.5-turbo is not very successful in identifying the severity of depression in English, and it has a varying performance in Greek as well. Our study underscores the necessity for further research, especially in languages with less resources. Also, careful implementation is necessary to ensure that LLMs are used effectively in mental health platforms, and human supervision remains crucial to avoid misdiagnosis.

Via

Access Paper or Ask Questions

Graph Reasoning with Large Language Models via Pseudo-code Prompting

Sep 26, 2024

Konstantinos Skianis, Giannis Nikolentzos, Michalis Vazirgiannis

Abstract:Large language models (LLMs) have recently achieved remarkable success in various reasoning tasks in the field of natural language processing. This success of LLMs has also motivated their use in graph-related tasks. Among others, recent work has explored whether LLMs can solve graph problems such as counting the number of connected components of a graph or computing the shortest path distance between two nodes. Although LLMs possess preliminary graph reasoning abilities, they might still struggle to solve some seemingly simple problems. In this paper, we investigate whether prompting via pseudo-code instructions can improve the performance of LLMs in solving graph problems. Our experiments demonstrate that using pseudo-code instructions generally improves the performance of all considered LLMs. The graphs, pseudo-code prompts, and evaluation code are publicly available.

Via

Access Paper or Ask Questions

Severity Prediction in Mental Health: LLM-based Creation, Analysis, Evaluation of a Novel Multilingual Dataset

Sep 25, 2024

Konstantinos Skianis, John Pavlopoulos, A. Seza Doğruöz

Figure 1 for Severity Prediction in Mental Health: LLM-based Creation, Analysis, Evaluation of a Novel Multilingual Dataset

Figure 2 for Severity Prediction in Mental Health: LLM-based Creation, Analysis, Evaluation of a Novel Multilingual Dataset

Figure 3 for Severity Prediction in Mental Health: LLM-based Creation, Analysis, Evaluation of a Novel Multilingual Dataset

Figure 4 for Severity Prediction in Mental Health: LLM-based Creation, Analysis, Evaluation of a Novel Multilingual Dataset

Abstract:Large Language Models (LLMs) are increasingly integrated into various medical fields, including mental health support systems. However, there is a gap in research regarding the effectiveness of LLMs in non-English mental health support applications. To address this problem, we present a novel multilingual adaptation of widely-used mental health datasets, translated from English into six languages (Greek, Turkish, French, Portuguese, German, and Finnish). This dataset enables a comprehensive evaluation of LLM performance in detecting mental health conditions and assessing their severity across multiple languages. By experimenting with GPT and Llama, we observe considerable variability in performance across languages, despite being evaluated on the same translated dataset. This inconsistency underscores the complexities inherent in multilingual mental health support, where language-specific nuances and mental health data coverage can affect the accuracy of the models. Through comprehensive error analysis, we emphasize the risks of relying exclusively on large language models (LLMs) in medical settings (e.g., their potential to contribute to misdiagnoses). Moreover, our proposed approach offers significant cost savings for multilingual tasks, presenting a major advantage for broad-scale implementation.

Via

Access Paper or Ask Questions

Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Sep 26, 2022

Konstantinos Katserelis, Konstantinos Skianis

Figure 1 for Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Figure 2 for Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Figure 3 for Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Figure 4 for Towards Fine-Dining Recipe Generation with Generative Pre-trained Transformers

Abstract:Food is essential to human survival. So much so that we have developed different recipes to suit our taste needs. In this work, we propose a novel way of creating new, fine-dining recipes from scratch using Transformers, specifically auto-regressive language models. Given a small dataset of food recipes, we try to train models to identify cooking techniques, propose novel recipes, and test the power of fine-tuning with minimal data.

Via

Access Paper or Ask Questions

Evaluation of Greek Word Embeddings

Apr 08, 2019

Stamatis Outsios, Christos Karatsalos, Konstantinos Skianis, Michalis Vazirgiannis

Figure 1 for Evaluation of Greek Word Embeddings

Figure 2 for Evaluation of Greek Word Embeddings

Figure 3 for Evaluation of Greek Word Embeddings

Figure 4 for Evaluation of Greek Word Embeddings

Abstract:Since word embeddings have been the most popular input for many NLP tasks, evaluating their quality is of critical importance. Most research efforts are focusing on English word embeddings. This paper addresses the problem of constructing and evaluating such models for the Greek language. We created a new word analogy corpus considering the original English Word2vec word analogy corpus and some specific linguistic aspects of the Greek language as well. Moreover, we created a Greek version of WordSim353 corpora for a basic evaluation of word similarities. We tested seven word vector models and our evaluation showed that we are able to create meaningful representations. Last, we discovered that the morphological complexity of the Greek language and polysemy can influence the quality of the resulting word embeddings.

Via

Access Paper or Ask Questions

Rep the Set: Neural Networks for Learning Set Representations

Apr 03, 2019

Konstantinos Skianis, Giannis Nikolentzos, Stratis Limnios, Michalis Vazirgiannis

Figure 1 for Rep the Set: Neural Networks for Learning Set Representations

Figure 2 for Rep the Set: Neural Networks for Learning Set Representations

Figure 3 for Rep the Set: Neural Networks for Learning Set Representations

Figure 4 for Rep the Set: Neural Networks for Learning Set Representations

Abstract:In several domains, data objects can be decomposed into sets of simpler objects. It is then natural to represent each object as the set of its components or parts. Many conventional machine learning algorithms are unable to process this kind of representations, since sets may vary in cardinality and elements lack a meaningful ordering. In this paper, we present a new neural network architecture, called RepSet, that can handle examples that are represented as sets of vectors. The proposed model computes the correspondences between an input set and some hidden sets by solving a series of network flow problems. This representation is then fed to a standard neural network architecture to produce the output. The architecture allows end-to-end gradient-based learning. We demonstrate RepSet on classification tasks, including text categorization, and graph classification, and we show that the proposed neural network achieves performance better or comparable to state-of-the-art algorithms.

Via

Access Paper or Ask Questions

Word Embeddings from Large-Scale Greek Web Content

Oct 26, 2018

Stamatis Outsios, Konstantinos Skianis, Polykarpos Meladianos, Christos Xypolopoulos, Michalis Vazirgiannis

Abstract:Word embeddings are undoubtedly very useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest to date digital Greek language corpus. We also present a live web tool for testing the Greek word embeddings, by offering "analogy", "similarity score" and "most similar words" functions. Through our explorer, one could interact with the Greek word vectors.

Via

Access Paper or Ask Questions

Orthogonal Matching Pursuit for Text Classification

Oct 09, 2018

Konstantinos Skianis, Nikolaos Tziortziotis, Michalis Vazirgiannis

Figure 1 for Orthogonal Matching Pursuit for Text Classification

Figure 2 for Orthogonal Matching Pursuit for Text Classification

Figure 3 for Orthogonal Matching Pursuit for Text Classification

Figure 4 for Orthogonal Matching Pursuit for Text Classification

Abstract:In text classification, the problem of overfitting arises due to the high dimensionality, making regularization essential. Although classic regularizers provide sparsity, they fail to return highly accurate models. On the contrary, state-of-the-art group-lasso regularizers provide better results at the expense of low sparsity. In this paper, we apply a greedy variable selection algorithm, called Orthogonal Matching Pursuit, for the text classification task. We also extend standard group OMP by introducing overlapping Group OMP to handle overlapping groups of features. Empirical analysis verifies that both OMP and overlapping GOMP constitute powerful regularizers, able to produce effective and very sparse models. Code and data are available online: https://github.com/y3nk0/OMP-for-Text-Classification .

Via

Access Paper or Ask Questions

Kernel Graph Convolutional Neural Networks

Sep 07, 2018

Giannis Nikolentzos, Polykarpos Meladianos, Antoine Jean-Pierre Tixier, Konstantinos Skianis, Michalis Vazirgiannis

Figure 1 for Kernel Graph Convolutional Neural Networks

Figure 2 for Kernel Graph Convolutional Neural Networks

Figure 3 for Kernel Graph Convolutional Neural Networks

Abstract:Graph kernels have been successfully applied to many graph classification problems. Typically, a kernel is first designed, and then an SVM classifier is trained based on the features defined implicitly by this kernel. This two-stage approach decouples data representation from learning, which is suboptimal. On the other hand, Convolutional Neural Networks (CNNs) have the capability to learn their own features directly from the raw data during training. Unfortunately, they cannot handle irregular data such as graphs. We address this challenge by using graph kernels to embed meaningful local neighborhoods of the graphs in a continuous vector space. A set of filters is then convolved with these patches, pooled, and the output is then passed to a feedforward network. With limited parameter tuning, our approach outperforms strong baselines on 7 out of 10 benchmark datasets.

* Accepted at ICANN '18

Via

Access Paper or Ask Questions