Konstantinos Diamantaras

Learning local discrete features in explainable-by-design convolutional neural networks

Oct 31, 2024

Pantelis I. Kaplanoglou, Konstantinos Diamantaras

Abstract: Our proposed framework attempts to break the trade-off between performance and explainability by introducing an explainable-by-design convolutional neural network (CNN) based on the lateral inhibition mechanism. The ExplaiNet model consists of the predictor, which is a high-accuracy CNN with residual or dense skip connections, and the explainer, a probabilistic graph that expresses the spatial interactions of the network neurons. The value on each graph node is a local discrete feature (LDF) vector, a patch descriptor that represents the indices of antagonistic neurons ordered by the strength of their activations, which are learned with gradient descent. Using LDFs as sequences, we can increase the conciseness of explanations by repurposing EXTREME, an EM-based sequence motif discovery method that is typically used in molecular biology. Having a discrete feature motif matrix for each of the intermediate image representations, instead of a continuous activation tensor, allows us to leverage the inherent explainability of Bayesian networks. By collecting observations and directly calculating probabilities, we can explain causal relationships between motifs of adjacent levels and attribute the model's output to global motifs. Moreover, experiments on various tiny image benchmark datasets confirm that our predictor ensures the same level of performance as the baseline architecture for a given count of parameters and/or layers. Our novel method shows promise to exceed this performance while providing an additional stream of explanations. In the solved MNIST classification task, it reaches performance comparable to the state of the art for single models, using a standard training setup and 0.75 million parameters.
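The LDF idea described in the abstract (indices of competing neurons ordered by activation strength) can be illustrated with a minimal sketch. The function names, the choice of `k`, and the assumption that all channels at one spatial position compete with each other are hypothetical and not taken from the paper.

```python
from typing import List, Sequence


def local_discrete_feature(activations: Sequence[float], k: int = 3) -> List[int]:
    """Return the indices of the k strongest activations, strongest first.

    Assumption: `activations` holds the responses of all competing
    (antagonistic) neurons at a single spatial position.
    """
    order = sorted(range(len(activations)),
                   key=lambda i: activations[i],
                   reverse=True)
    return order[:k]


def ldf_map(feature_map: Sequence[Sequence[Sequence[float]]],
            k: int = 3) -> List[List[List[int]]]:
    """Apply the descriptor to every position of a [height][width][channels] map."""
    return [[local_discrete_feature(cell, k) for cell in row]
            for row in feature_map]


# One position with four channels: channel 1 fires most, then 3, then 2.
print(local_discrete_feature([0.1, 0.9, 0.4, 0.7], k=3))
```

Each position is thereby reduced from a continuous vector to a short sequence of integers, which is the kind of input a sequence motif discovery method can consume.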

Enhanced Deep Learning Methodologies and MRI Selection Techniques for Dementia Diagnosis in the Elderly Population

Jul 25, 2024

Nikolaos Ntampakis, Konstantinos Diamantaras, Ioanna Chouvarda, Vasileios Argyriou, Panagiotis Sarigianndis

Abstract: Dementia, a debilitating neurological condition affecting millions worldwide, presents significant diagnostic challenges. In this work, we introduce a novel methodology for the classification of demented and non-demented elderly patients using 3D brain Magnetic Resonance Imaging (MRI) scans. Our approach features a unique technique for selectively processing MRI slices, focusing on the most relevant brain regions and excluding less informative sections. This methodology is complemented by a confidence-based classification committee composed of three custom deep learning models: Dem3D ResNet, Dem3D CNN, and Dem3D EfficientNet. These models work synergistically to enhance decision-making accuracy, leveraging their collective strengths. Tested on the Open Access Series of Imaging Studies (OASIS) dataset, our method achieved an accuracy of 94.12%, surpassing existing methodologies. Furthermore, validation on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset confirmed the robustness and generalizability of our approach. The use of explainable AI (XAI) techniques and comprehensive ablation studies further substantiates the effectiveness of our techniques, providing insights into the decision-making process and the importance of our methodology. This research offers a significant advancement in dementia diagnosis, providing a highly accurate and efficient tool for clinical applications.
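One plausible reading of a confidence-based committee is sketched below: each member outputs class probabilities and the committee adopts the decision of the most confident member. The abstract does not state the actual combination rule, so this rule, the function name, and the example probabilities are assumptions for illustration only.

```python
from typing import Dict, Sequence, Tuple


def committee_predict(member_probs: Dict[str, Sequence[float]]) -> Tuple[str, int, float]:
    """Return (member name, predicted class, confidence) of the most confident member.

    Assumption: confidence is the largest class probability a member assigns.
    """
    best = ("", -1, -1.0)
    for name, probs in member_probs.items():
        label = max(range(len(probs)), key=lambda c: probs[c])
        if probs[label] > best[2]:
            best = (name, label, probs[label])
    return best


# Hypothetical outputs for one scan; class 0 = non-demented, class 1 = demented.
scan_probs = {
    "Dem3D ResNet": [0.60, 0.40],
    "Dem3D CNN": [0.20, 0.80],
    "Dem3D EfficientNet": [0.55, 0.45],
}
print(committee_predict(scan_probs))
```

Other rules, such as averaging probabilities or majority voting with a confidence threshold, would fit the same interface.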

Automated Single-Label Patent Classification using Ensemble Classifiers

Mar 03, 2022

Eleni Kamateri, Vasileios Stamatis, Konstantinos Diamantaras, Michail Salampasis

Abstract: Many thousands of patent applications arrive at patent offices around the world every day. One important subtask when a patent application is submitted is to assign one or more classification codes from the complex and hierarchical patent classification schemes, which enables routing of the application to a patent examiner who is knowledgeable about the specific technical field. This task is typically undertaken by patent professionals; however, due to the large number of applications and the potential complexity of an invention, they are usually overwhelmed. Therefore, there is a need for this manual code-assignment task to be supported or even fully automated by classification systems that classify patent applications, ideally with an accuracy close to that of patent professionals. As in many other text analysis problems, in recent years this intellectually demanding task has been studied using word embeddings and deep learning techniques. In this paper, we briefly review these research efforts and experiment with similar deep learning techniques using different feature representations for automatic patent classification at the sub-class level. On top of that, we present an innovative method of ensemble classifiers trained on different parts of the patent document. To the best of our knowledge, this is the first time that an ensemble method has been proposed for the patent classification problem. Our first results are quite promising, showing that an ensemble architecture of classifiers significantly outperforms current state-of-the-art techniques that use the same classifiers as standalone solutions.

* Published in ICMLC 2022

A Graph-based Method for Session-based Recommendations

Jun 22, 2021

Marina Delianidi, Michail Salampasis, Konstantinos Diamantaras, Theodosios Siomos, Alkiviadis Katsalis, Iphigenia Karaveli

Abstract: We present a graph-based approach for the data management tasks and the efficient operation of a system for session-based next-item recommendations. The proposed method can collect data continuously and incrementally from an e-commerce website, thus seamlessly preparing the necessary data infrastructure for the recommendation algorithm to operate without any excessive training phase. Our work aims at developing a recommender method that balances data processing and management efficiency requirements against the effectiveness of the recommendations produced. We use the Neo4j graph database to implement a prototype of such a system. Furthermore, we use an industry dataset corresponding to a typical e-commerce session-based scenario, and we report on experiments using our graph-based approach and other state-of-the-art machine learning and deep learning methods.

* Preprint version of the paper, the original paper is published on ACM DL. 6 pages, 1 figure, 1 table

Student Performance Prediction Using Dynamic Neural Models

Jun 01, 2021

Marina Delianidi, Konstantinos Diamantaras, George Chrysogonidis, Vasileios Nikiforidis

Abstract: We address the problem of predicting the correctness of a student's response to the next exam question based on their previous interactions in the course of their learning and evaluation process. We model student performance as a dynamic problem and compare the two major classes of dynamic neural architectures for its solution, namely the finite-memory Time Delay Neural Networks (TDNN) and the potentially infinite-memory Recurrent Neural Networks (RNN). Since the next response is a function of the knowledge state of the student and this, in turn, is a function of their previous responses and the skills associated with the previous questions, we propose a two-part network architecture. The first part employs a dynamic neural network (either TDNN or RNN) to trace the student knowledge state. The second part sits on top of the dynamic part and is a multi-layer feed-forward network that completes the classification task of predicting the student response based on our estimate of the student knowledge state. Both input skills and previous responses are encoded using different embeddings. For the skill embeddings, we tried two different initialization schemes using (a) random vectors and (b) pretrained vectors matching the textual descriptions of the skills. Our experiments show that the RNN approach performs better than the TDNN approach on all datasets that we have used. Also, we show that our RNN architecture outperforms the state-of-the-art models in four out of five datasets. It is worth noting that the TDNN approach also outperforms the state-of-the-art models in four out of five datasets, although it is slightly worse than our proposed RNN approach. Finally, contrary to our expectations, we find that the initialization of skill embeddings using pretrained vectors offers practically no advantage over random initialization.

* 9 pages, 4 figures, to be published in EDM 2021: the 14th International Conference on Educational Data Mining, June 29 - July 2, 2021, Paris, France

Sparse Antenna Array Design for MIMO Radar Using Softmax Selection

Feb 09, 2021

Konstantinos Diamantaras, Zhaoyi Xu, Athina Petropulu

Abstract: MIMO transmit arrays allow for flexible design of the transmit beampattern. However, the large number of elements required to achieve a given level of performance using uniform linear arrays (ULA) may be too costly. This motivates the need for thinned arrays, obtained by appropriately selecting a small number of elements so that the full array beampattern is preserved. In this paper, we propose Learn-to-Select (L2S), a novel machine learning model for selecting antennas from a dense ULA, employing a combination of multiple Softmax layers constrained by an orthogonalization criterion. The proposed approach can be efficiently scaled to larger problems as it avoids the combinatorial explosion of the selection problem. It also offers a flexible array design framework, as the selection problem can be easily formulated for any metric.
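The combination of multiple Softmax layers with an orthogonalization criterion can be sketched as follows, under stated assumptions: each Softmax row is a soft choice of one antenna, and a penalty on the overlap between rows discourages two rows from choosing the same antenna. The specific penalty (squared dot products between rows) and all names are hypothetical; the abstract does not give the exact criterion.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one row of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def selection_matrix(logit_rows: Sequence[Sequence[float]]) -> List[List[float]]:
    """One softmax row per antenna to be selected; columns index the dense ULA."""
    return [softmax(row) for row in logit_rows]


def orthogonality_penalty(S: Sequence[Sequence[float]]) -> float:
    """Sum of squared dot products between distinct rows (assumed criterion)."""
    penalty = 0.0
    for i in range(len(S)):
        for j in range(i + 1, len(S)):
            dot = sum(a * b for a, b in zip(S[i], S[j]))
            penalty += dot * dot
    return penalty


def selected_antennas(S: Sequence[Sequence[float]]) -> List[int]:
    """Hard selection: the most probable antenna index of each row."""
    return [max(range(len(row)), key=lambda n: row[n]) for row in S]


# Two rows picking different antennas out of four give a small penalty.
distinct = selection_matrix([[5.0, 0.0, 0.0, 0.0], [0.0, 0.0, 5.0, 0.0]])
# Two rows picking the same antenna give a large penalty.
clashing = selection_matrix([[5.0, 0.0, 0.0, 0.0], [5.0, 0.0, 0.0, 0.0]])
print(selected_antennas(distinct), orthogonality_penalty(distinct))
print(selected_antennas(clashing), orthogonality_penalty(clashing))
```

In a trainable model the logits would be learned jointly with a beampattern-matching loss, with the penalty added as a regularizer; because the number of parameters grows with rows times columns, no enumeration of antenna subsets is needed.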

* arXiv admin note: text overlap with arXiv:2101.06837

Learning to Select for MIMO Radar based on Hybrid Analog-Digital Beamforming

Jan 18, 2021

Zhaoyi Xu, Fan Liu, Konstantinos Diamantaras, Christos Masouros, Athina Petropulu

Abstract: In this paper, we propose an energy-efficient radar beampattern design framework for a Millimeter Wave (mmWave) massive multi-input multi-output (mMIMO) system, equipped with a hybrid analog-digital (HAD) beamforming structure. Aiming to reduce the power consumption and hardware cost of the mMIMO system, we employ a machine learning approach to synthesize the probing beampattern based on a small number of RF chains and antennas. By leveraging a combination of softmax neural networks, the proposed solution is able to achieve a desirable beampattern with high accuracy.

Machine Learning Sentiment Prediction based on Hybrid Document Representation

Nov 29, 2015

Panagiotis Stalidis, Maria Giatsoglou, Konstantinos Diamantaras, George Sarigiannidis, Konstantinos Ch. Chatzisavvas

Abstract: Automated sentiment analysis and opinion mining is a complex process concerning the extraction of useful subjective information from text. The explosion of user-generated content on the Web, especially the fact that millions of users express their opinions on products and services daily on blogs, wikis, social networks, message boards, etc., renders the reliable, automated extraction of sentiments and opinions from unstructured text crucial for several commercial applications. In this paper, we present a novel hybrid vectorization approach for textual resources that combines a weighted variant of the popular Word2Vec representation (based on Term Frequency-Inverse Document Frequency) with a Bag-of-Words representation and a vector of lexicon-based sentiment values. The proposed text representation approach is assessed through the application of several machine learning classification algorithms on a dataset that is used extensively in the literature for sentiment detection. The classification accuracy achieved with the proposed hybrid vectorization approach is higher than when its individual components are used for text representation, and comparable with state-of-the-art sentiment detection methodologies.
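The hybrid representation can be sketched as a concatenation of three parts: a TF-IDF-weighted average of word vectors, Bag-of-Words counts, and lexicon-based sentiment values. The toy embeddings, the IDF formula, the two-value sentiment summary, and all names below are assumptions for illustration, not the paper's exact design.

```python
import math
from collections import Counter
from typing import Dict, List, Sequence


def idf(corpus: Sequence[Sequence[str]]) -> Dict[str, float]:
    """Smoothed inverse document frequency: ln(N / df) + 1 (assumed variant)."""
    n = len(corpus)
    df = Counter(w for doc in corpus for w in set(doc))
    return {w: math.log(n / df[w]) + 1.0 for w in df}


def hybrid_vector(doc: Sequence[str],
                  embeddings: Dict[str, Sequence[float]],
                  idf_weights: Dict[str, float],
                  vocab: Sequence[str],
                  lexicon: Dict[str, float]) -> List[float]:
    counts = Counter(doc)
    dim = len(next(iter(embeddings.values())))

    # Part 1: TF-IDF-weighted average of the word vectors.
    weighted = [0.0] * dim
    total = 0.0
    for word, tf in counts.items():
        if word in embeddings and word in idf_weights:
            weight = tf * idf_weights[word]
            total += weight
            for d in range(dim):
                weighted[d] += weight * embeddings[word][d]
    if total > 0:
        weighted = [v / total for v in weighted]

    # Part 2: Bag-of-Words counts over a fixed vocabulary.
    bow = [float(counts[w]) for w in vocab]

    # Part 3: summed positive and negative lexicon scores.
    scores = [lexicon.get(w, 0.0) for w in doc]
    positive = sum((s for s in scores if s > 0), 0.0)
    negative = sum((s for s in scores if s < 0), 0.0)

    return weighted + bow + [positive, negative]


corpus = [["good", "movie"], ["bad", "movie"]]
toy_embeddings = {"good": [1.0, 0.0], "bad": [-1.0, 0.0], "movie": [0.0, 1.0]}
toy_lexicon = {"good": 1.0, "bad": -1.0}
vec = hybrid_vector(corpus[0], toy_embeddings, idf(corpus),
                    ["bad", "good", "movie"], toy_lexicon)
print(len(vec), vec)
```

The rarer word "good" receives a larger IDF weight than "movie", so it dominates the averaged embedding; the resulting vector can be passed to any standard classifier.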
