Abstract: Modern transformer-based encoder-decoder architectures struggle with reasoning tasks due to their inability to effectively extract relational information between input objects (data/tokens). Recent work introduced the Abstractor module, embedded between transformer layers, to address this gap. However, while the Abstractor layer excels at capturing relational information (pure relational reasoning), it faces challenges in tasks that require both object-level and relational reasoning (partial relational reasoning). To address this, we propose RESOLVE, a neuro-vector-symbolic architecture that combines object-level features with relational representations in high-dimensional spaces, using fast and efficient operations such as bundling (summation) and binding (Hadamard product). This allows object-level features and relational representations to coexist within the same structure without interfering with one another. RESOLVE is driven by a novel attention mechanism that operates in a bipolar high-dimensional space, enabling faster attention-score computation than the state of the art. By leveraging this design, the model achieves both low compute latency and memory efficiency. RESOLVE also generalizes better and achieves higher accuracy than state-of-the-art methods on purely relational reasoning tasks such as sorting, as well as on partial relational reasoning tasks such as math problem solving.
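The bundling and binding operations named above are cheap, elementwise vector operations. The following minimal numpy sketch illustrates why they let object-level and relational content coexist; the dimensionality, the sign-based bipolarization, and the random hypervectors are illustrative assumptions, not RESOLVE's actual parameters.

```python
import numpy as np

D = 4096                      # hypervector dimensionality (illustrative choice)
rng = np.random.default_rng(0)

def bipolar(x):
    """Project a real vector to the bipolar space {-1, +1}^D via the sign function."""
    return np.where(x >= 0, 1, -1)

# Two object-level feature hypervectors and one relational hypervector
obj_a = bipolar(rng.standard_normal(D))
obj_b = bipolar(rng.standard_normal(D))
rel   = bipolar(rng.standard_normal(D))

# Binding (Hadamard product): attaches the relation to an object; the result
# is nearly orthogonal to both inputs, so it does not interfere with them.
bound = obj_a * rel

# Bundling (summation): superimposes several hypervectors into one
# representation in which each component remains recoverable by similarity.
bundle = bound + obj_b

cos = lambda u, v: u @ v / (np.linalg.norm(u) * np.linalg.norm(v))
print(f"sim(bundle, obj_b) = {cos(bundle, obj_b):.2f}")   # high (~0.71)
print(f"sim(bound, obj_a)  = {cos(bound, obj_a):.2f}")    # near zero
```

Because binding produces a vector nearly orthogonal to its inputs, relational content added to a bundle does not corrupt the object-level content stored alongside it.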
Abstract: Human cognition excels at symbolic reasoning, deducing abstract rules from limited samples. This ability has been explained using both symbolic and connectionist approaches, inspiring the development of neuro-symbolic architectures that combine the two paradigms. In parallel, recent studies have proposed the use of a "relational bottleneck" that separates object-level features from abstract rules, allowing learning from limited amounts of data. While powerful, the relational bottleneck is vulnerable to the curse of compositionality, meaning that object representations with similar features tend to interfere with one another. In this paper, we leverage hyperdimensional computing, which is inherently robust to such interference, to build a compositional architecture. We adapt the "relational bottleneck" strategy to a high-dimensional space, incorporating explicit vector binding operations between symbols and relational representations. Additionally, we design a novel high-dimensional attention mechanism that leverages this relational representation. Our system benefits from the low overhead of operations in hyperdimensional space, making it significantly more efficient than the state of the art when evaluated on a variety of test datasets, while maintaining higher or equal accuracy.
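As a schematic illustration of the relational-bottleneck idea in a hypervector setting, the sketch below lets downstream computation see only pairwise inner products between objects, which are then bound to fixed symbol hypervectors; the dimensions, the random lift, and all names are our assumptions rather than the paper's exact formulation.

```python
import numpy as np

D, n_obj, d = 2048, 4, 16
rng = np.random.default_rng(1)

objects = rng.standard_normal((n_obj, d))            # low-dim object features
symbols = np.sign(rng.standard_normal((n_obj, D)))   # fixed bipolar symbol hypervectors

# Relational bottleneck: downstream layers see only pairwise inner products
# between objects, never the object features themselves.
relations = objects @ objects.T                      # (n_obj, n_obj)

# Encode each relation row as a hypervector, then bind (Hadamard product)
# it to that object's symbol; bundling (summing) the results yields one
# abstract representation that carries relations without raw features.
proj = rng.standard_normal((n_obj, D))               # random lift to HD space (assumed)
rel_hvs = np.sign(relations @ proj)                  # bipolar relational hypervectors
abstract = (symbols * rel_hvs).sum(axis=0)           # bind, then bundle
```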
Abstract: In recent years, both online and offline deep learning models have been developed for time-series forecasting. However, offline deep forecasting models fail to adapt effectively to changes in time-series data, while online deep forecasting models are often expensive and have complex training procedures. In this paper, we reframe the online nonlinear time-series forecasting problem as one of linear hyperdimensional time-series forecasting. Nonlinear low-dimensional time-series data is mapped to high-dimensional (hyperdimensional) spaces for linear hyperdimensional prediction, allowing fast, efficient, and lightweight online time-series forecasting. Our framework, TSF-HD, adapts to time-series distribution shifts through a novel co-training scheme for its hyperdimensional mapping and its linear hyperdimensional predictor. TSF-HD outperforms the state of the art in both short-term and long-term time-series forecasting while offering lower inference latency. Our code is publicly available at http://github.com/tsfhd2024/tsf-hd.git.
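A minimal sketch of this reframing, assuming a trainable tanh lift and plain gradient updates (the actual TSF-HD encoder and co-training objective may differ): a nonlinear map lifts each lookback window into a high-dimensional space where a purely linear layer forecasts, and both parts are updated together on every new window.

```python
import numpy as np
import tensorflow as tf

L, H, D = 32, 8, 512   # lookback window, forecast horizon, HD dimension (assumed)

# Trainable hyperdimensional encoder (nonlinear lift) followed by a
# purely linear predictor operating in the high-dimensional space.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(L,)),
    tf.keras.layers.Dense(D, activation="tanh"),   # hyperdimensional mapping
    tf.keras.layers.Dense(H),                      # linear HD predictor
])
model.compile(optimizer="adam", loss="mse")

# Online co-training: both the HD mapping and the linear predictor are
# updated on each incoming window, adapting to distribution shift.
series = np.sin(np.linspace(0, 60, 2000)) + 0.1 * np.random.randn(2000)
errs = []
for t in range(L, len(series) - H, H):
    x = series[t - L:t][None, :].astype("float32")
    y = series[t:t + H][None, :].astype("float32")
    errs.append(float(tf.reduce_mean((model(x) - y) ** 2)))  # predict, then adapt
    model.train_on_batch(x, y)
print(f"mean online MSE: {np.mean(errs):.4f}")
```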
Abstract: This paper investigates the impact of feature encoding techniques on the explainability of XAI (Explainable Artificial Intelligence) algorithms. Using a malware classification dataset, we trained an XGBoost model and compared the performance of two feature encoding methods: Label Encoding (LE) and One-Hot Encoding (OHE). Our findings reveal a marginal performance loss when using OHE instead of LE. However, the more detailed explanations provided by OHE compensate for this loss. We observed that OHE enables deeper exploration of details in both global and local contexts, facilitating more comprehensive answers. Additionally, using OHE resulted in smaller explanation files and reduced analysis time for human analysts. These findings emphasize the significance of feature encoding choices in XAI research and suggest potential for further exploration incorporating additional encoding methods and innovative visualization approaches.
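The contrast between the two encodings is easy to see on a toy column. The sketch below uses scikit-learn's encoders; the feature name and values are hypothetical stand-ins for the malware dataset's categorical attributes.

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder, OneHotEncoder

# Hypothetical categorical column standing in for a malware-dataset attribute.
api_calls = np.array([["read"], ["write"], ["exec"], ["write"], ["read"]])

# Label Encoding: a single integer column; compact, but imposes a spurious
# order and yields one aggregated importance/SHAP value for the whole feature.
x_le = LabelEncoder().fit_transform(api_calls.ravel()).reshape(-1, 1)

# One-Hot Encoding: one binary column per category; larger, but each category
# receives its own importance value, enabling finer-grained explanations.
x_ohe = OneHotEncoder(sparse_output=False).fit_transform(api_calls)  # scikit-learn >= 1.2

print(x_le.shape, x_ohe.shape)   # (5, 1) vs (5, 3)
```

With OHE, a tree ensemble such as XGBoost assigns an importance (or SHAP value) to each category separately, which is what enables the finer-grained explanations reported above.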
Abstract: Breast cancer is one of the leading causes of mortality among women. The most widely used method for diagnosing breast cancer is the ultrasound scan. Several key features captured through ultrasound scans, such as the smoothness and the texture of the tumor, encode the abnormality of breast tumors (distinguishing malignant from benign). However, ultrasound scans are often noisy and include irrelevant parts of the breast that may bias the segmentation of eventual tumors. In this paper, we extract the regions of interest (i.e., the bounding boxes of the tumors) and feed each one to a semantic segmentation encoder-decoder network selected according to its classification (i.e., malignant or benign). The whole process aims to build an instance-based segmenter from a semantic segmenter and an object detector.
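The pipeline composes three off-the-shelf components. The sketch below only fixes the dataflow; the detector, the classifier, and the two class-specific segmenters are hypothetical callables, not the paper's trained models.

```python
import numpy as np

def instance_segment(image, detector, classifier, segmenters):
    """Sketch of the proposed pipeline: detect tumor ROIs, classify each crop,
    and route it to the class-specific semantic encoder-decoder.

    detector:   image -> list of (x0, y0, x1, y1) boxes
    classifier: crop  -> "malignant" or "benign"
    segmenters: dict mapping class name -> encoder-decoder returning a 2D mask
    """
    masks = []
    for (x0, y0, x1, y1) in detector(image):
        crop = image[y0:y1, x0:x1]
        label = classifier(crop)
        mask = segmenters[label](crop)        # semantic mask for this ROI only
        full = np.zeros(image.shape[:2], dtype=mask.dtype)
        full[y0:y1, x0:x1] = mask             # paste back: one mask per instance
        masks.append((label, full))
    return masks
```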
Abstract: Fingerprint recognition is often a game-changing step in establishing evidence against criminals. However, criminals increasingly alter their fingerprints in a variety of ways to make it difficult for technicians and automatic sensors to recognize them, making it tedious for investigators to establish strong evidence in a forensic procedure. Deep learning, and convolutional architectures in particular, thus emerges as a prime candidate for assisting in the recognition of damaged fingerprints. In this paper, we focus on recognizing damaged fingerprints with Convolutional Long Short-Term Memory (ConvLSTM) networks. We present the architecture of our model and demonstrate its performance, which exceeds 95% accuracy and 99% precision, and approaches 95% recall and a 99% AUC.
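A minimal Keras sketch of a ConvLSTM fingerprint classifier: the input layout (scanning the print as a sequence of horizontal bands), the layer sizes, and the number of identities are our assumptions, not the paper's exact architecture.

```python
import tensorflow as tf

n_steps, band_h, band_w = 8, 16, 128   # 8 bands of a 128x128 grayscale print (assumed)
n_classes = 10                          # number of enrolled identities (assumed)

model = tf.keras.Sequential([
    tf.keras.Input(shape=(n_steps, band_h, band_w, 1)),
    # ConvLSTM scans the band sequence, mixing spatial and sequential context.
    tf.keras.layers.ConvLSTM2D(32, kernel_size=3, padding="same",
                               return_sequences=False),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(n_classes, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```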
Abstract: We present a novel and practical deep learning pipeline termed RandomForestMLP. Its core trainable classification engine consists of a convolutional neural network backbone followed by an ensemble of multi-layer perceptrons for the classification task. It is designed in the context of self- and semi-supervised learning tasks to avoid overfitting while training on very small datasets. The paper details the architecture of RandomForestMLP and presents different strategies for aggregating the neural networks' decisions. It then assesses the model's robustness to overfitting when trained on realistic image datasets and compares its classification performance with that of existing classifiers.
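A minimal Keras sketch of the described design, assuming soft-voting as the aggregation strategy (the paper compares several); the backbone depth, head width, and dropout rate are illustrative.

```python
import tensorflow as tf

def random_forest_mlp(input_shape, n_classes, n_heads=5):
    """Shared CNN backbone feeding an ensemble of independently initialized
    MLP heads whose softmax outputs are averaged (one plausible aggregation)."""
    inputs = tf.keras.Input(shape=input_shape)
    x = tf.keras.layers.Conv2D(32, 3, activation="relu")(inputs)
    x = tf.keras.layers.MaxPooling2D()(x)
    x = tf.keras.layers.Conv2D(64, 3, activation="relu")(x)
    x = tf.keras.layers.GlobalAveragePooling2D()(x)

    heads = []
    for _ in range(n_heads):
        h = tf.keras.layers.Dense(64, activation="relu")(x)
        h = tf.keras.layers.Dropout(0.5)(h)     # per-head dropout decorrelates members
        heads.append(tf.keras.layers.Dense(n_classes, activation="softmax")(h))

    outputs = tf.keras.layers.Average()(heads)  # soft-vote aggregation
    return tf.keras.Model(inputs, outputs)

model = random_forest_mlp((64, 64, 3), n_classes=10)
```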
Abstract: The analysis of the internal structure of trees is highly important to forest experts, biological scientists, and the wood industry. Traditionally, CT scanners have been considered the most reliable way to obtain an accurate representation of a tree's interior. However, this method requires a significant investment, reducing the cost-effectiveness of the operation. Our goal is to design neural-network-based methods that predict the internal density of a tree from its external bark shape. This paper compares image-to-image (2D), volume-to-volume (3D), and Convolutional Long Short-Term Memory (ConvLSTM)-based neural network architectures for predicting the defect distribution inside trees from their external bark shape. The models are trained on a synthetic dataset of 1,800 CT-scan-like volumetric structures of the internal density of trees and their corresponding external surfaces.
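As a concrete instance of one of the compared families, here is a minimal volume-to-volume (3D) encoder-decoder in Keras; the 64³ grid, the depths, and the filter counts are illustrative assumptions.

```python
import tensorflow as tf

# Minimal volume-to-volume (3D) encoder-decoder mapping an external
# bark-shape volume to an internal density volume.
inp = tf.keras.Input(shape=(64, 64, 64, 1))
x = tf.keras.layers.Conv3D(16, 3, strides=2, padding="same", activation="relu")(inp)
x = tf.keras.layers.Conv3D(32, 3, strides=2, padding="same", activation="relu")(x)
x = tf.keras.layers.Conv3DTranspose(16, 3, strides=2, padding="same", activation="relu")(x)
out = tf.keras.layers.Conv3DTranspose(1, 3, strides=2, padding="same")(x)
model = tf.keras.Model(inp, out)
model.compile(optimizer="adam", loss="mse")   # density regression
```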