Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Danial Kamali

NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

Dec 20, 2024

Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi

Figure 1 for NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

Figure 2 for NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

Figure 3 for NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

Figure 4 for NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

Abstract:Compositional generalization is crucial for artificial intelligence agents to solve complex vision-language reasoning tasks. Neuro-symbolic approaches have demonstrated promise in capturing compositional structures, but they face critical challenges: (a) reliance on predefined predicates for symbolic representations that limit adaptability, (b) difficulty in extracting predicates from raw data, and (c) using non-differentiable operations for combining primitive concepts. To address these issues, we propose NeSyCoCo, a neuro-symbolic framework that leverages large language models (LLMs) to generate symbolic representations and map them to differentiable neural computations. NeSyCoCo introduces three innovations: (a) augmenting natural language inputs with dependency structures to enhance the alignment with symbolic representations, (b) employing distributed word representations to link diverse, linguistically motivated logical predicates to neural modules, and (c) using the soft composition of normalized predicate scores to align symbolic and differentiable reasoning. Our framework achieves state-of-the-art results on the ReaSCAN and CLEVR-CoGenT compositional generalization benchmarks and demonstrates robust performance with novel concepts in the CLEVR-SYN benchmark.

* AAAI 2025 Project Page: https://iamdanialkamali.github.io/publication/neuro-symbolic-concept-composer

Via

Access Paper or Ask Questions

Syntax-Guided Transformers: Elevating Compositional Generalization and Grounding in Multimodal Environments

Nov 07, 2023

Danial Kamali, Parisa Kordjamshidi

Abstract:Compositional generalization, the ability of intelligent models to extrapolate understanding of components to novel compositions, is a fundamental yet challenging facet in AI research, especially within multimodal environments. In this work, we address this challenge by exploiting the syntactic structure of language to boost compositional generalization. This paper elevates the importance of syntactic grounding, particularly through attention masking techniques derived from text input parsing. We introduce and evaluate the merits of using syntactic information in the multimodal grounding problem. Our results on grounded compositional generalization underscore the positive impact of dependency parsing across diverse tasks when utilized with Weight Sharing across the Transformer encoder. The results push the state-of-the-art in multimodal grounding and parameter-efficient modeling and provide insights for future research.

Via

Access Paper or Ask Questions

Evaluating Persian Tokenizers

Feb 22, 2022

Danial Kamali, Behrooz Janfada, Mohammad Ebrahim Shenasa, Behrouz Minaei-Bidgoli

Figure 1 for Evaluating Persian Tokenizers

Figure 2 for Evaluating Persian Tokenizers

Figure 3 for Evaluating Persian Tokenizers

Figure 4 for Evaluating Persian Tokenizers

Abstract:Tokenization plays a significant role in the process of lexical analysis. Tokens become the input for other natural language processing tasks, like semantic parsing and language modeling. Natural Language Processing in Persian is challenging due to Persian's exceptional cases, such as half-spaces. Thus, it is crucial to have a precise tokenizer for Persian. This article provides a novel work by introducing the most widely used tokenizers for Persian and comparing and evaluating their performance on Persian texts using a simple algorithm with a pre-tagged Persian dependency dataset. After evaluating tokenizers with the F1-Score, the hybrid version of the Farsi Verb and Hazm with bounded morphemes fixing showed the best performance with an F1 score of 98.97%.

Via

Access Paper or Ask Questions