Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sadaf Ghaffari

Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics

Feb 24, 2024

Sadaf Ghaffari, Nikhil Krishnaswamy

Abstract:In this paper, we present an exploration of LLMs' abilities to problem solve with physical reasoning in situated environments. We construct a simple simulated environment and demonstrate examples of where, in a zero-shot setting, both text and multimodal LLMs display atomic world knowledge about various objects but fail to compose this knowledge in correct solutions for an object manipulation and placement task. We also use BLIP, a vision-language model trained with more sophisticated cross-modal attention, to identify cases relevant to object physical properties that that model fails to ground. Finally, we present a procedure for discovering the relevant properties of objects in the environment and propose a method to distill this knowledge back into the LLM.

* 10 pages, 10 figures, Proceedings of AAAI Spring Symposium: Empowering Machine Learning and Large Language Models with Domain and Commonsense Knowledge (MAKE). AAAI (2024)

Via

Access Paper or Ask Questions

Combining Automatic Coding and Instructor Input to Generate ENA Visualizations for Asynchronous Online Discussion

Aug 22, 2023

Marcia Moraes, Sadaf Ghaffari, Yanye Luther, James Folkestad

Figure 1 for Combining Automatic Coding and Instructor Input to Generate ENA Visualizations for Asynchronous Online Discussion

Figure 2 for Combining Automatic Coding and Instructor Input to Generate ENA Visualizations for Asynchronous Online Discussion

Figure 3 for Combining Automatic Coding and Instructor Input to Generate ENA Visualizations for Asynchronous Online Discussion

Figure 4 for Combining Automatic Coding and Instructor Input to Generate ENA Visualizations for Asynchronous Online Discussion

Abstract:Asynchronous online discussions are a common fundamental tool to facilitate social interaction in hybrid and online courses. However, instructors lack the tools to accomplish the overwhelming task of evaluating asynchronous online discussion activities. In this paper we present an approach that uses Latent Dirichlet Analysis (LDA) and the instructor's keywords to automatically extract codes from a relatively small dataset. We use the generated codes to build an Epistemic Network Analysis (ENA) model and compare this model with a previous ENA model built by human coders. The results show that there is no statistical difference between the two models. We present an analysis of these models and discuss the potential use of ENA as a visualization to help instructors evaluating asynchronous online discussions.

* 15 pages, 4 figures, 6 Tables, appearing in ICQE 2023 proceedings

Via

Access Paper or Ask Questions

Grounding and Distinguishing Conceptual Vocabulary Through Similarity Learning in Embodied Simulations

May 23, 2023

Sadaf Ghaffari, Nikhil Krishnaswamy

Abstract:We present a novel method for using agent experiences gathered through an embodied simulation to ground contextualized word vectors to object representations. We use similarity learning to make comparisons between different object types based on their properties when interacted with, and to extract common features pertaining to the objects' behavior. We then use an affine transformation to calculate a projection matrix that transforms contextualized word vectors from different transformer-based language models into this learned space, and evaluate whether new test instances of transformed token vectors identify the correct concept in the object embedding space. Our results expose properties of the embedding spaces of four different transformer models and show that grounding object token vectors is usually more helpful to grounding verb and attribute token vectors than the reverse, which reflects earlier conclusions in the analogical reasoning and psycholinguistic literature.

* Accepted at IWCS Conference

Via

Access Paper or Ask Questions

Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

Nov 08, 2022

Sadaf Ghaffari, Nikhil Krishnaswamy

Figure 1 for Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

Figure 2 for Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

Figure 3 for Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

Figure 4 for Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

Abstract:In this paper, we present methods for two types of metacognitive tasks in an AI system: rapidly expanding a neural classification model to accommodate a new category of object, and recognizing when a novel object type is observed instead of misclassifying the observation as a known class. Our methods take numerical data drawn from an embodied simulation environment, which describes the motion and properties of objects when interacted with, and we demonstrate that this type of representation is important for the success of novel type detection. We present a suite of experiments in rapidly accommodating the introduction of new categories and concepts and in novel type detection, and an architecture to integrate the two in an interactive system.

* arXiv admin note: substantial text overlap with arXiv:2204.08107

Via

Access Paper or Ask Questions

Automated Code Extraction from Discussion Board Text Dataset

Oct 31, 2022

Sina Mahdipour Saravani, Sadaf Ghaffari, Yanye Luther, James Folkestad, Marcia Moraes

Figure 1 for Automated Code Extraction from Discussion Board Text Dataset

Figure 2 for Automated Code Extraction from Discussion Board Text Dataset

Figure 3 for Automated Code Extraction from Discussion Board Text Dataset

Figure 4 for Automated Code Extraction from Discussion Board Text Dataset

Abstract:This study introduces and investigates the capabilities of three different text mining approaches, namely Latent Semantic Analysis, Latent Dirichlet Analysis, and Clustering Word Vectors, for automating code extraction from a relatively small discussion board dataset. We compare the outputs of each algorithm with a previous dataset that was manually coded by two human raters. The results show that even with a relatively small dataset, automated approaches can be an asset to course instructors by extracting some of the discussion codes, which can be used in Epistemic Network Analysis.

Via

Access Paper or Ask Questions

Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

Apr 17, 2022

Nikhil Krishnaswamy, Sadaf Ghaffari

Figure 1 for Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

Figure 2 for Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

Figure 3 for Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

Figure 4 for Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

Abstract:In this paper we present a novel method for a naive agent to detect novel objects it encounters in an interaction. We train a reinforcement learning policy on a stacking task given a known object type, and then observe the results of the agent attempting to stack various other objects based on the same trained policy. By extracting embedding vectors from a convolutional neural net trained over the results of the aforementioned stacking play, we can determine the similarity of a given object to known object types, and determine if the given object is likely dissimilar enough to the known types to be considered a novel class of object. We present the results of this method on two datasets gathered using two different policies and demonstrate what information the agent needs to extract from its environment to make these novelty judgments.

Via

Access Paper or Ask Questions