Karin de Langis

A Framework for Robust Cognitive Evaluation of LLMs

Apr 03, 2025

Karin de Langis, Jong Inn Park, Bin Hu, Khanh Chi Le, Andreas Schramm, Michael C. Mensink, Andrew Elfenbein, Dongyeop Kang

Abstract:Emergent cognitive abilities in large language models (LLMs) have been widely observed, but their nature and underlying mechanisms remain poorly understood. A growing body of research draws on cognitive science to investigate LLM cognition, but standard methodologies and experimental pipelines have not yet been established. To address this gap, we develop CognitivEval, a framework for systematically evaluating the artificial cognitive capabilities of LLMs, with a particular emphasis on robustness in response collection. The key features of CognitivEval include: (i) automatic prompt permutations, and (ii) testing that gathers both generations and model probability estimates. Our experiments demonstrate that these features lead to more robust experimental outcomes. Using CognitivEval, we replicate five classic experiments in cognitive science, illustrating the framework's generalizability across various experimental tasks and obtaining a cognitive profile of several state-of-the-art LLMs. CognitivEval will be released publicly to foster broader collaboration within the cognitive science community.

Reinforcement Learning with Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation

Feb 21, 2024

Karin de Langis, Ryan Koo, Dongyeop Kang

Abstract:Style is an integral component of text that expresses a diverse set of information, including interpersonal dynamics (e.g. formality) and the author's emotions or attitudes (e.g. disgust). Humans often employ multiple styles simultaneously. An open question is how large language models can be explicitly controlled so that they weave together target styles when generating text: for example, to produce text that is both negative and non-toxic. Previous work investigates the controlled generation of a single style, or else controlled generation of a style and other attributes. In this paper, we expand this into controlling multiple styles simultaneously. Specifically, we investigate various formulations of multiple style rewards for a reinforcement learning (RL) approach to controlled multi-style generation. These reward formulations include calibrated outputs from discriminators and dynamic weighting by discriminator gradient magnitudes. We find that dynamic weighting generally outperforms static weighting approaches, and we explore its effectiveness in 2- and 3-style control, even compared to strong baselines like plug-and-play models. All code and data for RL pipelines with multiple style attributes will be publicly available.

infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information

Jun 12, 2023

Jaehyung Kim, Yekyung Kim, Karin de Langis, Jinwoo Shin, Dongyeop Kang

Abstract:The success of NLP systems often relies on the availability of large, high-quality datasets. However, not all samples in these datasets are equally valuable for learning, as some may be redundant or noisy. Several methods for characterizing datasets based on model-driven meta-information (e.g., model's confidence) have been developed, but the relationship and complementary effects of these methods have received less attention. In this paper, we introduce infoVerse, a universal framework for dataset characterization, which provides a new feature space that effectively captures multidimensional characteristics of datasets by incorporating various model-driven meta-information. infoVerse reveals distinctive regions of the dataset that are not apparent in the original semantic space, hence guiding users (or models) in identifying which samples to focus on for exploration, assessment, or annotation. Additionally, we propose a novel sampling method on infoVerse to select a set of data points that maximizes informativeness. In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines in all applications. Our code and demo are publicly available.

* 26 pages, accepted at ACL 2023 conference as a full paper

An Analysis of Reader Engagement in Literary Fiction through Eye Tracking and Linguistic Features

Jun 06, 2023

Rose Neis, Karin de Langis, Zae Myung Kim, Dongyeop Kang

Abstract:Capturing readers' engagement in fiction is a challenging but important aspect of narrative understanding. In this study, we collected 23 readers' reactions to 2 short stories through eye tracking, sentence-level annotations, and an overall engagement scale survey. We analyzed the significance of various qualities of the text in predicting how engaging a reader is likely to find it. As enjoyment of fiction is highly contextual, we also investigated individual differences in our data. Furthering our understanding of what captivates readers in fiction will help better inform models used in creative narrative generation and collaborative writing tools.

* 9 pages, 4 figures

A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models

Dec 19, 2022

Karin de Langis, Dongyeop Kang

Abstract:There is growing interest in incorporating eye-tracking data and other implicit measures of human language processing into natural language processing (NLP) pipelines. The data from human language processing contain unique insight into human linguistic understanding that could be exploited by language models. However, many unanswered questions remain about the nature of this data and how it can best be utilized in downstream NLP tasks. In this paper, we present eyeStyliency, an eye-tracking dataset for human processing of stylistic text (e.g., politeness). We develop a variety of methods to derive style saliency scores over text using the collected eye-tracking dataset. We further investigate how this saliency data compares to both human annotation methods and model-based interpretability metrics. We find that while eye-tracking data is unique, it also intersects with both human annotations and model-based importance scores, providing a possible bridge between human- and machine-based perspectives. In downstream few-shot learning tasks, adding salient words to prompts generally improved style classification, with eye-tracking-based and annotation-based salient words achieving the highest accuracy.

Semantically-Aware Strategies for Stereo-Visual Robotic Obstacle Avoidance

Jul 13, 2021

Jungseok Hong, Karin de Langis, Cole Wyeth, Christopher Walaszek, Junaed Sattar

Abstract:Mobile robots in unstructured, mapless environments must rely on an obstacle avoidance module to navigate safely. The standard avoidance techniques estimate the locations of obstacles with respect to the robot but are unaware of the obstacles' identities. Consequently, the robot cannot take advantage of semantic information about obstacles when making decisions about how to navigate. We propose an obstacle avoidance module that combines visual instance segmentation with a depth map to classify and localize objects in the scene. The system avoids obstacles differentially, based on the identity of the objects: for example, the system is more cautious in response to unpredictable objects such as humans. The system can also navigate closer to harmless obstacles and ignore obstacles that pose no collision danger, enabling it to navigate more efficiently. We validate our approach in two simulated environments: one terrestrial and one underwater. Results indicate that our approach is feasible and can enable more efficient navigation strategies.

An Analysis of Deep Object Detectors For Diver Detection

Nov 25, 2020

Karin de Langis, Michael Fulton, Junaed Sattar

Abstract:With the end goal of selecting and using diver detection models to support human-robot collaboration capabilities such as diver following, we thoroughly analyze a large set of deep neural networks for diver detection. We begin by producing a dataset of approximately 105,000 annotated images of divers sourced from videos -- one of the largest and most varied diver detection datasets ever created. Using this dataset, we train a variety of state-of-the-art deep neural networks for object detection, including SSD with Mobilenet, Faster R-CNN, and YOLO. Along with these single-frame detectors, we also train networks designed for detection of objects in a video stream, using temporal information as well as single-frame image information. We evaluate these networks on typical accuracy and efficiency metrics, as well as on the temporal stability of their detections. Finally, we analyze the failures of these detectors, pointing out the most common scenarios of failure. Based on our results, we recommend SSDs or Tiny-YOLOv4 for real-time applications on robots and recommend further investigation of video object detection methods.

* 14 pages, submitted for ICRA 21

SVAM: Saliency-guided Visual Attention Modeling by Autonomous Underwater Robots

Nov 12, 2020

Md Jahidul Islam, Ruobing Wang, Karin de Langis, Junaed Sattar

Abstract:This paper presents a holistic approach to saliency-guided visual attention modeling (SVAM) for use by autonomous underwater robots. Our proposed model, named SVAM-Net, integrates deep visual features at various scales and semantics for effective salient object detection (SOD) in natural underwater images. The SVAM-Net architecture is configured to jointly accommodate bottom-up and top-down learning within two separate branches of the network while sharing the same encoding layers. We design dedicated spatial attention modules (SAMs) along these learning pathways to exploit the coarse-level and fine-level semantic features for SOD at four stages of abstraction. The bottom-up branch performs a rough yet reasonably accurate saliency estimation at a fast rate, whereas the deeper top-down branch incorporates a residual refinement module (RRM) that provides fine-grained localization of the salient objects. Extensive performance evaluation of SVAM-Net on benchmark datasets clearly demonstrates its effectiveness for underwater SOD. We also validate its generalization performance on data from several ocean trials, which include test images of diverse underwater scenes and waterbodies as well as images with unseen natural objects. Moreover, we analyze its computational feasibility for robotic deployments and demonstrate its utility in several important use cases of visual attention modeling.

Real-Time Multi-Diver Tracking and Re-identification for Underwater Human-Robot Collaboration

Oct 21, 2019

Karin de Langis, Junaed Sattar

Abstract:Autonomous underwater robots working with teams of human divers may need to distinguish between different divers, e.g. to recognize a lead diver or to follow a specific team member. This paper describes a technique that enables autonomous underwater robots to track divers in real time as well as to reidentify them. The approach is an extension of Simple Online Realtime Tracking (SORT) with an appearance metric (deep SORT). Initial diver detection is performed with a custom CNN designed for realtime diver detection, and appearance features are subsequently extracted for each detected diver. Next, realtime tracking-by-detection is performed with an extension of the deep SORT algorithm. We evaluate this technique on a series of videos of divers performing human-robot collaborative tasks and show that our methods result in more divers being accurately identified during tracking. We also discuss the practical considerations of applying multi-person tracking to on-board autonomous robot operations, and we consider how failure cases can be addressed during on-board tracking.
