Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mehul Bhatt

Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics

Dec 28, 2020

Jakob Suchan, Mehul Bhatt, Srikrishna Varadarajan

Figure 1 for Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics

Figure 2 for Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics

Figure 3 for Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics

Figure 4 for Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics

Abstract:We demonstrate the need and potential of systematically integrated vision and semantics solutions for visual sensemaking in the backdrop of autonomous driving. A general neurosymbolic method for online visual sensemaking using answer set programming (ASP) is systematically formalised and fully implemented. The method integrates state of the art in visual computing, and is developed as a modular framework that is generally usable within hybrid architectures for realtime perception and control. We evaluate and demonstrate with community established benchmarks KITTIMOD, MOT-2017, and MOT-2020. As use-case, we focus on the significance of human-centred visual sensemaking -- e.g., involving semantic representation and explainability, question-answering, commonsense interpolation -- in safety-critical autonomous driving situations. The developed neurosymbolic framework is domain-independent, with the case of autonomous driving designed to serve as an exemplar for online visual sensemaking in diverse cognitive interaction settings in the backdrop of select human-centred AI technology design considerations. Keywords: Cognitive Vision, Deep Semantics, Declarative Spatial Reasoning, Knowledge Representation and Reasoning, Commonsense Reasoning, Visual Abduction, Answer Set Programming, Autonomous Driving, Human-Centred Computing and Design, Standardisation in Driving Technology, Spatial Cognition and AI.

* This is a preprint / review version of an accepted contribution to be published as part of the Artificial Intelligence Journal (AIJ).? The article is an extended version of an IJCAI 2019 publication [74, arXiv:1906.00107]

Via

Access Paper or Ask Questions

Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving

Jun 02, 2020

Vasiliki Kondyli, Mehul Bhatt, Jakob Suchan

Figure 1 for Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving

Figure 2 for Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving

Figure 3 for Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving

Figure 4 for Towards a Human-Centred Cognitive Model of Visuospatial Complexity in Everyday Driving

Abstract:We develop a human-centred, cognitive model of visuospatial complexity in everyday, naturalistic driving conditions. With a focus on visual perception, the model incorporates quantitative, structural, and dynamic attributes identifiable in the chosen context; the human-centred basis of the model lies in its behavioural evaluation with human subjects with respect to psychophysical measures pertaining to embodied visuoauditory attention. We report preliminary steps to apply the developed cognitive model of visuospatial complexity for human-factors guided dataset creation and benchmarking, and for its use as a semantic template for the (explainable) computational analysis of visuospatial complexity.

* 9th European Starting AI Researchers Symposium (STAIRS), at ECAI 2020, the 24th European Conference on Artificial Intelligence (ECAI)., Santiago de Compostela, Spain

Via

Access Paper or Ask Questions

Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving

May 31, 2019

Jakob Suchan, Mehul Bhatt, Srikrishna Varadarajan

Figure 1 for Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving

Figure 2 for Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving

Figure 3 for Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving

Figure 4 for Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving

Abstract:We demonstrate the need and potential of systematically integrated vision and semantics} solutions for visual sensemaking (in the backdrop of autonomous driving). A general method for online visual sensemaking using answer set programming is systematically formalised and fully implemented. The method integrates state of the art in (deep learning based) visual computing, and is developed as a modular framework usable within hybrid architectures for perception & control. We evaluate and demo with community established benchmarks KITTIMOD and MOT. As use-case, we focus on the significance of human-centred visual sensemaking ---e.g., semantic representation and explainability, question-answering, commonsense interpolation--- in safety-critical autonomous driving situations.

* IJCAI 2019: the 28th International Joint Conference on Artificial Intelligence (IJCAI) 2019, August 10 - 16, Macao. (Preprint / to appear)

Via

Access Paper or Ask Questions

Semantic Analysis of Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability

Sep 14, 2018

Jakob Suchan, Mehul Bhatt, Srikrishna Vardarajan, Seyed Ali Amirshahi, Stella Yu

Figure 1 for Semantic Analysis of Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability

Figure 2 for Semantic Analysis of Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability

Figure 3 for Semantic Analysis of Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability

Figure 4 for Semantic Analysis of Visual Symmetry: A Human-Centred Computational Model for Declarative Explainability

Abstract:We present a computational model for the semantic interpretation of symmetry in naturalistic scenes. Key features include a human-centred representation, and a declarative, explainable interpretation model supporting deep semantic question-answering founded on an integration of methods in knowledge representation and deep learning based computer vision. In the backdrop of the visual arts, we showcase the framework's capability to generate human-centred, queryable, relational structures, also evaluating the framework with an empirical study on the human perception of visual symmetry. Our framework represents and is driven by the application of foundational, integrated Vision and Knowledge Representation and Reasoning methods for applications in the arts, and the psychological and social sciences.

* Advances in Cognitive Systems. (http://www.cogsys.org/journal), 2018
* Preprint of accepted article / Journal: Advances in Cognitive Systems. ( http://www.cogsys.org/journal )

Via

Access Paper or Ask Questions

Answer Set Programming Modulo `Space-Time'

May 17, 2018

Carl Schultz, Mehul Bhatt, Jakob Suchan, Przemysław Wałęga

Figure 1 for Answer Set Programming Modulo `Space-Time'

Figure 2 for Answer Set Programming Modulo `Space-Time'

Figure 3 for Answer Set Programming Modulo `Space-Time'

Figure 4 for Answer Set Programming Modulo `Space-Time'

Abstract:We present ASP Modulo `Space-Time', a declarative representational and computational framework to perform commonsense reasoning about regions with both spatial and temporal components. Supported are capabilities for mixed qualitative-quantitative reasoning, consistency checking, and inferring compositions of space-time relations; these capabilities combine and synergise for applications in a range of AI application areas where the processing and interpretation of spatio-temporal data is crucial. The framework and resulting system is the only general KR-based method for declaratively reasoning about the dynamics of `space-time' regions as first-class objects. We present an empirical evaluation (with scalability and robustness results), and include diverse application examples involving interpretation and control tasks.

Via

Access Paper or Ask Questions

Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

Dec 03, 2017

Jakob Suchan, Mehul Bhatt, Przemysław Wałęga, Carl Schultz

Figure 1 for Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

Figure 2 for Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

Figure 3 for Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

Figure 4 for Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

Abstract:We propose a hybrid architecture for systematically computing robust visual explanation(s) encompassing hypothesis formation, belief revision, and default reasoning with video data. The architecture consists of two tightly integrated synergistic components: (1) (functional) answer set programming based abductive reasoning with space-time tracklets as native entities; and (2) a visual processing pipeline for detection based object tracking and motion analysis. We present the formal framework, its general implementation as a (declarative) method in answer set programming, and an example application and evaluation based on two diverse video datasets: the MOTChallenge benchmark developed by the vision community, and a recently developed Movie Dataset.

* Preprint of final publication published as part of AAAI 2018: J. Suchan., M. Bhatt, Wa{\l}\k{e}ga, P., Schultz, C. (2018). Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects. In AAAI 2018: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, February 2-7, 2018, New Orleans, USA

Via

Access Paper or Ask Questions

Deep Semantic Abstractions of Everyday Human Activities: On Commonsense Representations of Human Interactions

Oct 10, 2017

Jakob Suchan, Mehul Bhatt

Figure 1 for Deep Semantic Abstractions of Everyday Human Activities: On Commonsense Representations of Human Interactions

Figure 2 for Deep Semantic Abstractions of Everyday Human Activities: On Commonsense Representations of Human Interactions

Figure 3 for Deep Semantic Abstractions of Everyday Human Activities: On Commonsense Representations of Human Interactions

Figure 4 for Deep Semantic Abstractions of Everyday Human Activities: On Commonsense Representations of Human Interactions

Abstract:We propose a deep semantic characterization of space and motion categorically from the viewpoint of grounding embodied human-object interactions. Our key focus is on an ontological model that would be adept to formalisation from the viewpoint of commonsense knowledge representation, relational learning, and qualitative reasoning about space and motion in cognitive robotics settings. We demonstrate key aspects of the space & motion ontology and its formalization as a representational framework in the backdrop of select examples from a dataset of everyday activities. Furthermore, focussing on human-object interaction data obtained from RGBD sensors, we also illustrate how declarative (spatio-temporal) reasoning in the (constraint) logic programming family may be performed with the developed deep semantic abstractions.

* In ROBOT 2017: Third Iberian Robotics Conference. Escuela T\'ecnica Superior de Ingenier\'ia, Sevilla (Spain) (November 22-24, 2017). https://grvc.us.es/robot2017/ (to appear). arXiv admin note: substantial text overlap with arXiv:1709.05293

Via

Access Paper or Ask Questions

Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Sep 15, 2017

Jakob Suchan, Mehul Bhatt

Figure 1 for Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Figure 2 for Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Figure 3 for Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Figure 4 for Commonsense Scene Semantics for Cognitive Robotics: Towards Grounding Embodied Visuo-Locomotive Interactions

Abstract:We present a commonsense, qualitative model for the semantic grounding of embodied visuo-spatial and locomotive interactions. The key contribution is an integrative methodology combining low-level visual processing with high-level, human-centred representations of space and motion rooted in artificial intelligence. We demonstrate practical applicability with examples involving object interactions, and indoor movement.

* to appear in: ICCV 2017 Workshop - Vision in Practice on Autonomous Robots (ViPAR), International Conference on Computer Vision (ICCV), Venice, Italy

Via

Access Paper or Ask Questions

Deeply Semantic Inductive Spatio-Temporal Learning

Aug 09, 2016

Jakob Suchan, Mehul Bhatt, Carl Schultz

Figure 1 for Deeply Semantic Inductive Spatio-Temporal Learning

Abstract:We present an inductive spatio-temporal learning framework rooted in inductive logic programming. With an emphasis on visuo-spatial language, logic, and cognition, the framework supports learning with relational spatio-temporal features identifiable in a range of domains involving the processing and interpretation of dynamic visuo-spatial imagery. We present a prototypical system, and an example application in the domain of computing for visual arts and computational cognitive science.

* Accepted for publication at ILP 2016: 26th International Conference on Inductive Logic Programming 4th - 6th September 2016, London. Keywords: Spatio-Temporal Learning; Dynamic Visuo-Spatial Imagery; Declarative Spatial Reasoning; Inductive Logic Programming; AI and Art

Via

Access Paper or Ask Questions

Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Jul 26, 2016

Michael Spranger, Jakob Suchan, Mehul Bhatt, Manfred Eppe

Figure 1 for Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Figure 2 for Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Figure 3 for Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Figure 4 for Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Abstract:This paper presents a computational model of the processing of dynamic spatial relations occurring in an embodied robotic interaction setup. A complete system is introduced that allows autonomous robots to produce and interpret dynamic spatial phrases (in English) given an environment of moving objects. The model unites two separate research strands: computational cognitive semantics and on commonsense spatial representation and reasoning. The model for the first time demonstrates an integration of these different strands.

* in: Pham, D.-N. and Park, S.-B., editors, PRICAI 2014: Trends in Artificial Intelligence, volume 8862 of Lecture Notes in Computer Science, pages 958-971. Springer

Via

Access Paper or Ask Questions