KR2025Proceedings of the 22nd International Conference on Principles of Knowledge Representation and ReasoningProceedings of the 22nd International Conference on Principles of Knowledge Representation and Reasoning

Melbourne, Australia. November 11-17, 2025.

Edited by

ISSN: 2334-1033
ISBN: 978-1-956792-08-9

Sponsored by
Published by

Copyright © 2025 International Joint Conferences on Artificial Intelligence Organization

ASP-Driven Visual Commonsense: A General Framework for Reasoning About Embodied Interaction in the Wild

  1. Jakob Suchan(Constructor University, Germany, CoDesign Lab EU)
  2. Mehul Bhatt(Örebro University, Sweden, CoDesign Lab EU)
  3. Julius Monsen(Örebro University, Sweden, CoDesign Lab EU)

Keywords

  1. Vision And AI
  2. Answer Set Programming
  3. General Tools And Open-Source Development
  4. KRR For Autonomous Systems
  5. AI For Humanities And Social Sciences

Abstract

We present a general framework for declaratively grounded visual commonsense (reasoning) about embodied interaction in naturalistic, in-the-wild settings relevant to a range of AI application domains. The core computational capabilities of the framework pertaining visual commonsense are driven by a robust neurosymbolic architecture primarily consisting of: (1) answer set programming based modelling of foundational aspects pertaining spatio-temporal dynamics, encompassing space, time, events, action, motion; (2) modularly integrated visual computing techniques constituting the neural substrate linking quantitative perceptual features serving as low-level counterparts to high-level semantic characterisations of (inter)active visual commonsense.

Practically, we also present a first open-release of the developed framework with the aim to promote independent extensions and real-world applied KRR. The release comprises: (a) demonstrated case-studies in domains such as autonomous driving, psychology and media studies; (b) systematic evaluation mechanisms for community benchmarking; and (c) supporting material such as tutorials and datasets.