Analyzing Intentional Behavior in Autonomous Agents under Uncertainty

Analyzing Intentional Behavior in Autonomous Agents under Uncertainty

Filip Cano Córdoba, Samuel Judson, Timos Antonopoulos, Katrine Bjørner, Nicholas Shoemaker, Scott J. Shapiro, Ruzica Piskac, Bettina Könighofer

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Main Track. Pages 372-381. https://doi.org/10.24963/ijcai.2023/42

Principled accountability for autonomous decision-making in uncertain environments requires distinguishing intentional outcomes from negligent designs from actual accidents. We propose analyzing the behavior of autonomous agents through a quantitative measure of the evidence of intentional behavior. We model an uncertain environment as a Markov Decision Process (MDP). For a given scenario, we rely on probabilistic model checking to compute the ability of the agent to influence reaching a certain event. We call this the scope of agency. We say that there is evidence of intentional behavior if the scope of agency is high and the decisions of the agent are close to being optimal for reaching the event. Our method applies counterfactual reasoning to automatically generate relevant scenarios that can be analyzed to increase the confidence of our assessment. In a case study, we show how our method can distinguish between 'intentional' and 'accidental' traffic collisions.
Keywords:
AI Ethics, Trust, Fairness: ETF: Accountability
AI Ethics, Trust, Fairness: ETF: Moral decision making
Planning and Scheduling: PS: Markov decisions processes