AI systems are increasingly interacting with users who are not experts in AI. This has led to growing calls for better safety assessment and regulation of AI systems. However, broad questions remain on the processes and technical approaches that would be required to conceptualize, express, manage, and enforce such regulations for adaptive AI systems, which, by nature, are expected to exhibit different behaviors as they adapt to evolving user requirements and deployment environments.
This workshop will foster research and development of new paradigms for assessment and design of AI systems that are not only efficient according to a task-based performance measure, but also safe to use by diverse groups of users and compliant with the relevant regulatory frameworks. It will highlight and engender research on new paradigms and algorithms for assessing AI systems' compliance with a variety of evolving safety and regulatory requirements, along with methods for expressing such requirements.
We also expect that the workshop will lead to a productive exchange of ideas across two highly active fields of research, namely AI and formal methods. The organizing team includes active researchers from both fields, and our pool of invited speakers features prominent researchers from both areas.
Although there is a growing need for independent assessment and regulation of AI systems, broad questions remain on the processes and technical approaches that would be required to conceptualize, express, manage, assess, and enforce such regulations for adaptive AI systems.
This workshop addresses research gaps in assessing the compliance of adaptive AI systems (systems capable of planning/learning) in the presence of post-deployment changes in requirements, in user-specific objectives, in deployment environments, and in the AI systems themselves.
These research problems go beyond the classical notions of verification and validation, where operational requirements and system specifications are available a priori. In contrast, adaptive AI systems such as household robots are expected to adapt to day-to-day changes in requirements (which may be user-provided) and deployment environments, as well as to changes arising from system updates and learning. The workshop will feature invited talks by researchers from AI and formal methods, as well as talks on contributed papers.
Topics of interest include:
Submissions can describe either work in progress or mature work that has already been published at another research venue. We also welcome "highlights" papers summarizing results from multiple recent papers by the authors. Submissions of papers under review at other venues (NeurIPS, CoRL, ECAI, KR, etc.) are welcome, since AIA 2025 is a non-archival venue and we will not require a transfer of copyright. If such papers are currently under blind review, please anonymize the submission.
Submissions should use the IJCAI 2025 style. Papers under review at other venues may use the style file of that venue, but camera-ready versions of accepted papers must be in the IJCAI 2025 format by the camera-ready deadline. Papers should adhere to the IJCAI Code of Conduct for Authors, the IJCAI Code of Ethics, and the NeurIPS 2025 policy on the use of LLMs.
Three types of papers can be submitted:
Papers can be submitted via OpenReview at https://openreview.net/group?id=ijcai.org/IJCAI/2025/Workshop/AIA.
| Event | Date |
| --- | --- |
| Announcement and call for submissions | April 09, 2025 |
| Paper submission deadline | May 27, 2025 (11:59 PM UTC-12) |
| Author notification | June 13, 2025 |
| Workshop | August 18, 2025 |
Reid Simmons
Carnegie Mellon University, USA

Counterfactual Explanations for Better Grounding

Clark's Common Ground Theory posits that successful communication needs grounding, which is a convergence of mutual beliefs, obtained with the least joint effort. In practice, this means that AI agents need to have a model of what people do, and do not, already know in order to effectively communicate with them. In particular, for explaining the agent's policy, we have found that counterfactual explanations are critical - that is, providing explanations that communicate relevant differences between what the agent knows and what it believes that the person already knows. Inferring what the person knows is modeled as an iterative process - the AI agent provides some information, the person responds in some way and, based on the response, the agent refines its model of what it believes the person knows. This talk will present the overall framework of grounding via counterfactual explanations and prior and ongoing research projects that use this framework to achieve better grounding.

Bio: Dr. Reid Simmons is a Research Professor in the Robotics Institute and Computer Science Department at Carnegie Mellon University. He received his PhD from MIT in Artificial Intelligence, and since coming to CMU in 1988, his research has focused on developing self-reliant robots that can autonomously operate over extended periods of time in unknown, unstructured environments, and on human-robot social interaction, especially non-verbal communication through affect, proxemics, motion, and gesture. He is co-PI and Research Director for the NSF-sponsored AI-CARING Institute. Dr. Simmons is an author of over 250 publications on AI, Robotics, and Human-Robot Interaction and has graduated 25 PhD students. He previously served as a Program Director at the National Science Foundation, where he oversaw the National Robotics Initiative and initiated the Smart and Autonomous Systems program. In 2018, Dr. Simmons helped found the first-in-the-nation standalone undergraduate major in Artificial Intelligence and currently serves as its program director. He is a Fulbright Scholar, a Fellow of the Association for the Advancement of Artificial Intelligence, a Senior Member of IEEE, and was an ONR Summer Faculty Fellow in 2022.
Xujie Si
University of Toronto, Canada

The Science and Engineering of Autoformalizing Mathematics: A Case Study in Euclidean Geometry

Formalizing mathematics into machine-checkable logic is essential for advancing scientific rigor and enabling powerful AI reasoning. However, the process of translating informal mathematical text into formal languages remains a major bottleneck. This talk explores the challenge of autoformalization - the automated conversion of natural mathematical language into formal logic - through the lens of Euclidean geometry, one of the oldest and most foundational domains in mathematics. I will present insights from our recent work on LeanEuclid and PyEuclid, which demonstrate how modern Large Language Models (LLMs), combined with formal methods, can help bridge the gap between informal and formal mathematical reasoning.

Bio: Xujie Si is an Assistant Professor in the Department of Computer Science at the University of Toronto. He is also a faculty affiliate at the Vector Institute and an external affiliate member at Mila, the Quebec AI Institute, where he holds a Canada CIFAR AI Chair. His research centers on automated reasoning and formalization, neuro-symbolic systems, and developing foundational abstractions for reliable and explainable AI (XAI). His recent work concerns formalizing mathematics, program synthesis and verification with deep learning techniques, learning verifiably correct specifications for neural networks, and interpretable rule learning from perceptual data. His work has been recognized with an ACM SIGPLAN Distinguished Paper Award and oral/spotlight presentations at top programming languages and machine learning conferences.
Ruqi Zhang
Purdue University, USA

Aligned and Safe LLMs via Probabilistic Modeling

As large language models (LLMs) are increasingly deployed in complex and high-stakes applications, ensuring their alignment and safety is more important than ever. In this talk, I will explore how probabilistic modeling provides principled and effective approaches for addressing these challenges. First, I will introduce a framework that casts LLM alignment as a problem of probabilistic inference, and present two discrete sampling techniques for efficient inference. Then, I will show how variational inference can be used to automatically uncover diverse adversarial inputs, providing a comprehensive, distributional characterization of model vulnerabilities. Finally, I will conclude by outlining promising directions for future research.

Bio: Ruqi Zhang is an Assistant Professor in the Department of Computer Science at Purdue University. Her research focuses on machine learning, generative modeling, and probabilistic methods. Prior to joining Purdue, she was a postdoctoral researcher at the Institute for Foundations of Machine Learning (IFML) at the University of Texas at Austin. She received her Ph.D. from Cornell University. Dr. Zhang has been a key organizer of the Symposium on Advances in Approximate Bayesian Inference for four years. She has served as an Area Chair and Editor for top ML conferences and journals, including ICML, NeurIPS, AISTATS, UAI, and TMLR. Her contributions have been recognized with several honors, including AAAI New Faculty Highlights, an Amazon Research Award, a Spotlight Rising Star in Data Science award from the University of Chicago, the Seed for Success Acorn Award, and the Ross-Lynn Research Scholar award.