AIA 2025 Tutorial

Overview

This tutorial will cover approaches for assessing the safety and functionality of AI systems designed to learn continuously and complete tasks in a user's environment. AI systems are increasingly interacting with non-expert users, leading users, governments, and industry to call for better safety assessment and regulation. While recent AI developments have made it easier to build taskable AI systems, ensuring their safety presents unique challenges. Unlike traditional engineered systems, where limited functionality yields safety, taskable AI systems are designed to adapt to user-specific tasks and environments, which invalidates conventional approaches to safety assurance. These challenges cannot be addressed by simply extending existing verification and validation paradigms.

This tutorial is essential for researchers working on AI safety and will interest those in robotics, planning, and human-robot interaction. Participants will learn about foundational topics like active and passive action-model learning and assessment of black-box AI systems in stationary and adaptive settings. The tutorial covers novel capability discovery and assessment techniques, with applications in real-world scenarios like household robotics, digital assistants, autonomous vehicles, and healthcare systems. Specifically, we address three main areas: (i) why conventional verification and validation approaches fall short, (ii) specific requirements and promising research directions for formal assessment of AI systems, and (iii) solutions developed for restricted settings.

By exploring these challenges and research directions, the tutorial will give both junior and senior researchers the foundation to contribute to the continual assessment of AI systems that can learn, plan, and act. It emphasizes the interdisciplinary nature of AI assessment, which combines formal methods, human-AI interaction, and AI safety.

Please send tutorial-related queries to pulkitv@mit.edu and siddharths@asu.edu.

Schedule

Tutorial video now available here

08:30 AM - 09:00 AM	Meet and Greet over Coffee
09:00 AM - 10:30 AM	Session 1: Introduction and Motivation; Assessment of AI Systems through Model Learning; Assessment of Black-Box AI Systems in Stationary Settings
10:30 AM - 11:00 AM	Coffee Break
11:00 AM - 12:30 PM	Session 2: Discovering Capabilities for Black-Box AI Assessment; AI Assessment in Adaptive Settings; Future Directions and Conclusion
Papers covered in the tutorial (in order of appearance):

Motivation/Overview

Safety Beyond Verification: The Need for Continual, User-Driven Assessment of AI Systems. Siddharth Srivastava, Georgios Fainekos. In AAAI Spring Symposium on User-Aligned Assessment of Adaptive AI Systems, 2024.

User-Aligned Autonomous Capability Assessment of Black-Box AI Systems. Pulkit Verma, Siddharth Srivastava. In AAAI Spring Symposium on User-Aligned Assessment of Adaptive AI Systems, 2024.

Session 1

Online Learning of Action Models for PDDL Planning. Leonardo Lamanna, Alessandro Saetti, Luciano Serafini, Alfonso E. Gerevini, Paolo Traverso. In Proceedings of IJCAI, 2021.

GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling. Rohan Chitnis, Tom Silver, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Tomás Lozano-Pérez. In Proceedings of AAAI, 2021.

Asking the Right Questions: Learning Interpretable Action Models Through Query Answering. Pulkit Verma, Shashank Rao Marpally, Siddharth Srivastava. In Proceedings of AAAI, 2021.

Autonomous Assessment of Sequential Decision-Making Systems in Stochastic Settings. Pulkit Verma, Rushang Karia, Siddharth Srivastava. In Proceedings of NeurIPS, 2023.

Session 2

Discovering User-Interpretable Capabilities of Black-Box Planning Agents. Pulkit Verma, Shashank Rao Marpally, Siddharth Srivastava. In Proceedings of KR, 2022.

Learning Neuro-Symbolic Skills for Bilevel Planning. Tom Silver, Ashay Athalye, Joshua B. Tenenbaum, Tomás Lozano-Pérez, Leslie Pack Kaelbling. In Proceedings of CoRL, 2022.

Maintaining Evolving Domain Models. Dan Bryce, J. Benton, Michael W. Boldt. In Proceedings of IJCAI, 2016.

Differential Assessment of Black-Box AI Agents. Rashmeet Kaur Nayyar^, Pulkit Verma^, Siddharth Srivastava. In Proceedings of AAAI, 2022.

Interpretability Analysis of Symbolic Representations for Sequential Decision-Making Systems. Pulkit Verma, Julie A. Shah. In HRI Workshop on Explainability for Human-Robot Collaboration, 2025.

AutoEval: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks. Rushang Karia, Daniel Bramblett, Daksh Dobhal, Siddharth Srivastava. In Proceedings of ICLR, 2025.

Venue Details

Room 115A, Pennsylvania Convention Center, Philadelphia, PA:

Convention Center Floor Map (100 Level):

Additional details available on the AAAI Website: https://aaai.org/conference/aaai/aaai-25/know-before-you-go/

Organizers

Pulkit Verma

Postdoctoral Associate, Massachusetts Institute of Technology, USA

Pulkit Verma is a Postdoctoral Associate in the Interactive Robotics Group at the Massachusetts Institute of Technology, where he works with Julie Shah. His research focuses on the safe and reliable behavior of taskable AI agents. He investigates the minimal set of requirements in an AI system that would enable a user to assess and understand the limits of its safe operability. He received his Ph.D. in Computer Science from the School of Computing and Augmented Intelligence, Arizona State University, where he worked with Siddharth Srivastava. Before that, he completed his M.Tech. in Computer Science and Engineering at the Indian Institute of Technology Guwahati with Pradip K. Das. He was awarded the Graduate College Completion Fellowship at ASU in 2023 and the Post Graduation Scholarship from the Government of India in 2013 and 2014, and he received the Best Demo Award at the International Conference on Autonomous Agents and Multiagent Systems (AAMAS) in 2022.

Siddharth Srivastava

Associate Professor, Arizona State University, USA

Siddharth Srivastava is an Associate Professor in the School of Computing and Augmented Intelligence at Arizona State University. Srivastava was a Staff Scientist at the United Technologies Research Center in Berkeley before joining ASU. Prior to that, he was a postdoctoral researcher working with Stuart Russell and Pieter Abbeel at the University of California, Berkeley. Srivastava received his PhD in Computer Science from the University of Massachusetts Amherst, working with Shlomo Zilberstein and Neil Immerman, and a (4+1) MS in Mathematics from the Indian Institute of Technology (IIT) Kanpur. Srivastava is a recipient of the NSF CAREER award, the Top 5% Faculty Award from the Fulton Schools of Engineering at ASU, a Best Paper Award at the International Conference on Automated Planning and Scheduling (ICAPS), an Outstanding Dissertation award from the Department of Computer Science at UMass Amherst, a Best Final Year Thesis award from the Department of Mathematics at IIT Kanpur, and the National Board of Higher Mathematics Scholarship in India. He served as conference Co-Chair for ICAPS 2019. He currently serves as Chair of the ICAPS Awards Committee and as Associate Editor for the Journal of AI Research.

User-Driven Capability Assessment of
Taskable AI Systems

Room 115A, Pennsylvania Convention Center

Philadelphia, USA
