Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAISAR: A platform for Characterizing Artificial Intelligence Safety and Robustness

About

We present CAISAR, an open-source platform under active development for the characterization of AI systems' robustness and safety. CAISAR provides a unified entry point for defining verification problems by using WhyML, the mature and expressive language of the Why3 verification platform. Moreover, CAISAR orchestrates and composes state-of-the-art machine learning verification tools which, individually, are not able to efficiently handle all problems but, collectively, can cover a growing number of properties. Our aim is to assist, on the one hand, the V\&V process by reducing the burden of choosing the methodology tailored to a given verification problem, and on the other hand the tools developers by factorizing useful features-visualization, report generation, property description-in one platform. CAISAR will soon be available at https://git.frama-c.com/pub/caisar.

Julien Girard-Satabin, Michele Alberti, Fran\c{c}ois Bobot, Zakaria Chihani, Augustin Lemesle• 2022

Related benchmarks

TaskDatasetResultRank
Neural Network Verificationcifar100 VNN-COMP 2024
Verified Properties Count68
5
Neural Network Verificationtinyimagenet VNN-COMP 2024
Verified Properties Count49
5
Showing 2 of 2 rows

Other info

Follow for update