CAISAR: A platform for Characterizing Artificial Intelligence Safety and Robustness

About

We present CAISAR, an open-source platform under active development for the characterization of AI systems' robustness and safety. CAISAR provides a unified entry point for defining verification problems by using WhyML, the mature and expressive language of the Why3 verification platform. Moreover, CAISAR orchestrates and composes state-of-the-art machine learning verification tools which, individually, are not able to efficiently handle all problems but, collectively, can cover a growing number of properties. Our aim is to assist, on the one hand, the V\&V process by reducing the burden of choosing the methodology tailored to a given verification problem, and on the other hand the tools developers by factorizing useful features-visualization, report generation, property description-in one platform. CAISAR will soon be available at https://git.frama-c.com/pub/caisar.

Julien Girard-Satabin, Michele Alberti, Fran\c{c}ois Bobot, Zakaria Chihani, Augustin Lemesle• 2022

Related benchmarks

Task	Dataset	Result	Rank
Neural Network Verification	cifar100 VNN-COMP 2024	Verified Properties Count68		5
Neural Network Verification	tinyimagenet VNN-COMP 2024	Verified Properties Count49		5

Showing 2 of 2 rows

Other info

Follow for update

@wizwand_team Discord