SMLE: Safe Machine Learning via Embedded Overapproximation

Matteo Francobaldi; Michele Lombardi

doi:10.1609/aaai.v39i26.34938

Back to AAAI

AAAI 2025

SMLE: Safe Machine Learning via Embedded Overapproximation

Conference Paper AAAI Technical Track on AI Alignment Artificial Intelligence

PDF Details DOI

Abstract

Despite the extent of recent advances in Machine Learning (ML) and Neural Networks, providing formal guarantees on the behavior of these systems is still an open problem, and a crucial requirement for their adoption in regulated or safety-critical scenarios. We consider the task of training differentiable ML models guaranteed to satisfy designer-chosen properties, stated as input-output implications. This is very challenging, due to the computational complexity of rigorously verifying and enforcing compliance in deep neural models. We provide an innovative approach based on: 1) a general, simple architecture enabling efficient verification with a conservative semantic; 2) a rigorous training algorithm based on the Projected Gradient Method; 3) a formulation of the problem of searching for strong counterexamples. The proposed framework, being only marginally affected by model complexity, scales well to practical applications, and produces models that provide full property satisfaction guarantees. We evaluate our approach on properties defined by linear inequalities in regression, and on mutually exclusive classes in multi-label classification. Our approach is competitive with a baseline that includes property enforcement in preprocessing (on training data) and postprocessing (on model predictions). Finally, our contributions establish a framework that opens up multiple research directions and potential improvements.

SMLE: Safe Machine Learning via Embedded Overapproximation

Abstract

Authors

Keywords

Context