As part of the Confiance.ai program, IRT SystemX, in collaboration with Atos and CEA, has developed DQM (Data Quality Metrics), an open-source library written in Python. The tool evaluates the quality of the data used to develop and assess artificial intelligence (AI) models, particularly in complex industrial environments.
Published on 10/31/2025
Data quality is essential to ensure the reliability of AI models. The DQM library provides a concrete response to this challenge by offering relevant and interpretable quality attributes that assess critical aspects such as the representativeness and coverage of data within specific operational domains.
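To make the notions of coverage and representativeness more concrete, here is a minimal sketch in plain Python. It is not DQM's API and does not reproduce the library's actual metrics: the function names `coverage` and `chi_square_statistic`, the equal-width binning, and the choice of a Pearson chi-square statistic are all assumptions made purely for illustration.

```python
from collections import Counter


def coverage(values, lower, upper, n_bins):
    """Fraction of equal-width bins over [lower, upper] holding at least one sample.

    Hypothetical helper: a simple proxy for how much of an operational
    domain (here a 1-D range) the dataset actually covers.
    """
    width = (upper - lower) / n_bins
    occupied = set()
    for v in values:
        if lower <= v <= upper:
            # Clamp so that v == upper falls into the last bin.
            occupied.add(min(int((v - lower) / width), n_bins - 1))
    return len(occupied) / n_bins


def chi_square_statistic(labels, expected_proportions):
    """Pearson chi-square statistic of observed counts vs. expected proportions.

    Hypothetical helper: a larger value means the sample departs further
    from the target distribution, i.e. it is less representative.
    """
    n = len(labels)
    observed = Counter(labels)
    stat = 0.0
    for label, p in expected_proportions.items():
        expected = n * p
        stat += (observed.get(label, 0) - expected) ** 2 / expected
    return stat


# Illustrative data only: vehicle speeds (km/h) and lighting conditions.
speeds = [5, 12, 18, 22, 95]
conditions = ["day"] * 8 + ["night"] * 2

speed_coverage = coverage(speeds, lower=0, upper=100, n_bins=10)
condition_gap = chi_square_statistic(conditions, {"day": 0.5, "night": 0.5})
```

In this toy case only four of the ten speed bins are populated, and the day/night split departs noticeably from the balanced target. Indicators of this kind are what make a quality attribute interpretable for a domain expert.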
The institute’s teams developed two main categories of metrics:
DQM was designed as a standalone Python package, making it easy to use independently or to integrate into other tools, such as DebiAI. The library has already been integrated into the end-to-end methodologies of major industrial players such as Naval Group and Valeo, strengthening their ability to accurately assess data quality.
The DQM library shows strong potential. Our experiments have demonstrated its effectiveness in providing a deep understanding of the data used in machine learning workflows. Now part of the European Trustworthy Foundation created by the Confiance.ai community, the library is generating strong interest and paving the way for new applications across various industrial sectors. In addition, a scientific paper detailing the library's contributions was published in ATRACC.
4 technology transfers to CAB project partners (RTE, Orange) and European players (Flatlandet, EnliteAI)