
S-PIC4CHU (Semantics-based Provenance, Integrity, and Curation for Consistent, High-quality, and Unbiased data science)

Ilaria Bartolini
Published: May 10, 2026
The S-PIC4CHU project aims to develop innovative models and techniques for scalable data preparation in Data Science and Machine Learning. It leverages data semantics at every stage of data preparation to improve data quality and reduce bias in the results. The proposed approach consists of a novel data preparation pipeline, semantically enriched with domain knowledge from ontologies and knowledge graphs, together with new semantics-based techniques for data cleaning, integration, provenance, explanation, and quality management. The approach will be validated on use cases from different domains, with the goal of releasing open-source tools.
Computer science, Pipeline (software), Semantics (computer science), Scalability, Domain (mathematical analysis)