During the development of an Internal Combustion Engine-based powertrain, traditional procedures for control strategies calibration and validation produce huge amount of data, that can be used to develop innovative data-driven applications, such as emission virtual sensing. One of the main criticalities is related to the data quality, that cannot be easily assessed for such a big amount of data. This work focuses on an emission modeling activity, using an enhanced Light Gradient Boosting Regressor and a dedicated data pre-processing pipeline to improve data quality. First thing, a software tool is developed to access a database containing data coming from emissions tests. The tool performs a data cleaning procedure to exclude corrupted data or invalid parts of the test. Moreover, it automatically tunes model hyperparameters, it chooses the best set of features, and it validates the procedure by comparing the estimation and the experimental measurement. The proposed pre-processing pipeline shows an improvement in terms of accuracy, demonstrating the utility of using large training data which cover a wide set of vehicle maneuvers. Thus, custom designed tests are performed for dataset enrichment, allowing the model to predict non-conventional conditions of aftertreatment systems inefficiency. Real case applications of the proposed model are exposed, such as emission estimation in non-measurable conditions, virtual assessment of the impact of new control strategy calibration on emissions, alignment of emission measurements with all other vehicle signals. Finally, a Principal Component Analysis-based algorithm is developed, to assess the epistemic uncertainty of the model and the prediction reliability during inference.

Boris Petrone, E.G. (2024). Development of an Automatic Pipeline for Data Analysis and Pre-Processing for Data Driven-Based Engine Emission Modeling in a Real Industrial Application [10.4271/2024-01-2018].

Development of an Automatic Pipeline for Data Analysis and Pre-Processing for Data Driven-Based Engine Emission Modeling in a Real Industrial Application

Boris Petrone
Writing – Original Draft Preparation
;
Emanuele Giovannardi
Writing – Review & Editing
;
Alessandro Brusa
Writing – Review & Editing
;
Nicolo Cavina
Supervision
;
2024

Abstract

During the development of an Internal Combustion Engine-based powertrain, traditional procedures for control strategies calibration and validation produce huge amount of data, that can be used to develop innovative data-driven applications, such as emission virtual sensing. One of the main criticalities is related to the data quality, that cannot be easily assessed for such a big amount of data. This work focuses on an emission modeling activity, using an enhanced Light Gradient Boosting Regressor and a dedicated data pre-processing pipeline to improve data quality. First thing, a software tool is developed to access a database containing data coming from emissions tests. The tool performs a data cleaning procedure to exclude corrupted data or invalid parts of the test. Moreover, it automatically tunes model hyperparameters, it chooses the best set of features, and it validates the procedure by comparing the estimation and the experimental measurement. The proposed pre-processing pipeline shows an improvement in terms of accuracy, demonstrating the utility of using large training data which cover a wide set of vehicle maneuvers. Thus, custom designed tests are performed for dataset enrichment, allowing the model to predict non-conventional conditions of aftertreatment systems inefficiency. Real case applications of the proposed model are exposed, such as emission estimation in non-measurable conditions, virtual assessment of the impact of new control strategy calibration on emissions, alignment of emission measurements with all other vehicle signals. Finally, a Principal Component Analysis-based algorithm is developed, to assess the epistemic uncertainty of the model and the prediction reliability during inference.
2024
WCX SAE World Congress Experience
1
13
Boris Petrone, E.G. (2024). Development of an Automatic Pipeline for Data Analysis and Pre-Processing for Data Driven-Based Engine Emission Modeling in a Real Industrial Application [10.4271/2024-01-2018].
Boris Petrone, Emanuele Giovannardi, Alessandro Brusa, Nicolo Cavina, Ioannis Kitsopanidis
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/969737
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact