
You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://beta.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
Multivariate Wind Turbine Power Curve Model Based on Data Clustering and Polynomial LASSO Regression

doi: 10.3390/app12010072
handle: 11379/593336
Wind turbine performance monitoring is a complex task because of the non-stationary operation conditions and because the power has a multivariate dependence on the ambient conditions and working parameters. This motivates the research about the use of SCADA data for constructing reliable models applicable in wind turbine performance monitoring. The present work is devoted to multivariate wind turbine power curves, which can be conceived of as multiple input, single output models. The output is the power of the target wind turbine, and the input variables are the wind speed and additional covariates, which in this work are the blade pitch and rotor speed. The objective of this study is to contribute to the formulation of multivariate wind turbine power curve models, which conjugate precision and simplicity and are therefore appropriate for industrial applications. The non-linearity of the relation between the input variables and the output was taken into account through the simplification of a polynomial LASSO regression: the advantages of this are that the input variables selection is performed automatically. The k-means algorithm was employed for automatic multi-dimensional data clustering, and a separate sub-model was formulated for each cluster, whose total number was selected by analyzing the silhouette score. The proposed method was tested on the SCADA data of an industrial Vestas V52 wind turbine. It resulted that the most appropriate number of clusters was three, which fairly resembles the main features of the wind turbine control. As expected, the importance of the different input variables varied with the cluster. The achieved model validation error metrics are the following: the mean absolute percentage error was in the order of 7.2%, and the average difference of mean percentage errors on random subsets of the target data set was of the order of 0.001%. This indicates that the proposed model, despite its simplicity, can be reliably employed for wind turbine power monitoring and for evaluating accumulated performance changes due to aging and/or optimization.
- Università degli Studi di PERUGIA Italy
- University of Perugia Italy
- "UNIVERSITA DEGLI STUDI DI PERUGIA Italy
- University of Exeter United Kingdom
- University of Brescia Italy
Technology, QH301-705.5, power curve, T, Physics, QC1-999, data analysis, Data analysis; Multivariate regression; Power curve; SCADA; Wind energy; Wind turbines, wind energy; wind turbines; power curve; SCADA; data analysis; multivariate regression, Engineering (General). Civil engineering (General), Chemistry, wind turbines, wind energy, SCADA, multivariate regression, TA1-2040, Biology (General), QD1-999
Technology, QH301-705.5, power curve, T, Physics, QC1-999, data analysis, Data analysis; Multivariate regression; Power curve; SCADA; Wind energy; Wind turbines, wind energy; wind turbines; power curve; SCADA; data analysis; multivariate regression, Engineering (General). Civil engineering (General), Chemistry, wind turbines, wind energy, SCADA, multivariate regression, TA1-2040, Biology (General), QD1-999
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).18 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 10%
