Machine Learning Models for Spring Discharge Forecasting
2018
Granata, Francesco(Department of Civil and Mechanical Engineering, University of Cassino and Southern Lazio) | Saroli, Michele(Department of Civil and Mechanical Engineering, University of Cassino and Southern Lazio) | de Marinis, Giovanni(Department of Civil and Mechanical Engineering, University of Cassino and Southern Lazio) | Gargano, Rudy(Department of Civil and Mechanical Engineering, University of Cassino and Southern Lazio)
Nowadays, drought phenomena increasingly affect large areas of the globe; therefore, the need for a careful and rational management of water resources is becoming more pressing. Considering that most of the world’s unfrozen freshwater reserves are stored in aquifers, the capability of prediction of spring discharges is a crucial issue. An approach based on water balance is often extremely complicated or ineffective. A promising alternative is represented by data-driven approaches. Recently, many hydraulic engineering problems have been addressed by means of advanced models derived from artificial intelligence studies. Three different machine learning algorithms were used for spring discharge forecasting in this comparative study: M5P regression tree, random forest, and support vector regression. The spring of Rasiglia Alzabove, Umbria, Central Italy, was selected as a case study. The machine learning models have proven to be able to provide very encouraging results. M5P provides good short-term predictions of monthly average flow rates (e.g., in predicting average discharge of the spring after 1 month, R2=0.991, RAE=14.97%, if a 4-month input is considered), while RF is able to provide accurate medium-term forecasts (e.g., in forecasting average discharge of the spring after 3 months, R2=0.964, RAE=43.12%, if a 4-month input is considered). As the time of forecasting advances, the models generally provide less accurate predictions. Moreover, the effectiveness of the models significantly depends on the duration of the period considered for input data. This duration should be close to the aquifer response time, approximately estimated by cross-correlation analysis.
显示更多 [+] 显示较少 [-]