Multi-Class Machine Learning to Quantify the Impact of Nitrogen Management Practices on Grassland Biomass
2025
Sebastian Raubitzek | Margarita Hartlieb | Philip König | Judith Hinderling | Kevin Mallinger
Grassland biomass yield reflects a complex interaction of management intensity and environmental factors, yet quantifying the relative role of practices such as mowing and fertilization remains challenging. In this study, we introduce a multi-class machine learning framework to predict above-ground biomass on 150 permanent grassland plots across eight years (2009&ndash:2016) in Germany&rsquo:s Biodiversity Exploratories and to evaluate the influence of key management variables. Following rigorous data cleaning, imputation of missing nitrogen values, feature standardization, and encoding of categorical practices, we trained CatBoost classifiers optimized via Bayesian hyperparameter search and mitigated class imbalance with ADASYN oversampling. We assessed model performance under binary, three-class, four-class, and five-class quantile-based categorizations, achieving test accuracies of 0.76, 0.57, 0.42, and 0.38, respectively. Across all schemes, mowing frequency and mineral nitrogen input emerged as the dominant predictors, while secondary variables such as drainage and conditioner use contributed as well. These results demonstrate that broad biomass categories can be forecast reliably from standardized management records, whereas finer distinctions necessitate additional environmental information or automated sensing to capture nonlinear effects and reduce reporting bias. This work shows both the potential and the limits of machine learning for informing sustainable grassland management and explainability thereof. Frequent mowing and higher mineral nitrogen inputs explained most of the predictable variation, enabling a 76% accurate separation of low and high biomass categories. Predictive accuracy fell below 60% for finer class resolutions, indicating that management records alone are insufficient for detailed yield forecasts without complementary environmental data.
Show more [+] Less [-]Bibliographic information
This bibliographic record has been provided by Multidisciplinary Digital Publishing Institute