Random Reflectance: A New Hyperspectral Data Preprocessing Method for Improving the Accuracy of Machine Learning Algorithms
2025
Pavel A. Dmitriev | Anastasiya A. Dmitrieva | Boris L. Kozlovsky
Hyperspectral plant phenotyping is a method that has a wide range of applications in various fields, including agriculture, forestry, food processing, medicine and plant breeding. It can be used to obtain a large amount of spectral and spatial information about an object. However, it is important to acknowledge the inherent limitations of this approach, which include the presence of noise and the redundancy of information. The present study aims to assess a novel approach to hyperspectral data preprocessing, namely Random Reflectance (RR), for the classification of plant species. This study employs machine learning (ML) algorithms, specifically Random Forest (RF) and Gradient Boosting (GB), to analyse the performance of RR in comparison to Min&ndash:Max Normalisation (MMN) and Principal Component Analysis (PCA). The testing process was conducted on data derived from the proximal hyperspectral imaging (HSI) of leaves from three different maple species, which were sampled from trees at 7&ndash:10-day intervals between 2021 and 2024. The RF algorithm demonstrated a relative increase of 8.8% in the F1-score in 2021, 9.7% in 2022, 11.3% in 2023 and 11.8% in 2024. The GB algorithm exhibited a similar trend: 6.5% in 2021, 13.2% in 2022, 16.5% in 2023 and 17.4% in 2024. It has been demonstrated that hyperspectral data preprocessing with the MMN and PCA methods does not result in enhanced accuracy when classifying species using ML algorithms. The impact of preprocessing spectral profiles using the RR method may be associated with the observation that the synthesised set of spectral profiles exhibits a stronger reflection of the general parameters of spectral reflectance compared to the set of actual profiles. Subsequent research endeavours are anticipated to elucidate a mechanistic rationale for the RR method in conjunction with the RF and GB algorithms. Furthermore, the efficacy of this method will be evaluated through its application in deep machine learning algorithms.
Afficher plus [+] Moins [-]Mots clés AGROVOC
Informations bibliographiques
Cette notice bibliographique a été fournie par Multidisciplinary Digital Publishing Institute
Découvrez la collection de ce fournisseur de données dans AGRIS