Explainable Deep Learning to Predict Kelp Geographical Origin from Volatile Organic Compound Analysis
2025
Xuming Kang | Zhijun Tan | Yanfang Zhao | Lin Yao | Xiaofeng Sheng | Yingying Guo
In addition to its flavor and nutritional value, the origin of kelp has become a crucial factor influencing consumer choices. Nevertheless, research on kelp&rsquo:s origin traceability by volatile organic compound (VOC) analysis is lacking, and the application of deep learning in this field remains scarce due to its black-box nature. To address this gap, we attempted to identify the origin of kelp by analyzing its VOCs in conjunction with explainable deep learning. In this work, we identified 115 distinct VOCs in kelp samples using gas chromatography coupled with ion mobility spectroscopy (GC-IMS), of which 68 categories were discernible. Consequently, we developed a comprehensible one-dimensional convolutional neural network (1D-CNN) model that incorporated 107 VOCs exhibiting significant regional disparities (p <: 0.05). The model successfully discerns the origin of kelp, achieving perfect metrics across accuracy (100%), precision (100%), recall (100%), F1 score (100%), and AUC (1.0). SHapley Additive exPlanations (SHAP) analysis highlighted the impact of features such as 1-Octen-3-ol-M, (+)-limonene, allyl sulfide-D, 1-hydroxy-2-propanone-D, and (E)-2-hexen-1-al-M on the model output. This research provides deeper insights into how critical product features correlate with specific geographic information, which in turn boosts consumer trust and promotes practical utilization in actual settings.
显示更多 [+] 显示较少 [-]