Predicting Spoilage Intensity Level in Sausage Products Using Explainable Machine Learning and GAN-Based Data Augmentation

dc.authorscopusid 59399954800
dc.authorscopusid 19638408400
dc.authorscopusid 57205761373
dc.authorscopusid 57215308142
dc.authorscopusid 59400043000
dc.contributor.author Ince, Volkan
dc.contributor.author Bader-El-Den, Mohamed
dc.contributor.author Esmeli, Ramazan
dc.contributor.author Maurya, Lalit
dc.contributor.author Sari, Omer Faruk
dc.date.accessioned 2025-09-03T16:37:48Z
dc.date.available 2025-09-03T16:37:48Z
dc.date.issued 2025
dc.department T.C. Van Yüzüncü Yıl Üniversitesi en_US
dc.department-temp [Ince, Volkan; Bader-El-Den, Mohamed; Maurya, Lalit; Sari, Omer Faruk] Univ Portsmouth, Comp Sci, Portsmouth PO1 3GY, England; [Bader-El-Den, Mohamed] Abdullah Al Salem Univ AASU, Coll Comp & Syst Engn, Khaldiya, Kuwait; [Esmeli, Ramazan] Van Yuzuncu Yil Univ, Comp Engn Dept, TR-65080 Van, Turkiye en_US
dc.description.abstract Spoilage in processed meat products, such as poultry and pork sausages, presents significant challenges for food safety, quality control, and waste reduction. This study presents a machine learning-based framework to classify spoilage intensity levels using sensory, physicochemical, and microbiological features. To overcome limitations caused by small datasets, we applied synthetic data augmentation using a tabular variational autoencoder (TVAE) to generate high-fidelity samples that enhance model generalization. Additionally, traditional oversampling techniques such as SMOTE and ADASYN were employed for comparative purposes and to further address class imbalance issues. Seven machine learning classifiers were evaluated logistic regression, support vector machine, K-nearest neighbors, random forest, gradient boosting, voting classifier, and multilayer perceptron. The best classification performance was achieved when models were trained on GAN-based synthetic data and tested on real samples. For poultry sausage spoilage prediction, the gradient boosting classifier reached the highest accuracy of 97%. For pork sausages, random forest achieved the highest accuracy of 95%. These results confirm the effectiveness of data augmentation in improving predictive robustness. To ensure model transparency, we integrated explainable AI techniques SHAP and LIME into the pipeline. These analyses revealed that sampling time, CO2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_2$$\end{document} concentration, pH, and microbial species such as Lactobacillus curvatus and Leuconostoc carnosum were among the most influential features in spoilage prediction. The combination of synthetic data generation and interpretable machine learning enables a reliable, scalable, and explainable approach to spoilage classification. This methodology has strong potential for enhancing quality control systems in the meat industry while reducing waste and improving safety along the food supply chain. en_US
dc.description.sponsorship University of Portsmouth; Ministry of National Education in Turkey en_US
dc.description.sponsorship The authors sincerely thank the University of Portsmouth for its support and the resources that made this research possible. Additionally, we extend our appreciation to our colleagues and collaborators for their insightful feedback and constructive contributions throughout the course of this work. We would also like to thank the Directorate General of Higher Foreign Education, the Ministry of National Education in Turkey. en_US
dc.description.woscitationindex Science Citation Index Expanded
dc.identifier.doi 10.1007/s11947-025-03971-x
dc.identifier.issn 1935-5130
dc.identifier.issn 1935-5149
dc.identifier.scopus 2-s2.0-105013208484
dc.identifier.scopusquality Q1
dc.identifier.uri https://doi.org/10.1007/s11947-025-03971-x
dc.identifier.uri https://hdl.handle.net/20.500.14720/28323
dc.identifier.wos WOS:001549113600001
dc.identifier.wosquality Q1
dc.language.iso en en_US
dc.publisher Springer en_US
dc.relation.ispartof Food and Bioprocess Technology en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Machine Learning en_US
dc.subject Generative AI en_US
dc.subject Explainable AI en_US
dc.subject Sausages Spoilage en_US
dc.title Predicting Spoilage Intensity Level in Sausage Products Using Explainable Machine Learning and GAN-Based Data Augmentation en_US
dc.type Article en_US
dspace.entity.type Publication

Files