Real-Time Food Allergen Detection Using OCR-Enhanced Machine Learning Techniques

dc.contributor.author Kina, Erol
dc.date.accessioned 2025-12-30T16:04:51Z
dc.date.available 2025-12-30T16:04:51Z
dc.date.issued 2025
dc.description.abstract Food allergies are a significant public health concern, emphasizing the need for precise and comprehensive allergen identification in food products. Despite the critical importance of allergen detection, existing allergen food datasets and detection approaches exhibit several limitations. These include small dataset sizes and low accuracy, particularly in real-time scenarios. To address these challenges, this study proposes a novel machine learning-based system evaluated in both real-time and offline environments. The proposed system is designed to analyze ingredient lists extracted from scanned product labels. By leveraging Optical Character Recognition (OCR) technology, the system efficiently retrieves ingredient information in real-time, enabling accurate identification of allergenic components. Once the ingredient information is extracted using OCR, feature extraction techniques such as Bag of Words (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), and Global Vectors for Word Representation (GloVe) are applied. These features play a critical role in training various machine learning and deep learning models. Among the tested models, Logistic Regression (LR) outperformed others, achieving an impressive accuracy of 0.99 with a low computational cost of 13 milliseconds in offline testing. In real-time testing, where product images are captured and processed through the pipeline, the system demonstrated robust performance with a 0.90 accuracy score. en_US
dc.identifier.doi 10.7717/peerj-cs.3338
dc.identifier.issn 2376-5992
dc.identifier.uri https://doi.org/10.7717/peerj-cs.3338
dc.identifier.uri https://hdl.handle.net/20.500.14720/29315
dc.language.iso en en_US
dc.publisher Peerj Inc en_US
dc.relation.ispartof Peerj Computer Science en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Food Allergy en_US
dc.subject Food Reaction en_US
dc.subject Machine Learning en_US
dc.subject Ocr en_US
dc.subject Feature Extraction en_US
dc.title Real-Time Food Allergen Detection Using OCR-Enhanced Machine Learning Techniques en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional Kina, Erol
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department T.C. Van Yüzüncü Yıl Üniversitesi en_US
gdc.description.departmenttemp [Kina, Erol] Van Yuzuncu Yil Univ, Ozalp Vocat Sch, Dept Comp Programming, Van, Turkiye en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.volume 11 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.wos WOS:001639365900001
gdc.index.type WoS

Files