A Comprehensive Hybrid Approach for Indoor Scene Recognition Combining CNNs and Text-Based Features

dc.authorscopusid 57200138639
dc.authorscopusid 60098967200
dc.authorscopusid 57200139869
dc.contributor.author Uckan, Taner
dc.contributor.author Aslan, Cengiz
dc.contributor.author Hark, Cengiz
dc.date.accessioned 2025-09-30T16:36:07Z
dc.date.available 2025-09-30T16:36:07Z
dc.date.issued 2025
dc.department T.C. Van Yüzüncü Yıl Üniversitesi en_US
dc.department-temp [Uckan] Taner, Department of Computer Engineering, Van Yüzüncü Yıl Üniversitesi, Van, Turkey; [Aslan] Cengiz, Department of Artificial Intelligence and Robotics, Van Yüzüncü Yıl Üniversitesi, Van, Turkey; [Hark] Cengiz, Department of Computer Engineering, Inönü Üniversitesi, Malatya, Turkey en_US
dc.description.abstract Highlights. Main findings: (1) An innovative two-channel hybrid model is proposed that integrates convolutional neural networks (CNNs) with a text-based classifier. (2) An extended dataset derived from multiple object recognition models increases input data diversity and yields a text-based classifier accuracy of 73.30%. (3) The hybrid model attains an accuracy of 90.46%, an improvement of 8.33% over CNN-only models. Implications: (1) Efficient and scalable methodology: EfficientNet is used for CNN-based feature extraction and Bag-of-Words for text representation, ensuring computational efficiency and scalability. (2) Application potential: the approach addresses challenges in indoor scene recognition, such as complex backgrounds and object diversity, and shows significant potential for applications in robotics, intelligent surveillance, and assistive systems.
Indoor scene recognition is a computer vision task that identifies indoor environments such as offices, libraries, kitchens, and restaurants. The task matters for applications in robotics, security, and assistance for individuals with disabilities, because it enables spaces to be categorized and contextual information to be provided. Convolutional Neural Networks (CNNs) are commonly employed in this field. While CNNs perform well in outdoor scene recognition by focusing on global features such as mountains and skies, they often struggle with indoor scenes, where local features like furniture and objects are more critical. In this study, features extracted on the “MIT 67 Indoor Scene” dataset by a CNN and by a text-based model that uses object recognition outputs are combined into a two-channel hybrid model. The experimental results demonstrate that this hybrid approach, which integrates natural language processing and image processing techniques, improves on the test accuracy of the image-only model by 8.33%, reaching 90.46%. Furthermore, this study contributes to new application areas in remote sensing, particularly indoor scene understanding and indoor mapping. © 2025 Elsevier B.V., All rights reserved. en_US
dc.identifier.doi 10.3390/s25175350
dc.identifier.issn 1424-8220
dc.identifier.issue 17 en_US
dc.identifier.pmid 40942779
dc.identifier.scopus 2-s2.0-105015894592
dc.identifier.scopusquality Q2
dc.identifier.uri https://doi.org/10.3390/s25175350
dc.identifier.uri https://hdl.handle.net/20.500.14720/28619
dc.identifier.volume 25 en_US
dc.identifier.wosquality Q2
dc.language.iso en en_US
dc.publisher Multidisciplinary Digital Publishing Institute (MDPI) en_US
dc.relation.ispartof Sensors en_US
dc.relation.publicationcategory Article - International Refereed Journal - Institutional Faculty Member en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Deep Learning en_US
dc.subject EfficientNet en_US
dc.subject Indoor Scene Recognition en_US
dc.subject Object Recognition en_US
dc.subject Text Classification en_US
dc.subject Character Recognition en_US
dc.subject Computational Efficiency en_US
dc.subject Convolutional Neural Networks (CNNs) en_US
dc.subject Image Enhancement en_US
dc.subject Natural Language Processing (NLP) en_US
dc.subject Security Systems en_US
dc.subject Hybrid Model en_US
dc.subject Scene Recognition en_US
dc.subject Computer Vision en_US
dc.subject Robotics en_US
dc.title A Comprehensive Hybrid Approach for Indoor Scene Recognition Combining CNNs and Text-Based Features en_US
dc.type Article en_US
dspace.entity.type Publication
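
The two-channel idea described in the abstract (a CNN feature channel plus a Bag-of-Words channel built from object recognition outputs) can be illustrated with a minimal sketch. This is not the authors' implementation: the CNN feature vectors below are hand-written placeholders rather than EfficientNet outputs, the object labels and scene names are invented toy data, fusion is assumed to be plain concatenation, and a nearest-centroid classifier stands in for whatever classifier the paper actually uses. All function names are hypothetical.

```python
from collections import Counter


def build_vocabulary(label_lists):
    """Map each distinct object label to a column index (sorted for determinism)."""
    distinct = sorted({label for labels in label_lists for label in labels})
    return {label: index for index, label in enumerate(distinct)}


def bag_of_words(labels, vocabulary):
    """Count occurrences of each vocabulary label; unknown labels are ignored."""
    counts = Counter(labels)
    return [float(counts.get(label, 0)) for label in vocabulary]


def fuse(cnn_features, text_features):
    """Assumed fusion step: concatenate the two channels into one vector."""
    return list(cnn_features) + list(text_features)


def fit_centroids(vectors, classes):
    """Stand-in classifier: average the fused vectors of each class."""
    sums, counts = {}, Counter(classes)
    for vector, cls in zip(vectors, classes):
        acc = sums.setdefault(cls, [0.0] * len(vector))
        for i, value in enumerate(vector):
            acc[i] += value
    return {cls: [v / counts[cls] for v in acc] for cls, acc in sums.items()}


def predict(centroids, vector):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda cls: squared_distance(centroids[cls], vector))


# Toy data: object labels as an object detector might emit them (invented).
detections = [
    ["oven", "sink", "refrigerator"],
    ["desk", "monitor", "chair"],
    ["sink", "oven"],
    ["chair", "desk", "keyboard"],
]
# Placeholder 2-dimensional "CNN features"; a real model would emit many more.
cnn_features = [[0.9, 0.1], [0.2, 0.8], [0.8, 0.2], [0.1, 0.9]]
scenes = ["kitchen", "office", "kitchen", "office"]

vocabulary = build_vocabulary(detections)
fused_training = [
    fuse(cnn, bag_of_words(labels, vocabulary))
    for cnn, labels in zip(cnn_features, detections)
]
centroids = fit_centroids(fused_training, scenes)

# An image whose CNN features are ambiguous but whose detected objects are not.
query = fuse([0.5, 0.5], bag_of_words(["sink", "oven"], vocabulary))
predicted_scene = predict(centroids, query)
print(predicted_scene)
```

In this toy setup the placeholder CNN channel alone cannot separate the two scenes for the query image, so the decision is carried by the object-label channel, which is the kind of complementary signal the abstract attributes to local features in indoor scenes.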
