Discovering Hidden Patterns: Applying Topic Modeling in Qualitative Research

dc.contributor.author Tat, Osman
dc.contributor.author Aydogan, Izzettin
dc.date.accessioned 2025-05-10T17:34:37Z
dc.date.available 2025-05-10T17:34:37Z
dc.date.issued 2024
dc.description Tat, Osman/0000-0003-2950-9647; Aydogan, Izzettin/0000-0002-5908-1285 en_US
dc.description.abstract In qualitative studies, researchers must devote a significant amount of time and effort to extracting meaningful themes from large sets of texts and examining the links between themes, which are frequently done manually. The availability of natural language models has enabled the application of a wide range of techniques to automatically detecting hierarchy, linkages, and latent themes in texts. This paper aims to investigate the coherence of the topics acquired from the analysis with the predefined themes, as well as the hierarchy between topics, the similarity, and the proximity-distance between topics by means of the topic model based on BERTopic using unstructured qualitative data. This paper aims to investigate the coherence of the topics acquired from the analysis with the predefined themes, as well as the hierarchy between topics, the similarity, and the proximity-distance between topics by means of the topic model based on BERTopic using unstructured qualitative data. The qualitative data for this study was gathered from 106 students engaged in a university-run pedagogical formation certificate program. In BERTopic procedure, the paraphrase-multilingual-MiniLM-L12-v2 model was used as the sentence transformer model, UMAP was used as the dimension reduction method, and HDBSCAN algorithm as the clustering method. It was found that BERTopic successfully identified six topics corresponding to the six predicted themes in unstructured texts. Moreover, 74% of the texts containing some certain themes could be classified accurately. The algorithm effectively discerned which themes were analogous and which had significant distinctions from others. It was concluded that BERTopic is a procedure which is capable of identifying themes that researchers may not notice, depending on the data density in qualitative data analysis, and has the potential to enable qualitative research to reach more detailed findings. en_US
dc.identifier.doi 10.21031/epod.1539694
dc.identifier.issn 1309-6575
dc.identifier.scopus 2-s2.0-85210928701
dc.identifier.uri https://doi.org/10.21031/epod.1539694
dc.identifier.uri https://hdl.handle.net/20.500.14720/13866
dc.language.iso en en_US
dc.publisher Assoc Measurement & Evaluation Education & Psychology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Bertopic en_US
dc.subject Natural Language Processing en_US
dc.subject Topic Modeling en_US
dc.title Discovering Hidden Patterns: Applying Topic Modeling in Qualitative Research en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Tat, Osman/0000-0003-2950-9647
gdc.author.id Aydogan, Izzettin/0000-0002-5908-1285
gdc.author.scopusid 57211186456
gdc.author.scopusid 57476477000
gdc.author.wosid Aydoğan, Izzettin/Adn-1750-2022
gdc.author.wosid Tat, Osman/Lmn-5948-2024
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department T.C. Van Yüzüncü Yıl Üniversitesi en_US
gdc.description.departmenttemp [Tat, Osman; Aydogan, Izzettin] Van Yuzuncu Yil Univ, Fac Educ, Van, Turkiye en_US
gdc.description.endpage 259 en_US
gdc.description.issue 3 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q4
gdc.description.startpage 247 en_US
gdc.description.volume 15 en_US
gdc.description.woscitationindex Emerging Sources Citation Index
gdc.description.wosquality N/A
gdc.identifier.trdizinid 1276844
gdc.identifier.wos WOS:001356202700001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type TR-Dizin

Files