Discovering Hidden Patterns: Applying Topic Modeling in Qualitative Research
| dc.contributor.author | Tat, Osman | |
| dc.contributor.author | Aydogan, Izzettin | |
| dc.date.accessioned | 2025-05-10T17:34:37Z | |
| dc.date.available | 2025-05-10T17:34:37Z | |
| dc.date.issued | 2024 | |
| dc.description | Tat, Osman/0000-0003-2950-9647; Aydogan, Izzettin/0000-0002-5908-1285 | en_US |
| dc.description.abstract | In qualitative studies, researchers must devote a significant amount of time and effort to extracting meaningful themes from large sets of texts and examining the links between themes, which are frequently done manually. The availability of natural language models has enabled the application of a wide range of techniques to automatically detecting hierarchy, linkages, and latent themes in texts. This paper aims to investigate the coherence of the topics acquired from the analysis with the predefined themes, as well as the hierarchy between topics, the similarity, and the proximity-distance between topics by means of the topic model based on BERTopic using unstructured qualitative data. This paper aims to investigate the coherence of the topics acquired from the analysis with the predefined themes, as well as the hierarchy between topics, the similarity, and the proximity-distance between topics by means of the topic model based on BERTopic using unstructured qualitative data. The qualitative data for this study was gathered from 106 students engaged in a university-run pedagogical formation certificate program. In BERTopic procedure, the paraphrase-multilingual-MiniLM-L12-v2 model was used as the sentence transformer model, UMAP was used as the dimension reduction method, and HDBSCAN algorithm as the clustering method. It was found that BERTopic successfully identified six topics corresponding to the six predicted themes in unstructured texts. Moreover, 74% of the texts containing some certain themes could be classified accurately. The algorithm effectively discerned which themes were analogous and which had significant distinctions from others. It was concluded that BERTopic is a procedure which is capable of identifying themes that researchers may not notice, depending on the data density in qualitative data analysis, and has the potential to enable qualitative research to reach more detailed findings. | en_US |
| dc.identifier.doi | 10.21031/epod.1539694 | |
| dc.identifier.issn | 1309-6575 | |
| dc.identifier.scopus | 2-s2.0-85210928701 | |
| dc.identifier.uri | https://doi.org/10.21031/epod.1539694 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14720/13866 | |
| dc.language.iso | en | en_US |
| dc.publisher | Assoc Measurement & Evaluation Education & Psychology | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Bertopic | en_US |
| dc.subject | Natural Language Processing | en_US |
| dc.subject | Topic Modeling | en_US |
| dc.title | Discovering Hidden Patterns: Applying Topic Modeling in Qualitative Research | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.id | Tat, Osman/0000-0003-2950-9647 | |
| gdc.author.id | Aydogan, Izzettin/0000-0002-5908-1285 | |
| gdc.author.scopusid | 57211186456 | |
| gdc.author.scopusid | 57476477000 | |
| gdc.author.wosid | Aydoğan, Izzettin/Adn-1750-2022 | |
| gdc.author.wosid | Tat, Osman/Lmn-5948-2024 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.description.department | T.C. Van Yüzüncü Yıl Üniversitesi | en_US |
| gdc.description.departmenttemp | [Tat, Osman; Aydogan, Izzettin] Van Yuzuncu Yil Univ, Fac Educ, Van, Turkiye | en_US |
| gdc.description.endpage | 259 | en_US |
| gdc.description.issue | 3 | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q4 | |
| gdc.description.startpage | 247 | en_US |
| gdc.description.volume | 15 | en_US |
| gdc.description.woscitationindex | Emerging Sources Citation Index | |
| gdc.description.wosquality | N/A | |
| gdc.identifier.trdizinid | 1276844 | |
| gdc.identifier.wos | WOS:001356202700001 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.index.type | TR-Dizin |
