Automating Thematic Analysis With Multi-Agent LLM Systems

Sankaranarayanan, S.; Borchers, C.; Simon, S.; Tajik, E.; Atas, A.H.; Celik, B.; Shahrokhian, B.

Automating Thematic Analysis With Multi-Agent LLM Systems

dc.authorscopusid	57200651663
dc.authorscopusid	57224723719
dc.authorscopusid	57889613800
dc.authorscopusid	57210696892
dc.authorscopusid	58408248400
dc.authorscopusid	56275235700
dc.authorscopusid	59243988400
dc.contributor.author	Sankaranarayanan, S.
dc.contributor.author	Borchers, C.
dc.contributor.author	Simon, S.
dc.contributor.author	Tajik, E.
dc.contributor.author	Atas, A.H.
dc.contributor.author	Celik, B.
dc.contributor.author	Shahrokhian, B.
dc.date.accessioned	2025-07-30T16:34:40Z
dc.date.available	2025-07-30T16:34:40Z
dc.date.issued	2025
dc.department	T.C. Van Yüzüncü Yıl Üniversitesi	en_US
dc.department-temp	[Sankaranarayanan S.] Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, 15213, PA, United States; [Borchers C.] Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, 15213, PA, United States; [Simon S.] Copenhagen University, Nørregade 10, København, 1172, Denmark; [Tajik E.] Florida State University, 222 S Copeland St, Tallahassee, 32306, FL, United States; [Atas A.H.] Galatasaray University, Ortaköy, Çırağan Cd. No:36, Beşiktaş, İstanbul, 34349, Turkey; [Celik B.] Van Yuzuncu Yil University, Bardakçı, Yüzüncü Yıl Üniversitesi Kampüsü, Tuşba, Van, 65090, Turkey; [Balzan F.] University of Bologna, Via Zamboni, 33, BO, Bologna, 40126, Italy; [Shahrokhian B.] Arizona State University, 1151 S Forest Ave, Tempe, 85281, AZ, United States	en_US
dc.description.abstract	Thematic analysis (TA) is a method used to identify, examine, and present themes within data. TA is often a manual, multistep, and time-intensive process requiring collaboration among multiple researchers. TA’s iterative subtasks, including coding data, identifying themes, and resolving inter-coder disagreements, are especially laborious for large data sets. Given recent advances in natural language processing, Large Language Models (LLMs) offer the potential for automation at scale. Recent literature has explored the automation of isolated steps of the TA process, tightly coupled with researcher involvement at each step. Research using such hybrid approaches has reported issues in LLM generations, such as hallucination, inconsistent output, and technical limitations (e.g., token limits). This paper proposes a multi-agent system, differing from previous systems using an orchestrator LLM agent that spins off multiple LLM sub-agents for each step of the TA process, mirroring all the steps previously done manually. In addition to more accurate analysis results, this iterative coding process based on agents is also expected to result in increased transparency of the process, as analytical stages are documented step-by-step. We study the extent to which such a system can perform a full TA without human supervision. Preliminary results indicate human-quality codes and themes based on alignment with human-derived codes. Nevertheless, we still observe differences in coding complexity and thematic depth. Despite these differences, the system provides critical insights on the path to TA automation while maintaining consistency, efficiency, and transparency in future qualitative data analysis, which our open-source datasets, coding results, and analysis enable. © 2025 for this paper by its authors.	en_US
dc.identifier.endpage	238	en_US
dc.identifier.issn	1613-0073
dc.identifier.scopus	2-s2.0-105011032981
dc.identifier.scopusquality	Q4
dc.identifier.startpage	229	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.14720/28187
dc.identifier.volume	3995	en_US
dc.identifier.wosquality	N/A
dc.language.iso	en	en_US
dc.publisher	CEUR-WS	en_US
dc.relation.ispartof	CEUR Workshop Proceedings -- Joint of LAK 2025 Workshops, LAK-WS 2025 -- 3 March 2025 through 4 March 2025 -- Dublin -- 210311	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Large Language Models	en_US
dc.subject	LLMs	en_US
dc.subject	Multi-Agent Systems	en_US
dc.subject	Qualitative Analysis	en_US
dc.subject	Qualitative Coding	en_US
dc.subject	Thematic Analysis	en_US
dc.title	Automating Thematic Analysis With Multi-Agent LLM Systems	en_US
dc.type	Conference Object	en_US

Collections

Scopus İndeksli Yayınlar Koleksiyonu

Automating Thematic Analysis With Multi-Agent LLM Systems

Files

Collections