Dados Bibliográficos

AUTOR(ES) Zhao Fang , Rob Procter , Lama Alqazlan , Michael Castelle
AFILIAÇÃO(ÕES) Centre for Education Studies, University of Warwick, Coventry, United Kingdom
ANO 2025
TIPO Artigo
PERIÓDICO Big Data & Society
ISSN 2053-9517
E-ISSN 2053-9517
DOI 10.1177/20539517251347598
ADICIONADO EM 2025-08-18

Resumo

The availability of big data has significantly influenced the possibilities and methodological choices for conducting large-scale behavioural and social science research. In the context of qualitative data analysis, a major challenge is that conventional methods require intensive manual labour and are often impractical to apply to large datasets. One effective way to address this issue is by integrating emerging computational methods to overcome scalability limitations. However, a critical concern for researchers is the trustworthiness of results when machine learning and natural language processing tools are used to analyse such data. We argue that confidence in the credibility and robustness of results depends on adopting a 'human-in-the-loop' methodology that is able to provide researchers with control over the analytical process, while retaining the benefits of using machine learning and natural language processing. With this in mind, we propose a novel methodological framework for computational grounded theory that supports the analysis of large qualitative datasets, while maintaining the rigour of established grounded theory methodologies. To illustrate the framework's value, we present the results of testing it on a dataset collected from Reddit in a study aimed at understanding tutors' experiences in the gig economy.

Ferramentas