A Technological Construction of Society: Comparing GPT-4 and Human Respondents for Occupational Evaluation in the UK

British Journal of Industrial Relations, online first, https://doi.org/10.1111/bjir.12840

54 Pages Posted: 13 Feb 2024

See all articles by Pawel Gmyrek

Pawel Gmyrek

ILO

Christoph Lutz

BI Norwegian Business School

Gemma Newlands

BI Norwegian Business School; University of Amsterdam

Date Written: January 19, 2024

Abstract

Despite initial research about the biases and perceptions of Large Language Models (LLMs), we lack evidence on how LLMs evaluate occupations, especially in comparison to human evaluators. In this paper, we present a systematic comparison of occupational evaluations by GPT-4 with those from an in-depth, high-quality and recent human respondents survey in the United Kingdom. Covering the full ISCO-08 occupational landscape, with 580 occupations and two distinct metrics (prestige and social value), our findings indicate that GPT-4 and human scores are highly correlated across all ISCO-08 major groups. In absolute terms, GPT-4 scores are more generous than those of the human respondents. At the same time, GPT-4 substantially under- or overestimates the occupational prestige and social value of many occupations, particularly for emerging digital and stigmatized occupations. Our analyses show both the potentials and risks of using LLM-generated data for sociological and occupational research. Potentials include LLMs’ efficiency, cost effectiveness, speed, and accuracy in capturing general tendencies. By contrast, there are risks of bias, contextual misalignment, and downstream issues, for example when problematic and opaque occupational evaluations of LLMs may feed back into working life, thus leading to potentially problematic technological constructions of society. We also discuss the policy implications of our findings for the integration of LLM tools into the world of work.

Keywords: artificial intelligence, generative AI, large language models, future of work, occupational classification, occupational qualification, technology, information and communication technologies, technological change, data analysis, occupational prestige, occupational social value

Suggested Citation

Gmyrek, Pawel and Lutz, Christoph and Newlands, Gemma, A Technological Construction of Society: Comparing GPT-4 and Human Respondents for Occupational Evaluation in the UK (January 19, 2024). British Journal of Industrial Relations, online first, https://doi.org/10.1111/bjir.12840, Available at SSRN: https://ssrn.com/abstract=4700366 or http://dx.doi.org/10.2139/ssrn.4700366

Pawel Gmyrek (Contact Author)

ILO ( email )

CH-1211 Geneva 22
Switzerland

Christoph Lutz

BI Norwegian Business School ( email )

Nydalsveien 37
Oslo, Oslo 0448
Norway
+4746410206 (Phone)

Gemma Newlands

BI Norwegian Business School ( email )

Nydalsveien 37
Oslo, 0442
Norway

University of Amsterdam ( email )

Roetersstraat 11
Amsterdam, NE 1018 WB
Netherlands

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
99
Abstract Views
603
Rank
513,980
PlumX Metrics