Can Large Language Models Transform Computational Social Science?

Bibliographic Details
Title: Can Large Language Models Transform Computational Social Science?
Authors: Ziems, Caleb1 (AUTHOR) cziems@stanford.edu, Held, William2 (AUTHOR) wheld3@gatech.edu, Shaikh, Omar1 (AUTHOR) oshaikh@stanford.edu, Chen, Jiaao2 (AUTHOR) jiaaochen@gatech.edu, Zhang, Zhehao3 (AUTHOR) zhehao.zhang.gr@dartmouth.edu, Yang, Diyi1 (AUTHOR) diyiy@stanford.edu
Superior Title: Computational Linguistics. Mar2024, Vol. 50 Issue 1, p237-291. 55p.
Subject Terms: *LANGUAGE models, *ROAD maps, *ENGLISH language
Abstract: Large language models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the computational social science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers' gold references. We conclude that the performance of today's LLMs can augment the CSS research pipeline in two ways: (1) serving as zero-shot data annotators on human annotation teams, and (2) bootstrapping challenging creative generation tasks (e.g., explaining the underlying attributes of a text). In summary, LLMs are posed to meaningfully participate in social science analysis in partnership with humans. [ABSTRACT FROM AUTHOR]
Copyright of Computational Linguistics is the property of MIT Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Academic Search Premier
Full text is not displayed to guests.
Description
Description not available.