Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Messi H.J. Lee, Jacob M. Montgomery, Calvin K. Lai

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    8 Scopus citations

    Abstract

    Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLMs that resembles a social psychological phenomenon where socially subordinate groups are perceived as more homogeneous than socially dominant groups. We had ChatGPT, a state-of-the-art LLM, generate texts about intersectional group identities and compared those texts on measures of homogeneity. We consistently found that ChatGPT portrayed African, Asian, and Hispanic Americans as more homogeneous than White Americans, indicating that the model described racial minority groups with a narrower range of human experience. ChatGPT also portrayed women as more homogeneous than men, but these differences were small. Finally, we found that the effect of gender differed across racial/ethnic groups such that the effect of gender was consistent within African and Hispanic Americans but not within Asian and White Americans. We argue that the tendency of LLMs to describe groups as less diverse risks perpetuating stereotypes and discriminatory behavior.

    Original languageEnglish
    Title of host publication2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024
    PublisherAssociation for Computing Machinery, Inc
    Pages1321-1340
    Number of pages20
    ISBN (Electronic)9798400704505
    DOIs
    StatePublished - Jun 3 2024
    Event2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024 - Rio de Janeiro, Brazil
    Duration: Jun 3 2024Jun 6 2024

    Publication series

    Name2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024

    Conference

    Conference2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024
    Country/TerritoryBrazil
    CityRio de Janeiro
    Period06/3/2406/6/24

    Keywords

    • AI Bias
    • Homogeneity Bias
    • Large Language Models
    • Perceived Variability
    • Stereotyping

    Fingerprint

    Dive into the research topics of 'Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans'. Together they form a unique fingerprint.

    Cite this