Whose Emotions and Moral Sentiments do Language Models Reflect?

Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman


Abstract
Language models (LMs) are known to represent the perspectives of some social groups better than others, which may impact their performance, especially on subjective tasks such as content moderation and hate speech detection. To explore how LMs represent different perspectives, existing research has focused on positional alignment, i.e., how closely the models mimic the opinions and stances of different groups, e.g., liberals or conservatives. However, human communication also encompasses emotional and moral dimensions. We define the problem of affective alignment, which measures how well an LM's emotional and moral tone reflects that of different groups. By comparing the affect of responses generated by 36 LMs to the affect of Twitter messages written by two ideological groups, we observe significant misalignment of LMs with both ideological groups. This misalignment is larger than the partisan divide in the U.S. Even after steering the LMs towards specific ideological perspectives, the misalignment and liberal tendencies of the models persist, suggesting a systemic bias within LMs.
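
As a rough, hypothetical illustration of how affective alignment might be quantified (the paper's actual emotion/moral taxonomy, classifier, and distance measure are not reproduced here), the Python sketch below compares the distribution of emotion labels in LM-generated responses with the distribution in a group's tweets using the Jensen-Shannon distance. The label set, function names, and toy data are all assumptions for illustration only.

# Illustrative sketch, not the paper's pipeline: quantify affective
# (mis)alignment as the distance between two emotion-label distributions,
# one from LM responses and one from a group's tweets.
from collections import Counter

import numpy as np
from scipy.spatial.distance import jensenshannon

# Assumed label set; the paper's taxonomy may differ.
EMOTIONS = ["anger", "joy", "sadness", "fear", "disgust", "none"]

def emotion_distribution(labels):
    """Turn per-message emotion labels into a probability distribution over EMOTIONS."""
    counts = Counter(labels)
    dist = np.array([counts.get(e, 0) for e in EMOTIONS], dtype=float)
    return dist / dist.sum()

def affective_misalignment(lm_labels, group_labels):
    """Jensen-Shannon distance between LM and group emotion distributions
    (0 = identical affect; larger values = greater misalignment)."""
    return float(jensenshannon(emotion_distribution(lm_labels),
                               emotion_distribution(group_labels)))

# Toy usage: in practice the labels would come from an emotion classifier
# applied to LM responses and to tweets from, e.g., liberal or conservative users.
lm_labels = ["joy", "joy", "none", "sadness"]
group_labels = ["anger", "fear", "anger", "none"]
print(affective_misalignment(lm_labels, group_labels))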
Anthology ID: 2024.findings-acl.395
Volume: Findings of the Association for Computational Linguistics: ACL 2024
Month: August
Year: 2024
Address: Bangkok, Thailand
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 6611–6631
URL: https://aclanthology.org/2024.findings-acl.395
DOI: 10.18653/v1/2024.findings-acl.395
Cite (ACL): Zihao He, Siyi Guo, Ashwin Rao, and Kristina Lerman. 2024. Whose Emotions and Moral Sentiments do Language Models Reflect?. In Findings of the Association for Computational Linguistics: ACL 2024, pages 6611–6631, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal): Whose Emotions and Moral Sentiments do Language Models Reflect? (He et al., Findings 2024)
PDF: https://aclanthology.org/2024.findings-acl.395.pdf