Prosocial behavior in Large Language Models:Value alignment and afective mechanisms

导出

摘要 While advanced Large Language Models(LLMs)can simulate human-like prosocial behaviors,the degree to which they align with human prosocial values and the underlying afective mechanisms remain unclear.This study addressed these gaps using the third-party punishment(TPP)paradigm,comparing LLM agents(GPT and DeepSeek series)with human participants(n=100).The LLM agents(n=500,100 agents per model)were one-to-one constructed based on the demographic and psychological features of human participants.Prompt engineering was employed to initiate TPP games and record punitive decisions and afective responses in LLM agents.Results revealed that:(1)GPT-4o,DeepSeek-V3,and DeepSeek-R1 models demonstrated stronger fairness value alignment,choosing punitive options more frequently than humans in TPP games;(2)all LLMs replicated the human pathway from unfairness through negative afective response to punitive decisions,with stronger mediation efects of negative emotions observed in DeepSeek models than GPT models;(3)only DeepSeek-R1 exhibited the human-like positive feedback loop from previous punitive decisions to positive afective feedback and subsequent punitive choices;(4)most LLMs(excluding GPT-3.5)showed signifcant representational similarity to human afect-decision patterns;(5)notably,all LLMs displayed rigid afective dynamics,characterized by lower afective variability and higher afective inertia than the fexible,contextsensitive fuctuations observed in humans.These fndings highlight notable advances in prosocial value alignment but underscore the necessity to enhance their afective dynamics to foster robust,adaptive prosocial LLMs.Such advancements could not only accelerate LLMs'alignment with human values but also provide empirical support for the broader applicability of prosocial theories to LLM agents.

作者 Hao LIU Yu LEI Zhen WU

机构地区 Department of Psychological and Cognitive Sciences Department of Artificial Intelligence

出处《Science China(Technological Sciences)》 2025年第8期185-199,共15页 中国科学(技术科学英文版)

基金 supported by the National Natural Science Foundation of China(Grant Nos.32271110,62441614) the Tsinghua University Initiative Scientific Research Program(Grant No.20235080047)。

关键词 Large Language Models value alignment prosocial behavior affective mechanisms

分类号 G63 [文化科学—教育学]

作者简介 Corresponding author: Zhen WU,email:zhen-wu@tsinghua.edu.cn。

引文网络
相关文献

参考文献1

1袁航,罗思阳.研究社会文化变迁的新视角——表征相似性分析:以老年人心理健康为例[J].心理学报,2024,56(7):938-953. 被引量：4

二级参考文献9

1辛自强,池丽萍.横断历史研究：以元分析考察社会变迁中的心理发展[J].华东师范大学学报（教育科学版）,2008,26(2):44-51. 被引量：101
2刘子曦.宗教信仰的代际传递:基于台湾地区的数据分析[J].社会学研究,2017(1):193-216. 被引量：14
3张建人,花少武,凌辉,唐忠.工作价值观代际差异的内隐实验研究[J].中国临床心理学杂志,2020,28(4):675-678. 被引量：4
4蔡华俭,黄梓航,林莉,张明杨,王潇欧,朱慧珺,谢怡萍,杨盈,杨紫嫣,敬一鸣.半个多世纪来中国人的心理与行为变化——心理学视野下的研究[J].心理科学进展,2020,28(10):1599-1618. 被引量：96
5江光荣,李丹阳,任志洪,闫玉朋,伍新春,朱旭,于丽霞,夏勉,李凤兰,韦辉,张衍,赵春晓,张琳.中国国民心理健康素养的现状与特点[J].心理学报,2021,53(2):182-198. 被引量：146
6黄丽芹,孙寅,罗思阳.个人主义文化价值观对疫情控制效果的影响及其计算心理机制[J].心理学报,2022,54(5):497-515. 被引量：8
7Qin Duan,Zhengchuan Xu,Qing Hu,Siyang Luo.Neural variability fingerprint predicts individuals,information security violation intentions[J].Fundamental Research,2022,2(2):303-310. 被引量：2
8蔡华俭,张明杨,包寒吴霜,朱慧珺,杨紫嫣,程曦,黄梓航,王梓西.心理学视野下的社会变迁研究:研究设计与分析方法[J].心理科学进展,2023,31(2):159-172. 被引量：9
9张积家,张航,冯晓慧.从“异己观”到“天下观”的民族心理变迁——基于族际通婚视角的元民族志分析[J].华南师范大学学报（社会科学版）,2023(2):63-83. 被引量：7

共引文献3

1白麒钰,陈尚仪,罗思阳.共情驱动的群体代际决策[J].科学通报,2025,70(8):1079-1090. 被引量：2
2Yue ZHANG,Shao LI,Xianger YUAN,Hang YUAN,Zhongyue CHE,Siyang LUO.The high-dimensional psychological profile of ChatGPT[J].Science China(Technological Sciences),2025,68(8):155-170.
3白麒钰,黄柯依,韩思嘉,陈尚仪,刘阔,张玥,李劭,罗思阳.风险感知驱动的网络亲社会行为及其对非常规突发事件发展模式的影响[J].心理科学,2025,48(4):1009-1023.

1Yao Chen,Yi-Jun Tang,Xin Li,Xiu-Min Wang.What can we do for the adolescents with polycystic ovary syndrome?[J].World Journal of Pediatrics,2024,20(12):1205-1208.
2倪子涵,覃建琴,邓艳.大学生亲子关系、自我控制与亲社会行为的关系研究[J].社会科学前沿,2025,14(2):533-539.
3Yili Zhao.Enhancing assessment and intervention for empathy deficits:the“zipper model of empathy”approach in neurodevelopmental disorders[J].Psychoradiology,2024,4(1):14-17.
4Lihan He,Tianguang Meng.How Facial Expressions of Recipients Influence Online Prosocial Behaviors?-Evidence from Big Data Analysis on Tencent Gongyi Platform[J].Journal of Social Computing,2023,4(4):337-356.
5Shinichiro Matsuguma,Miku Suzuki,Miki Kanamaru,Hitomi Tsuchiya,Masato Kawamoto,Masaya Kobayashi.Redefining Snacking as a Piece of Daily Happiness:A Randomized Controlled Trial of Engagement in Oyatsu Activities for Improving Well-Being[J].International Journal of Mental Health Promotion,2024,26(12):967-975.
6Agar Marín-Morales,SofiaAmaoui,Carmen Fernández-Fillol,Gustavo Carlo,Sandra Rivas-García.What Factors Predict Prosocial Behavior during Social Crisis?A Cross-Sectional Study during the COVID-19 Pandemic in Spain[J].International Journal of Mental Health Promotion,2025,27(4):561-576.
7Lin Yiheng,Zhang Junfei,Wei Shan,Fan Huimei.Exploration of innovative cryptographic application solutions under multimodal big data fusion[J].Advances in Engineering Innovation,2024,10(5):30-35.
8Yongqiang Sun,Ruhao Zhao,Hao Qu,Juntao Li.The Effect of Interventional Nursing on Treatment Outcome,Negative Emotions and Quality of Life of Patients Undergoing Cardiovascular Interventions[J].Journal of Clinical and Nursing Research,2025,9(7):165-172.
9Zhenghuan Song,Qinyu Bao,Jiaqin Cai,Tingting Bao,Zhu Yu,Yihu Zhou,Mengling Huwang,Miao Zhou,Jing Tan.Ethical challenges in research faced by master’s students in anesthesiology[J].Progress in Medical Education,2025,1(1):49-54.
10Baozhou Lu,Tailai Xu,Weiguo Fan.How do emotions affect giving?Examining the effects of textual and facial emotions in charitable crowdfunding[J].Financial Innovation,2024,10(1):1223-1266.

Science China(Technological Sciences)

2025年第8期

浏览历史

内容加载中请稍等...

Prosocial behavior in Large Language Models:Value alignment and afective mechanisms

参考文献1

二级参考文献9

共引文献3

相关作者

相关机构

相关主题

浏览历史