Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiayu Lin

SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

Apr 14, 2025

Xinnong Zhang, Jiayu Lin, Xinyi Mou, Shiyue Yang, Xiawei Liu, Libo Sun, Hanjia Lyu, Yihang Yang, Weihong Qi, Yue Chen(+11 more)

Abstract:Social simulation is transforming traditional social science research by modeling human behavior through interactions between virtual individuals and their environments. With recent advances in large language models (LLMs), this approach has shown growing potential in capturing individual differences and predicting group behaviors. However, existing methods face alignment challenges related to the environment, target users, interaction mechanisms, and behavioral patterns. To this end, we introduce SocioVerse, an LLM-agent-driven world model for social simulation. Our framework features four powerful alignment components and a user pool of 10 million real individuals. To validate its effectiveness, we conducted large-scale simulation experiments across three distinct domains: politics, news, and economics. Results demonstrate that SocioVerse can reflect large-scale population dynamics while ensuring diversity, credibility, and representativeness through standardized procedures and minimal manual adjustments.

* work in progress

Via

Access Paper or Ask Questions

From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Dec 04, 2024

Xinyi Mou, Xuanwen Ding, Qi He, Liang Wang, Jingcong Liang, Xinnong Zhang, Libo Sun, Jiayu Lin, Jie Zhou, Xuanjing Huang(+1 more)

Figure 1 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Figure 2 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Figure 3 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Figure 4 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Abstract:Traditional sociological research often relies on human participation, which, though effective, is expensive, challenging to scale, and with ethical concerns. Recent advancements in large language models (LLMs) highlight their potential to simulate human behavior, enabling the replication of individual responses and facilitating studies on many interdisciplinary studies. In this paper, we conduct a comprehensive survey of this field, illustrating the recent progress in simulation driven by LLM-empowered agents. We categorize the simulations into three types: (1) Individual Simulation, which mimics specific individuals or demographic groups; (2) Scenario Simulation, where multiple agents collaborate to achieve goals within specific contexts; and (3) Society Simulation, which models interactions within agent societies to reflect the complexity and variety of real-world dynamics. These simulations follow a progression, ranging from detailed individual modeling to large-scale societal phenomena. We provide a detailed discussion of each simulation type, including the architecture or key components of the simulation, the classification of objectives or scenarios and the evaluation method. Afterward, we summarize commonly used datasets and benchmarks. Finally, we discuss the trends across these three types of simulation. A repository for the related sources is at {\url{https://github.com/FudanDISC/SocialAgent}}.

Via

Access Paper or Ask Questions

ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Oct 28, 2024

Xinnong Zhang, Jiayu Lin, Libo Sun, Weihong Qi, Yihang Yang, Yue Chen, Hanjia Lyu, Xinyi Mou, Siming Chen, Jiebo Luo(+3 more)

Figure 1 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Figure 2 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Figure 3 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Figure 4 for ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents

Abstract:The massive population election simulation aims to model the preferences of specific groups in particular election scenarios. It has garnered significant attention for its potential to forecast real-world social trends. Traditional agent-based modeling (ABM) methods are constrained by their ability to incorporate complex individual background information and provide interactive prediction results. In this paper, we introduce ElectionSim, an innovative election simulation framework based on large language models, designed to support accurate voter simulations and customized distributions, together with an interactive platform to dialogue with simulated voters. We present a million-level voter pool sampled from social media platforms to support accurate individual simulation. We also introduce PPE, a poll-based presidential election benchmark to assess the performance of our framework under the U.S. presidential election scenario. Through extensive experiments and analyses, we demonstrate the effectiveness and robustness of our framework in U.S. presidential election simulations.

* 41 pages, 13 figures

Via

Access Paper or Ask Questions

AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Oct 25, 2024

Xinyi Mou, Jingcong Liang, Jiayu Lin, Xinnong Zhang, Xiawei Liu, Shiyue Yang, Rong Ye, Lei Chen, Haoyu Kuang, Xuanjing Huang(+1 more)

Figure 1 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Figure 2 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Figure 3 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Figure 4 for AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios

Abstract:Large language models (LLMs) are increasingly leveraged to empower autonomous agents to simulate human beings in various fields of behavioral research. However, evaluating their capacity to navigate complex social interactions remains a challenge. Previous studies face limitations due to insufficient scenario diversity, complexity, and a single-perspective focus. To this end, we introduce AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios. Drawing on Dramaturgical Theory, AgentSense employs a bottom-up approach to create 1,225 diverse social scenarios constructed from extensive scripts. We evaluate LLM-driven agents through multi-turn interactions, emphasizing both goal completion and implicit reasoning. We analyze goals using ERG theory and conduct comprehensive experiments. Our findings highlight that LLMs struggle with goals in complex social scenarios, especially high-level growth needs, and even GPT-4o requires improvement in private information reasoning.

Via

Access Paper or Ask Questions

A Comparative Study of Pre-training and Self-training

Sep 04, 2024

Yiheng Wang, Jiayu Lin, Zuoquan Lin

Figure 1 for A Comparative Study of Pre-training and Self-training

Figure 2 for A Comparative Study of Pre-training and Self-training

Figure 3 for A Comparative Study of Pre-training and Self-training

Figure 4 for A Comparative Study of Pre-training and Self-training

Abstract:Pre-training and self-training are two approaches to semi-supervised learning. The comparison between pre-training and self-training has been explored. However, the previous works led to confusing findings: self-training outperforms pre-training experienced on some tasks in computer vision, and contrarily, pre-training outperforms self-training experienced on some tasks in natural language processing, under certain conditions of incomparable settings. We propose, comparatively and exhaustively, an ensemble method to empirical study all feasible training paradigms combining pre-training, self-training, and fine-tuning within consistent foundational settings comparable to data augmentation. We conduct experiments on six datasets, four data augmentation, and imbalanced data for sentiment analysis and natural language inference tasks. Our findings confirm that the pre-training and fine-tuning paradigm yields the best overall performances. Moreover, self-training offers no additional benefits when combined with semi-supervised pre-training.

* 19 pages, 2 figures, 9 tables

Via

Access Paper or Ask Questions

Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Jul 24, 2024

Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao(+19 more)

Figure 1 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Figure 2 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Figure 3 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Figure 4 for Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

Abstract:In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct dataset and baseline model respectively. In total, 32 competing teams register for the challenge, from which we received 11 successful submissions. In this paper, we will present the results of the challenge and a summary of the systems, highlighting commonalities and innovations among participating systems. Datasets and baseline models of the AI-Debater 2023 Challenge have been already released and can be accessed through the official website of the challenge.

Via

Access Paper or Ask Questions

Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

Dec 21, 2023

Jiayu Lin, Rong Ye, Meng Han, Qi Zhang, Ruofei Lai, Xinyu Zhang, Zhao Cao, Xuanjing Huang, Zhongyu Wei

Figure 1 for Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

Figure 2 for Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

Figure 3 for Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

Figure 4 for Argue with Me Tersely: Towards Sentence-Level Counter-Argument Generation

Abstract:Counter-argument generation -- a captivating area in computational linguistics -- seeks to craft statements that offer opposing views. While most research has ventured into paragraph-level generation, sentence-level counter-argument generation beckons with its unique constraints and brevity-focused challenges. Furthermore, the diverse nature of counter-arguments poses challenges for evaluating model performance solely based on n-gram-based metrics. In this paper, we present the ArgTersely benchmark for sentence-level counter-argument generation, drawing from a manually annotated dataset from the ChangeMyView debate forum. We also propose Arg-LlaMA for generating high-quality counter-argument. For better evaluation, we trained a BERT-based evaluator Arg-Judge with human preference data. We conducted comparative experiments involving various baselines such as LlaMA, Alpaca, GPT-3, and others. The results show the competitiveness of our proposed framework and evaluator in counter-argument generation tasks. Code and data are available at https://github.com/amazingljy1206/ArgTersely.

* EMNLP2023, main conference

Via

Access Paper or Ask Questions

Query-Efficient Adversarial Attack Based on Latin Hypercube Sampling

Jul 05, 2022

Dan Wang, Jiayu Lin, Yuan-Gen Wang

Figure 1 for Query-Efficient Adversarial Attack Based on Latin Hypercube Sampling

Figure 2 for Query-Efficient Adversarial Attack Based on Latin Hypercube Sampling

Figure 3 for Query-Efficient Adversarial Attack Based on Latin Hypercube Sampling

Figure 4 for Query-Efficient Adversarial Attack Based on Latin Hypercube Sampling

Abstract:In order to be applicable in real-world scenario, Boundary Attacks (BAs) were proposed and ensured one hundred percent attack success rate with only decision information. However, existing BA methods craft adversarial examples by leveraging a simple random sampling (SRS) to estimate the gradient, consuming a large number of model queries. To overcome the drawback of SRS, this paper proposes a Latin Hypercube Sampling based Boundary Attack (LHS-BA) to save query budget. Compared with SRS, LHS has better uniformity under the same limited number of random samples. Therefore, the average on these random samples is closer to the true gradient than that estimated by SRS. Various experiments are conducted on benchmark datasets including MNIST, CIFAR, and ImageNet-1K. Experimental results demonstrate the superiority of the proposed LHS-BA over the state-of-the-art BA methods in terms of query efficiency. The source codes are publicly available at https://github.com/GZHU-DVL/LHS-BA.

Via

Access Paper or Ask Questions

Multi-feature Co-learning for Image Inpainting

May 21, 2022

Jiayu Lin, Yuan-Gen Wang, Wenzhi Tang, Aifeng Li

Figure 1 for Multi-feature Co-learning for Image Inpainting

Figure 2 for Multi-feature Co-learning for Image Inpainting

Figure 3 for Multi-feature Co-learning for Image Inpainting

Figure 4 for Multi-feature Co-learning for Image Inpainting

Abstract:Image inpainting has achieved great advances by simultaneously leveraging image structure and texture features. However, due to lack of effective multi-feature fusion techniques, existing image inpainting methods still show limited improvement. In this paper, we design a deep multi-feature co-learning network for image inpainting, which includes Soft-gating Dual Feature Fusion (SDFF) and Bilateral Propagation Feature Aggregation (BPFA) modules. To be specific, we first use two branches to learn structure features and texture features separately. Then the proposed SDFF module integrates structure features into texture features, and meanwhile uses texture features as an auxiliary in generating structure features. Such a co-learning strategy makes the structure and texture features more consistent. Next, the proposed BPFA module enhances the connection from local feature to overall consistency by co-learning contextual attention, channel-wise information and feature space, which can further refine the generated structures and textures. Finally, extensive experiments are performed on benchmark datasets, including CelebA, Places2, and Paris StreetView. Experimental results demonstrate the superiority of the proposed method over the state-of-the-art. The source codes are available at https://github.com/GZHU-DVL/MFCL-Inpainting.

Via

Access Paper or Ask Questions