Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Le Li

School of Mathematics and Statistics, and Key Lab NAA--MOE, Central China Normal University

Digital Player: Evaluating Large Language Models based Human-like Agent in Games

Feb 28, 2025

Jiawei Wang, Kai Wang, Shaojie Lin, Runze Wu, Bihan Xu, Lingeng Jiang, Shiwei Zhao, Renyu Zhu, Haoyu Liu, Zhipeng Hu(+4 more)

Abstract:With the rapid advancement of Large Language Models (LLMs), LLM-based autonomous agents have shown the potential to function as digital employees, such as digital analysts, teachers, and programmers. In this paper, we develop an application-level testbed based on the open-source strategy game "Unciv", which has millions of active players, to enable researchers to build a "data flywheel" for studying human-like agents in the "digital players" task. This "Civilization"-like game features expansive decision-making spaces along with rich linguistic interactions such as diplomatic negotiations and acts of deception, posing significant challenges for LLM-based agents in terms of numerical reasoning and long-term planning. Another challenge for "digital players" is to generate human-like responses for social interaction, collaboration, and negotiation with human players. The open-source project can be found at https:/github.com/fuxiAIlab/CivAgent.

* neurips datasets and benchmarks 2024, not accepted

Via

Access Paper or Ask Questions

Batch-in-Batch: a new adversarial training framework for initial perturbation and sample selection

Jun 06, 2024

Yinting Wu, Pai Peng, Bo Cai, Le Li, .

Abstract:Adversarial training methods commonly generate independent initial perturbation for adversarial samples from a simple uniform distribution, and obtain the training batch for the classifier without selection. In this work, we propose a simple yet effective training framework called Batch-in-Batch (BB) to enhance models robustness. It involves specifically a joint construction of initial values that could simultaneously generates $m$ sets of perturbations from the original batch set to provide more diversity for adversarial samples; and also includes various sample selection strategies that enable the trained models to have smoother losses and avoid overconfident outputs. Through extensive experiments on three benchmark datasets (CIFAR-10, SVHN, CIFAR-100) with two networks (PreActResNet18 and WideResNet28-10) that are used in both the single-step (Noise-Fast Gradient Sign Method, N-FGSM) and multi-step (Projected Gradient Descent, PGD-10) adversarial training, we show that models trained within the BB framework consistently have higher adversarial accuracy across various adversarial settings, notably achieving over a 13% improvement on the SVHN dataset with an attack radius of 8/255 compared to the N-FGSM baseline model. Furthermore, experimental analysis of the efficiency of both the proposed initial perturbation method and sample selection strategies validates our insights. Finally, we show that our framework is cost-effective in terms of computational resources, even with a relatively large value of $m$.

* 29 pages, 11 figures

Via

Access Paper or Ask Questions

Reprint: a randomized extrapolation based on principal components for data augmentation

Apr 26, 2022

Jiale Wei, Qiyuan Chen, Pai Peng, Benjamin Guedj, Le Li

Figure 1 for Reprint: a randomized extrapolation based on principal components for data augmentation

Figure 2 for Reprint: a randomized extrapolation based on principal components for data augmentation

Figure 3 for Reprint: a randomized extrapolation based on principal components for data augmentation

Figure 4 for Reprint: a randomized extrapolation based on principal components for data augmentation

Abstract:Data scarcity and data imbalance have attracted a lot of attention in many fields. Data augmentation, explored as an effective approach to tackle them, can improve the robustness and efficiency of classification models by generating new samples. This paper presents REPRINT, a simple and effective hidden-space data augmentation method for imbalanced data classification. Given hidden-space representations of samples in each class, REPRINT extrapolates, in a randomized fashion, augmented examples for target class by using subspaces spanned by principal components to summarize distribution structure of both source and target class. Consequently, the examples generated would diversify the target while maintaining the original geometry of target distribution. Besides, this method involves a label refinement component which allows to synthesize new soft labels for augmented examples. Compared with different NLP data augmentation approaches under a range of data imbalanced scenarios on four text classification benchmark, REPRINT shows prominent improvements. Moreover, through comprehensive ablation studies, we show that label refinement is better than label-preserving for augmented examples, and that our method suggests stable and consistent improvements in terms of suitable choices of principal components. Moreover, REPRINT is appealing for its easy-to-use since it contains only one hyperparameter determining the dimension of subspace and requires low computational resource.

Via

Access Paper or Ask Questions

Youling: an AI-Assisted Lyrics Creation System

Jan 18, 2022

Rongsheng Zhang, Xiaoxi Mao, Le Li, Lin Jiang, Lin Chen, Zhiwei Hu, Yadong Xi, Changjie Fan, Minlie Huang

Figure 1 for Youling: an AI-Assisted Lyrics Creation System

Figure 2 for Youling: an AI-Assisted Lyrics Creation System

Figure 3 for Youling: an AI-Assisted Lyrics Creation System

Figure 4 for Youling: an AI-Assisted Lyrics Creation System

Abstract:Recently, a variety of neural models have been proposed for lyrics generation. However, most previous work completes the generation process in a single pass with little human intervention. We believe that lyrics creation is a creative process with human intelligence centered. AI should play a role as an assistant in the lyrics creation process, where human interactions are crucial for high-quality creation. This paper demonstrates \textit{Youling}, an AI-assisted lyrics creation system, designed to collaborate with music creators. In the lyrics generation process, \textit{Youling} supports traditional one pass full-text generation mode as well as an interactive generation mode, which allows users to select the satisfactory sentences from generated candidates conditioned on preceding context. The system also provides a revision module which enables users to revise undesired sentences or words of lyrics repeatedly. Besides, \textit{Youling} allows users to use multifaceted attributes to control the content and format of generated lyrics. The demo video of the system is available at https://youtu.be/DFeNpHk0pm4.

* accept by emnlp2020 demo track

Via

Access Paper or Ask Questions

Joint Face Completion and Super-resolution using Multi-scale Feature Relation Learning

Feb 29, 2020

Zhilei Liu, Yunpeng Wu, Le Li, Cuicui Zhang, Baoyuan Wu

Figure 1 for Joint Face Completion and Super-resolution using Multi-scale Feature Relation Learning

Figure 2 for Joint Face Completion and Super-resolution using Multi-scale Feature Relation Learning

Figure 3 for Joint Face Completion and Super-resolution using Multi-scale Feature Relation Learning

Figure 4 for Joint Face Completion and Super-resolution using Multi-scale Feature Relation Learning

Abstract:Previous research on face restoration often focused on repairing a specific type of low-quality facial images such as low-resolution (LR) or occluded facial images. However, in the real world, both the above-mentioned forms of image degradation often coexist. Therefore, it is important to design a model that can repair LR occluded images simultaneously. This paper proposes a multi-scale feature graph generative adversarial network (MFG-GAN) to implement the face restoration of images in which both degradation modes coexist, and also to repair images with a single type of degradation. Based on the GAN, the MFG-GAN integrates the graph convolution and feature pyramid network to restore occluded low-resolution face images to non-occluded high-resolution face images. The MFG-GAN uses a set of customized losses to ensure that high-quality images are generated. In addition, we designed the network in an end-to-end format. Experimental results on the public-domain CelebA and Helen databases show that the proposed approach outperforms state-of-the-art methods in performing face super-resolution (up to 4x or 8x) and face completion simultaneously. Cross-database testing also revealed that the proposed approach has good generalizability.

Via

Access Paper or Ask Questions

Controllable Descendant Face Synthesis

Feb 26, 2020

Yong Zhang, Le Li, Zhilei Liu, Baoyuan Wu, Yanbo Fan, Zhifeng Li

Figure 1 for Controllable Descendant Face Synthesis

Figure 2 for Controllable Descendant Face Synthesis

Figure 3 for Controllable Descendant Face Synthesis

Figure 4 for Controllable Descendant Face Synthesis

Abstract:Kinship face synthesis is an interesting topic raised to answer questions like "what will your future children look like?". Published approaches to this topic are limited. Most of the existing methods train models for one-versus-one kin relation, which only consider one parent face and one child face by directly using an auto-encoder without any explicit control over the resemblance of the synthesized face to the parent face. In this paper, we propose a novel method for controllable descendant face synthesis, which models two-versus-one kin relation between two parent faces and one child face. Our model consists of an inheritance module and an attribute enhancement module, where the former is designed for accurate control over the resemblance between the synthesized face and parent faces, and the latter is designed for control over age and gender. As there is no large scale database with father-mother-child kinship annotation, we propose an effective strategy to train the model without using the ground truth descendant faces. No carefully designed image pairs are required for learning except only age and gender labels of training faces. We conduct comprehensive experimental evaluations on three public benchmark databases, which demonstrates encouraging results.

* 10 pages, 9 figures

Via

Access Paper or Ask Questions

Facial Expression Restoration Based on Improved Graph Convolutional Networks

Oct 23, 2019

Zhilei Liu, Le Li, Yunpeng Wu, Cuicui Zhang

Figure 1 for Facial Expression Restoration Based on Improved Graph Convolutional Networks

Figure 2 for Facial Expression Restoration Based on Improved Graph Convolutional Networks

Figure 3 for Facial Expression Restoration Based on Improved Graph Convolutional Networks

Figure 4 for Facial Expression Restoration Based on Improved Graph Convolutional Networks

Abstract:Facial expression analysis in the wild is challenging when the facial image is with low resolution or partial occlusion. Considering the correlations among different facial local regions under different facial expressions, this paper proposes a novel facial expression restoration method based on generative adversarial network by integrating an improved graph convolutional network (IGCN) and region relation modeling block (RRMB). Unlike conventional graph convolutional networks taking vectors as input features, IGCN can use tensors of face patches as inputs. It is better to retain the structure information of face patches. The proposed RRMB is designed to address facial generative tasks including inpainting and super-resolution with facial action units detection, which aims to restore facial expression as the ground-truth. Extensive experiments conducted on BP4D and DISFA benchmarks demonstrate the effectiveness of our proposed method through quantitative and qualitative evaluations.

* Accepted by MMM2020

Via

Access Paper or Ask Questions

A Quasi-Bayesian Perspective to Online Clustering

May 25, 2018

Le Li, Benjamin Guedj, Sébastien Loustau

Figure 1 for A Quasi-Bayesian Perspective to Online Clustering

Figure 2 for A Quasi-Bayesian Perspective to Online Clustering

Figure 3 for A Quasi-Bayesian Perspective to Online Clustering

Figure 4 for A Quasi-Bayesian Perspective to Online Clustering

Abstract:When faced with high frequency streams of data, clustering raises theoretical and algorithmic pitfalls. We introduce a new and adaptive online clustering algorithm relying on a quasi-Bayesian approach, with a dynamic (i.e., time-dependent) estimation of the (unknown and changing) number of clusters. We prove that our approach is supported by minimax regret bounds. We also provide an RJMCMC-flavored implementation (called PACBO, see https://cran.r-project.org/web/packages/PACBO/index.html) for which we give a convergence guarantee. Finally, numerical experiments illustrate the potential of our procedure.

* Electronic Journal of Statistics (2018), vol. 12(2), 3071--3113

Via

Access Paper or Ask Questions

Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

May 18, 2018

Benjamin Guedj, Le Li

Figure 1 for Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

Figure 2 for Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

Figure 3 for Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

Figure 4 for Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

Abstract:When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls. Principal curves act as a nonlinear generalization of PCA and the present paper proposes a novel algorithm to automatically and sequentially learn principal curves from data streams. We show that our procedure is supported by regret bounds with optimal sublinear remainder terms. A greedy local search implementation that incorporates both sleeping experts and multi-armed bandit ingredients is presented, along with its regret bound and performance on a toy example and seismic data.

Via

Access Paper or Ask Questions

Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

May 09, 2014

Le Li, Jianjun Yang, Kaili Zhao, Yang Xu, Honggang Zhang, Zhuoyi Fan

Figure 1 for Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Figure 2 for Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Figure 3 for Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Figure 4 for Graph Regularized Non-negative Matrix Factorization By Maximizing Correntropy

Abstract:Non-negative matrix factorization (NMF) has proved effective in many clustering and classification tasks. The classic ways to measure the errors between the original and the reconstructed matrix are $l_2$ distance or Kullback-Leibler (KL) divergence. However, nonlinear cases are not properly handled when we use these error measures. As a consequence, alternative measures based on nonlinear kernels, such as correntropy, are proposed. However, the current correntropy-based NMF only targets on the low-level features without considering the intrinsic geometrical distribution of data. In this paper, we propose a new NMF algorithm that preserves local invariance by adding graph regularization into the process of max-correntropy-based matrix factorization. Meanwhile, each feature can learn corresponding kernel from the data. The experiment results of Caltech101 and Caltech256 show the benefits of such combination against other NMF algorithms for the unsupervised image clustering.

Via

Access Paper or Ask Questions