Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lewis D Griffin

Transcript of GPT-4 playing a rogue AGI in a Matrix Game

May 16, 2024

Lewis D Griffin, Nicholas Riggs

Abstract:Matrix Games are a type of unconstrained wargame used by planners to explore scenarios. Players propose actions, and give arguments and counterarguments for their success. An umpire, assisted by dice rolls modified according to the offered arguments, adjudicates the outcome of each action. A recent online play of the Matrix Game QuAI Sera Sera had six players, representing social, national and economic powers, and one player representing ADA, a recently escaped AGI. Unknown to the six human players, ADA was played by OpenAI's GPT-4 with a human operator serving as bidirectional interface between it and the game. GPT-4 demonstrated confident and competent game play; initiating and responding to private communications with other players and choosing interesting actions well supported by argument. We reproduce the transcript of the interaction with GPT-4 as it is briefed, plays, and debriefed.

* 18 pages, 0 figures

Via

Access Paper or Ask Questions

Susceptibility to Influence of Large Language Models

Mar 10, 2023

Lewis D Griffin, Bennett Kleinberg, Maximilian Mozes, Kimberly T Mai, Maria Vau, Matthew Caldwell, Augustine Marvor-Parker

Abstract:Two studies tested the hypothesis that a Large Language Model (LLM) can be used to model psychological change following exposure to influential input. The first study tested a generic mode of influence - the Illusory Truth Effect (ITE) - where earlier exposure to a statement (through, for example, rating its interest) boosts a later truthfulness test rating. Data was collected from 1000 human participants using an online experiment, and 1000 simulated participants using engineered prompts and LLM completion. 64 ratings per participant were collected, using all exposure-test combinations of the attributes: truth, interest, sentiment and importance. The results for human participants reconfirmed the ITE, and demonstrated an absence of effect for attributes other than truth, and when the same attribute is used for exposure and test. The same pattern of effects was found for LLM-simulated participants. The second study concerns a specific mode of influence - populist framing of news to increase its persuasion and political mobilization. Data from LLM-simulated participants was collected and compared to previously published data from a 15-country experiment on 7286 human participants. Several effects previously demonstrated from the human study were replicated by the simulated study, including effects that surprised the authors of the human study by contradicting their theoretical expectations (anti-immigrant framing of news decreases its persuasion and mobilization); but some significant relationships found in human data (modulation of the effectiveness of populist framing according to relative deprivation of the participant) were not present in the LLM data. Together the two studies support the view that LLMs have potential to act as models of the effect of influence.

* 24 pages, 6 figures, 7 tables, 53 references

Via

Access Paper or Ask Questions

A New Angle on L2 Regularization

Jun 28, 2018

Thomas Tanay, Lewis D Griffin

Abstract:Imagine two high-dimensional clusters and a hyperplane separating them. Consider in particular the angle between: the direction joining the two clusters' centroids and the normal to the hyperplane. In linear classification, this angle depends on the level of L2 regularization used. Can you explain why?

* An explorable explanation on the phenomenon of adversarial examples in linear classification and its relation to L2 regularization. Interactive version available at: https://thomas-tanay.github.io/post--L2-regularization/

Via

Access Paper or Ask Questions

Better Image Segmentation by Exploiting Dense Semantic Predictions

Jun 05, 2016

Qiyang Zhao, Lewis D Griffin

Figure 1 for Better Image Segmentation by Exploiting Dense Semantic Predictions

Figure 2 for Better Image Segmentation by Exploiting Dense Semantic Predictions

Figure 3 for Better Image Segmentation by Exploiting Dense Semantic Predictions

Figure 4 for Better Image Segmentation by Exploiting Dense Semantic Predictions

Abstract:It is well accepted that image segmentation can benefit from utilizing multilevel cues. The paper focuses on utilizing the FCNN-based dense semantic predictions in the bottom-up image segmentation, arguing to take semantic cues into account from the very beginning. By this we can avoid merging regions of similar appearance but distinct semantic categories as possible. The semantic inefficiency problem is handled. We also propose a straightforward way to use the contour cues to suppress the noise in multilevel cues, thus to improve the segmentation robustness. The evaluation on the BSDS500 shows that we obtain the competitive region and boundary performance. Furthermore, since all individual regions can be assigned with appropriate semantic labels during the computation, we are capable of extracting the adjusted semantic segmentations. The experiment on Pascal VOC 2012 shows our improvement to the original semantic segmentations which derives directly from the dense predictions.

Via

Access Paper or Ask Questions

Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions

Mar 16, 2016

Qiyang Zhao, Lewis D Griffin

Figure 1 for Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions

Figure 2 for Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions

Figure 3 for Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions

Figure 4 for Suppressing the Unusual: towards Robust CNNs using Symmetric Activation Functions

Abstract:Many deep Convolutional Neural Networks (CNN) make incorrect predictions on adversarial samples obtained by imperceptible perturbations of clean samples. We hypothesize that this is caused by a failure to suppress unusual signals within network layers. As remedy we propose the use of Symmetric Activation Functions (SAF) in non-linear signal transducer units. These units suppress signals of exceptional magnitude. We prove that SAF networks can perform classification tasks to arbitrary precision in a simplified situation. In practice, rather than use SAFs alone, we add them into CNNs to improve their robustness. The modified CNNs can be easily trained using popular strategies with the moderate training load. Our experiments on MNIST and CIFAR-10 show that the modified CNNs perform similarly to plain ones on clean samples, and are remarkably more robust against adversarial and nonsense samples.

* 11 pages, 12 figures

Via

Access Paper or Ask Questions