Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vishwas Sathish

LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Apr 14, 2024

Vishwas Sathish, Hannah Lin, Aditya K Kamath, Anish Nyayachavadi

Figure 1 for LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Figure 2 for LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Figure 3 for LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Figure 4 for LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

Abstract:Large Language Models (LLMs) are a powerful technology that augment human skill to create new opportunities, akin to the development of steam engines and the internet. However, LLMs come with a high cost. They require significant computing resources and energy to train and serve. Inequity in their control and access has led to concentration of ownership and power to a small collection of corporations. In our study, we collect training and inference requirements for various LLMs. We then analyze the economic strengths of nations and organizations in the context of developing and serving these models. Additionally, we also look at whether individuals around the world can access and use this emerging technology. We compare and contrast these groups to show that these technologies are monopolized by a surprisingly few entities. We conclude with a qualitative study on the ethical implications of our findings and discuss future directions towards equity in LLM access.

* 11 total pages, 7 page text, 4 page references, 3 figures (with subfigures), 1 table

Via

Access Paper or Ask Questions

Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

Oct 23, 2022

Rajesh P. N. Rao, Dimitrios C. Gklezakos, Vishwas Sathish

Figure 1 for Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

Figure 2 for Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

Figure 3 for Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

Figure 4 for Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

Abstract:Predictive coding has emerged as a prominent model of how the brain learns through predictions, anticipating the importance accorded to predictive learning in recent AI architectures such as transformers. Here we propose a new framework for predictive coding called active predictive coding which can learn hierarchical world models and solve two radically different open problems in AI: (1) how do we learn compositional representations, e.g., part-whole hierarchies, for equivariant vision? and (2) how do we solve large-scale planning problems, which are hard for traditional reinforcement learning, by composing complex action sequences from primitive policies? Our approach exploits hypernetworks, self-supervised learning and reinforcement learning to learn hierarchical world models that combine task-invariant state transition networks and task-dependent policy networks at multiple abstraction levels. We demonstrate the viability of our approach on a variety of vision datasets (MNIST, FashionMNIST, Omniglot) as well as on a scalable hierarchical planning problem. Our results represent, to our knowledge, the first demonstration of a unified solution to the part-whole learning problem posed by Hinton, the nested reference frames problem posed by Hawkins, and the integrated state-action hierarchy learning problem in reinforcement learning.

* 15 pages, 10 figures, 2 supplementary figures

Via

Access Paper or Ask Questions