Abstract: Lexical Semantic Change Detection stands out as one of the few areas where Large Language Models (LLMs) have not yet been extensively applied. Traditional methods such as PPMI and SGNS remain prevalent in research, alongside newer BERT-based approaches. Despite the comprehensive coverage of many natural language processing domains by LLMs, the literature on their application to this specific task is notably scarce. In this work, we seek to bridge that gap by introducing LLMs into the domain of Lexical Semantic Change Detection. We present novel prompting solutions and a comprehensive evaluation spanning all three generations of language models, contributing to the exploration of LLMs in this research area.
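As a rough illustration of what prompting an LLM for this task can look like, the sketch below frames semantic change detection as a usage-pair judgment: given one sentence from an older corpus and one from a newer corpus, the model is asked whether a target word keeps the same sense. The prompt wording, the binary answer format, and the OpenAI chat backend are illustrative assumptions, not the paper's actual prompting solutions.

```python
# Minimal sketch: zero-shot usage-pair prompting for semantic change detection.
# Prompt wording, answer format, and model choice are illustrative assumptions,
# not the prompts proposed in the paper.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def judge_semantic_change(word: str, sentence_old: str, sentence_new: str) -> str:
    prompt = (
        f"Sentence 1 (older corpus): {sentence_old}\n"
        f"Sentence 2 (newer corpus): {sentence_new}\n"
        f"Does the word '{word}' have the same meaning in both sentences? "
        "Answer with exactly one word: 'same' or 'different'."
    )
    response = client.chat.completions.create(
        model="gpt-4",  # any chat-capable LLM could stand in here
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # deterministic judgments for evaluation
    )
    return response.choices[0].message.content.strip().lower()

print(judge_semantic_change(
    "cell",
    "He was locked in a cell overnight.",
    "Her cell buzzed with a new message.",
))  # expected: "different"
```

Aggregating such per-pair judgments over sampled usages is one common way to turn an LLM's answers into a graded change score for a target word.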
Abstract: With the growth of large language models, which now incorporate billions of parameters, the hardware requirements for their training and deployment have increased correspondingly. Although existing tools facilitate model parallelization and distributed training, deeper model interactions, crucial for interpretability and responsible AI techniques, still demand thorough knowledge of distributed computing. This often hinders contributions from researchers with machine learning expertise but limited distributed computing background. To address this challenge, we present FlexModel, a software package providing a streamlined interface for engaging with models distributed across multi-GPU and multi-node configurations. The library is compatible with existing model distribution libraries and encapsulates PyTorch models. It exposes user-registerable HookFunctions that facilitate straightforward interaction with distributed model internals, bridging the gap between distributed and single-device model paradigms. Above all, FlexModel enhances accessibility by democratizing model interactions and promotes more inclusive research on large-scale neural networks. The package is available at https://github.com/VectorInstitute/flex_model.
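For intuition, the sketch below shows the plain single-device PyTorch forward-hook pattern that FlexModel's HookFunctions generalize to sharded, multi-rank models. The toy model and activation-capture logic are illustrative and use only standard PyTorch, not FlexModel's own API; see the repository above for the actual interface.

```python
# Minimal sketch of the single-device hook pattern that FlexModel extends to
# distributed models. Plain PyTorch only; the toy model is illustrative.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
activations = {}

def save_activation(module, inputs, output):
    # Capture the layer's output activation for offline inspection.
    activations["hidden"] = output.detach()

handle = model[0].register_forward_hook(save_activation)
model(torch.randn(8, 16))
print(activations["hidden"].shape)  # torch.Size([8, 32])
handle.remove()  # detach the hook once done
```

On a single device this pattern is trivial, but once activations are sharded across GPUs and nodes the same interaction requires gathering and redistributing tensors across ranks; FlexModel's HookFunctions preserve this familiar mental model while the library handles that distributed plumbing.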
Abstract: The core objective of machine-assisted scientific discovery is to learn physical laws from experimental data without prior knowledge of the systems in question. In quantum physics, progress towards this goal is significantly more challenging due to the curse of dimensionality as well as the counter-intuitive nature of quantum mechanics. Here, we present the QNODE, a latent neural ODE trained on dynamics from closed and open quantum systems. The QNODE can learn to generate quantum dynamics that satisfy the von Neumann and time-local Lindblad master equations for closed and open quantum systems, respectively, and can extrapolate outside of its training region. Furthermore, the QNODE rediscovers quantum mechanical laws such as Heisenberg's uncertainty principle in an entirely data-driven way, without constraints or guidance. Additionally, we show that trajectories generated by the QNODE that are close in its latent space exhibit similar quantum dynamics while preserving the physics of the training system.
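To make the architecture concrete, here is a minimal latent neural ODE sketch in the spirit of the QNODE, built on the torchdiffeq solver: an observation of the dynamics is encoded to a latent initial state, evolved under a learned vector field, and decoded back to observables. The dimensions, network sizes, and the linear encoder standing in for a sequence encoder are assumptions for illustration, not the paper's implementation.

```python
# Minimal latent neural ODE sketch in the spirit of the QNODE.
# Architecture and dimensions are illustrative assumptions, not the paper's.
import torch
import torch.nn as nn
from torchdiffeq import odeint  # pip install torchdiffeq

class LatentODEFunc(nn.Module):
    """Learned vector field dz/dt = f(z) in latent space."""
    def __init__(self, latent_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 32), nn.Tanh(),
            nn.Linear(32, latent_dim),
        )

    def forward(self, t, z):
        return self.net(z)

latent_dim, obs_dim = 4, 2                 # obs_dim: e.g. expectation values <X>, <Z>
encoder = nn.Linear(obs_dim, latent_dim)   # stand-in for a sequence encoder
decoder = nn.Linear(latent_dim, obs_dim)
func = LatentODEFunc(latent_dim)

x0 = torch.randn(1, obs_dim)               # initial observation of the dynamics
z0 = encoder(x0)                           # latent initial condition
t = torch.linspace(0.0, 5.0, 50)           # solve times; can extend past training
z_t = odeint(func, z0, t)                  # integrate the learned vector field
x_t = decoder(z_t)                         # decoded trajectory: (50, 1, obs_dim)
print(x_t.shape)
```

Training would fit the encoder, vector field, and decoder jointly so that decoded trajectories reproduce observed quantum dynamics; extrapolation then amounts to integrating the learned latent ODE past the training time window.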