Picture for Charles O'Neill

Charles O'Neill

UniverseTBD

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

Add code
Nov 20, 2024
Viaarxiv icon

Disentangling Dense Embeddings with Sparse Autoencoders

Add code
Aug 05, 2024
Figure 1 for Disentangling Dense Embeddings with Sparse Autoencoders
Figure 2 for Disentangling Dense Embeddings with Sparse Autoencoders
Figure 3 for Disentangling Dense Embeddings with Sparse Autoencoders
Figure 4 for Disentangling Dense Embeddings with Sparse Autoencoders
Viaarxiv icon

pathfinder: A Semantic Framework for Literature Review and Knowledge Discovery in Astronomy

Add code
Aug 02, 2024
Viaarxiv icon

Designing an Evaluation Framework for Large Language Models in Astronomy Research

Add code
May 30, 2024
Viaarxiv icon

Sparse Autoencoders Enable Scalable and Reliable Circuit Identification in Language Models

Add code
May 21, 2024
Viaarxiv icon

Measuring Sharpness in Grokking

Add code
Feb 14, 2024
Viaarxiv icon

Grokking Beyond Neural Networks: An Empirical Exploration with Model Complexity

Add code
Oct 26, 2023
Viaarxiv icon

Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content

Add code
Aug 26, 2023
Viaarxiv icon

Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation

Add code
Aug 17, 2023
Viaarxiv icon

Rice paddy disease classifications using CNNs

Add code
Mar 15, 2023
Viaarxiv icon