Xuyang Ge

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Oct 27, 2024

Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures

Oct 10, 2024

TravelAgent: An AI Assistant for Personalized Travel Planning

Sep 12, 2024

Automatically Identifying Local and Global Circuits with Linear Computation Graphs

May 22, 2024

SurveyAgent: A Conversational System for Personalized and Efficient Research Survey

Apr 09, 2024

Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT

Feb 19, 2024

Distilling Script Knowledge from Large Language Models for Constrained Language Planning

May 22, 2023

Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies after Structure Abduction

May 22, 2023