Picture for Yan Sun

Yan Sun

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Add code
Feb 22, 2025
Figure 1 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 2 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 3 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Figure 4 for A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models
Viaarxiv icon

TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs

Add code
Jan 31, 2025
Figure 1 for TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs
Figure 2 for TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs
Figure 3 for TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs
Figure 4 for TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs
Viaarxiv icon

Trustworthy Evaluation of Generative AI Models

Add code
Jan 31, 2025
Viaarxiv icon

MolGraph-xLSTM: A graph-based dual-level xLSTM framework with multi-head mixture-of-experts for enhanced molecular representation and interpretability

Add code
Jan 30, 2025
Viaarxiv icon

Unsupervised Domain Adaptation with Dynamic Clustering and Contrastive Refinement for Gait Recognition

Add code
Jan 28, 2025
Viaarxiv icon

FGATT: A Robust Framework for Wireless Data Imputation Using Fuzzy Graph Attention Networks and Transformer Encoders

Add code
Dec 02, 2024
Viaarxiv icon

Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective

Add code
Nov 22, 2024
Figure 1 for Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective
Figure 2 for Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective
Figure 3 for Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective
Figure 4 for Comparative Analysis of Pooling Mechanisms in LLMs: A Sentiment Analysis Perspective
Viaarxiv icon

A Unified Analysis for Finite Weight Averaging

Add code
Nov 20, 2024
Figure 1 for A Unified Analysis for Finite Weight Averaging
Figure 2 for A Unified Analysis for Finite Weight Averaging
Figure 3 for A Unified Analysis for Finite Weight Averaging
Figure 4 for A Unified Analysis for Finite Weight Averaging
Viaarxiv icon

Stability and Generalization for Distributed SGDA

Add code
Nov 14, 2024
Figure 1 for Stability and Generalization for Distributed SGDA
Figure 2 for Stability and Generalization for Distributed SGDA
Figure 3 for Stability and Generalization for Distributed SGDA
Figure 4 for Stability and Generalization for Distributed SGDA
Viaarxiv icon

Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

Add code
Nov 01, 2024
Viaarxiv icon