Picture for Tsui-Wei Weng

Tsui-Wei Weng

Concept Bottleneck Large Language Models

Add code
Dec 11, 2024
Viaarxiv icon

Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification

Add code
Nov 01, 2024
Figure 1 for Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification
Figure 2 for Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification
Figure 3 for Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification
Figure 4 for Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification
Viaarxiv icon

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Add code
Oct 24, 2024
Figure 1 for Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Figure 2 for Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Figure 3 for Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Figure 4 for Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Viaarxiv icon

VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance

Add code
Jul 18, 2024
Figure 1 for VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Figure 2 for VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Figure 3 for VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Figure 4 for VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance
Viaarxiv icon

Crafting Large Language Models for Enhanced Interpretability

Add code
Jul 05, 2024
Viaarxiv icon

AND: Audio Network Dissection for Interpreting Deep Acoustic Models

Add code
Jun 26, 2024
Viaarxiv icon

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Add code
Jun 26, 2024
Figure 1 for Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Figure 2 for Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Figure 3 for Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Figure 4 for Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Viaarxiv icon

Linear Explanations for Individual Neurons

Add code
May 10, 2024
Viaarxiv icon

Provably Robust Conformal Prediction with Improved Efficiency

Add code
Apr 30, 2024
Figure 1 for Provably Robust Conformal Prediction with Improved Efficiency
Figure 2 for Provably Robust Conformal Prediction with Improved Efficiency
Figure 3 for Provably Robust Conformal Prediction with Improved Efficiency
Figure 4 for Provably Robust Conformal Prediction with Improved Efficiency
Viaarxiv icon

Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models

Add code
Mar 20, 2024
Figure 1 for Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models
Figure 2 for Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models
Figure 3 for Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models
Figure 4 for Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models
Viaarxiv icon