Maxwell Lin

Technical Design Review of Duke Robotics Club's Oogway: An AUV for RoboSub 2024

Oct 13, 2024
Via arXiv

Oogway: Designing, Implementing, and Testing an AUV for RoboSub 2023

Oct 13, 2024

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Oct 11, 2024

Tamper-Resistant Safeguards for Open-Weight LLMs

Aug 01, 2024

Improving Alignment and Robustness with Circuit Breakers

Jun 10, 2024

Improving Alignment and Robustness with Short Circuiting

Jun 06, 2024

Teaching Large Language Models to Self-Debug

Apr 11, 2023