
Xudong Han

Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring

Oct 28, 2024

ToolGen: Unified Tool Retrieval and Calling via Generation

Oct 04, 2024

Loki: An Open-Source Tool for Fact Verification

Oct 02, 2024

On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding

Jul 27, 2024

ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking

May 24, 2024

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

Mar 31, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Feb 19, 2024

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents

Feb 18, 2024

Understanding the Instruction Mixture for Large Language Model Fine-tuning

Dec 19, 2023

Proprioceptive State Estimation for Amphibious Tactile Sensing

Dec 15, 2023