Picture for Xudong Han

Xudong Han

SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning

Add code
Feb 19, 2025
Viaarxiv icon

RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises

Add code
Feb 18, 2025
Viaarxiv icon

Enhancing Fetal Plane Classification Accuracy with Data Augmentation Using Diffusion Models

Add code
Jan 25, 2025
Viaarxiv icon

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Add code
Dec 24, 2024
Figure 1 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 2 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 3 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 4 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Viaarxiv icon

Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring

Add code
Oct 28, 2024
Figure 1 for Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Figure 2 for Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Figure 3 for Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Figure 4 for Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
Viaarxiv icon

ToolGen: Unified Tool Retrieval and Calling via Generation

Add code
Oct 04, 2024
Figure 1 for ToolGen: Unified Tool Retrieval and Calling via Generation
Figure 2 for ToolGen: Unified Tool Retrieval and Calling via Generation
Figure 3 for ToolGen: Unified Tool Retrieval and Calling via Generation
Figure 4 for ToolGen: Unified Tool Retrieval and Calling via Generation
Viaarxiv icon

Loki: An Open-Source Tool for Fact Verification

Add code
Oct 02, 2024
Figure 1 for Loki: An Open-Source Tool for Fact Verification
Figure 2 for Loki: An Open-Source Tool for Fact Verification
Figure 3 for Loki: An Open-Source Tool for Fact Verification
Figure 4 for Loki: An Open-Source Tool for Fact Verification
Viaarxiv icon

On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding

Add code
Jul 27, 2024
Figure 1 for On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding
Figure 2 for On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding
Figure 3 for On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding
Figure 4 for On Flange-based 3D Hand-Eye Calibration for Soft Robotic Tactile Welding
Viaarxiv icon

ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking

Add code
May 24, 2024
Figure 1 for ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Figure 2 for ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Figure 3 for ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Figure 4 for ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Viaarxiv icon

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

Add code
Mar 31, 2024
Figure 1 for Against The Achilles' Heel: A Survey on Red Teaming for Generative Models
Figure 2 for Against The Achilles' Heel: A Survey on Red Teaming for Generative Models
Figure 3 for Against The Achilles' Heel: A Survey on Red Teaming for Generative Models
Figure 4 for Against The Achilles' Heel: A Survey on Red Teaming for Generative Models
Viaarxiv icon