Picture for Zhen Xiang

Zhen Xiang

Data Free Backdoor Attacks

Add code
Dec 09, 2024
Viaarxiv icon

Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review

Add code
Dec 02, 2024
Figure 1 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 2 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 3 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Figure 4 for Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review
Viaarxiv icon

Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios

Add code
Nov 16, 2024
Figure 1 for Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios
Figure 2 for Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios
Figure 3 for Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios
Figure 4 for Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios
Viaarxiv icon

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Add code
Sep 27, 2024
Figure 1 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 2 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 3 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 4 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Viaarxiv icon

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Add code
Jul 17, 2024
Viaarxiv icon

GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning

Add code
Jun 13, 2024
Viaarxiv icon

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Add code
Feb 22, 2024
Viaarxiv icon

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

Add code
Jan 20, 2024
Viaarxiv icon

CBD: A Certified Backdoor Detector Based on Local Dominant Probability

Add code
Oct 26, 2023
Viaarxiv icon

Backdoor Mitigation by Correcting the Distribution of Neural Activations

Add code
Aug 18, 2023
Viaarxiv icon