Picture for Xianjun Yang

Xianjun Yang

CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy

Add code
Oct 17, 2024
Figure 1 for CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
Figure 2 for CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
Figure 3 for CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
Figure 4 for CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy
Viaarxiv icon

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Add code
Jul 06, 2024
Viaarxiv icon

MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate

Add code
Jun 26, 2024
Viaarxiv icon

Improving Logits-based Detector without Logits from Black-box LLMs

Add code
Jun 11, 2024
Viaarxiv icon

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Add code
May 30, 2024
Viaarxiv icon

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Apr 18, 2024
Viaarxiv icon

Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning

Add code
Apr 16, 2024
Viaarxiv icon

A Safe Harbor for AI Evaluation and Red Teaming

Add code
Mar 07, 2024
Figure 1 for A Safe Harbor for AI Evaluation and Red Teaming
Figure 2 for A Safe Harbor for AI Evaluation and Red Teaming
Figure 3 for A Safe Harbor for AI Evaluation and Red Teaming
Figure 4 for A Safe Harbor for AI Evaluation and Red Teaming
Viaarxiv icon

A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets

Add code
Feb 22, 2024
Viaarxiv icon

Test-Time Backdoor Attacks on Multimodal Large Language Models

Add code
Feb 13, 2024
Viaarxiv icon