Jen-tse Huang

On the Shortcut Learning in Multilingual Neural Machine Translation

Nov 15, 2024

Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs

Oct 10, 2024

Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step

Oct 04, 2024

Learning to Ask: When LLMs Meet Unclear Instruction

Aug 31, 2024

On the Resilience of Multi-Agent Systems with Malicious Agents

Aug 02, 2024

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Jul 12, 2024

InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context

Jun 18, 2024

How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO

Apr 22, 2024

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

Mar 18, 2024

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

Jan 01, 2024