Jinlan Fu

FlipAttack: Jailbreak LLMs via Flipping

Oct 02, 2024
Via arXiv

Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism

Jul 24, 2024
Via arXiv

Cross-Modality Safety Alignment

Jun 21, 2024
Via arXiv

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Jun 17, 2024
Via arXiv

Chain of Thought Explanation for Dialogue State Tracking

Mar 09, 2024
Via arXiv

CET2: Modelling Topic Transitions for Coherent and Engaging Knowledge-Grounded Conversations

Mar 04, 2024
Via arXiv

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

Feb 22, 2024
Via arXiv

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Feb 17, 2024
Via arXiv

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Jan 29, 2024
Via arXiv

How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation

Dec 28, 2023
Via arXiv