Ruifeng Xu

xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking

Jan 30, 2025
Via arXiv

AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling

Jan 16, 2025
Via arXiv

Distilling Fine-grained Sentiment Understanding from Large Language Models

Dec 24, 2024
Via arXiv

Correcting Large Language Model Behavior via Influence Function

Dec 21, 2024
Via arXiv

DS$^2$-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis

Dec 19, 2024
Via arXiv

Multi-Task Model Merging via Adaptive Weight Disentanglement

Nov 27, 2024
Via arXiv

DualCoTs: Dual Chain-of-Thoughts Prompting for Sentiment Lexicon Expansion of Idioms

Sep 26, 2024
Via arXiv

Training on the Benchmark Is Not All You Need

Sep 03, 2024
Via arXiv

Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused

Aug 16, 2024
Via arXiv

Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction

Jun 26, 2024
Via arXiv