Picture for Shiwen Ni

Shiwen Ni

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking

Add code
Jan 30, 2025
Figure 1 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 2 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 3 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 4 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Viaarxiv icon

Pre-training, Fine-tuning and Re-ranking: A Three-Stage Framework for Legal Question Answering

Add code
Dec 27, 2024
Figure 1 for Pre-training, Fine-tuning and Re-ranking: A Three-Stage Framework for Legal Question Answering
Figure 2 for Pre-training, Fine-tuning and Re-ranking: A Three-Stage Framework for Legal Question Answering
Figure 3 for Pre-training, Fine-tuning and Re-ranking: A Three-Stage Framework for Legal Question Answering
Viaarxiv icon

Small Language Model as Data Prospector for Large Language Model

Add code
Dec 13, 2024
Viaarxiv icon

AutoPatent: A Multi-Agent Framework for Automatic Patent Generation

Add code
Dec 13, 2024
Viaarxiv icon

Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration

Add code
Dec 05, 2024
Figure 1 for Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration
Figure 2 for Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration
Figure 3 for Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration
Figure 4 for Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration
Viaarxiv icon

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Add code
Oct 17, 2024
Figure 1 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 2 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 3 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Figure 4 for Can MLLMs Understand the Deep Implication Behind Chinese Images?
Viaarxiv icon

LIME-M: Less Is More for Evaluation of MLLMs

Add code
Sep 10, 2024
Figure 1 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 2 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 3 for LIME-M: Less Is More for Evaluation of MLLMs
Figure 4 for LIME-M: Less Is More for Evaluation of MLLMs
Viaarxiv icon

Training on the Benchmark Is Not All You Need

Add code
Sep 03, 2024
Figure 1 for Training on the Benchmark Is Not All You Need
Figure 2 for Training on the Benchmark Is Not All You Need
Figure 3 for Training on the Benchmark Is Not All You Need
Figure 4 for Training on the Benchmark Is Not All You Need
Viaarxiv icon

Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused

Add code
Aug 16, 2024
Figure 1 for Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Figure 2 for Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Figure 3 for Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Figure 4 for Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused
Viaarxiv icon