Xipeng Qiu

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Feb 07, 2025

CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Jan 28, 2025

Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework

Jan 26, 2025

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Dec 24, 2024

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Dec 18, 2024

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Dec 04, 2024

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection

Nov 29, 2024

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Nov 11, 2024

Can Language Models Learn to Skip Steps?

Nov 04, 2024