Xipeng Qiu

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Dec 24, 2024
Via arXiv

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Dec 18, 2024
Via arXiv

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Dec 04, 2024
Via arXiv

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection

Nov 29, 2024
Via arXiv

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024
Via arXiv

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Nov 11, 2024
Via arXiv

Can Language Models Learn to Skip Steps?

Nov 04, 2024
Via arXiv

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Oct 31, 2024
Via arXiv

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Oct 27, 2024
Via arXiv