Picture for Xiaolong Jiang

Xiaolong Jiang

Xiaohongshu Inc

Impostor: An Agent-Curated Benchmark for Realistic AIGC Manipulation Localization

Add code
Jun 03, 2026
Viaarxiv icon

Preference-Aware Rubric Learning for Personalized Evaluation

Add code
May 29, 2026
Viaarxiv icon

AgentCVR: Active Multi-Agent Cross-Video Reasoning via Script-Simulated Reinforcement Learning

Add code
May 28, 2026
Viaarxiv icon

Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models

Add code
Nov 15, 2025
Viaarxiv icon

Progressive Scaling Visual Object Tracking

Add code
May 26, 2025
Viaarxiv icon

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

Add code
Feb 06, 2025
Figure 1 for WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
Figure 2 for WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
Figure 3 for WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
Figure 4 for WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
Viaarxiv icon

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Add code
Dec 02, 2024
Figure 1 for LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Figure 2 for LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Figure 3 for LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Figure 4 for LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
Viaarxiv icon

P4Q: Learning to Prompt for Quantization in Visual-language Models

Add code
Sep 26, 2024
Figure 1 for P4Q: Learning to Prompt for Quantization in Visual-language Models
Figure 2 for P4Q: Learning to Prompt for Quantization in Visual-language Models
Figure 3 for P4Q: Learning to Prompt for Quantization in Visual-language Models
Figure 4 for P4Q: Learning to Prompt for Quantization in Visual-language Models
Viaarxiv icon

Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective

Add code
Aug 13, 2024
Figure 1 for Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
Figure 2 for Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
Figure 3 for Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
Figure 4 for Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
Viaarxiv icon