Picture for Shijian Lu

Shijian Lu

Nanyang Technological University

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Add code
Mar 17, 2025
Viaarxiv icon

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

Add code
Mar 17, 2025
Viaarxiv icon

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

Add code
Mar 13, 2025
Viaarxiv icon

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

Add code
Mar 10, 2025
Viaarxiv icon

Data-Efficient Generalization for Zero-shot Composed Image Retrieval

Add code
Mar 07, 2025
Viaarxiv icon

Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger

Add code
Dec 10, 2024
Figure 1 for Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger
Figure 2 for Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger
Figure 3 for Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger
Figure 4 for Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger
Viaarxiv icon

Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior

Add code
Dec 02, 2024
Figure 1 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 2 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 3 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 4 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Viaarxiv icon

Multimodal 3D Reasoning Segmentation with Complex Scenes

Add code
Nov 21, 2024
Viaarxiv icon

Novel View Extrapolation with Video Diffusion Priors

Add code
Nov 21, 2024
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon