Picture for Shijian Lu

Shijian Lu

Nanyang Technological University

Backdoor Attacks against No-Reference Image Quality Assessment Models via A Scalable Trigger

Add code
Dec 10, 2024
Viaarxiv icon

Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior

Add code
Dec 02, 2024
Figure 1 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 2 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 3 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Figure 4 for Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior
Viaarxiv icon

Multimodal 3D Reasoning Segmentation with Complex Scenes

Add code
Nov 21, 2024
Viaarxiv icon

Novel View Extrapolation with Video Diffusion Priors

Add code
Nov 21, 2024
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon

Open-Vocabulary Object Detection via Language Hierarchy

Add code
Oct 27, 2024
Figure 1 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 2 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 3 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 4 for Open-Vocabulary Object Detection via Language Hierarchy
Viaarxiv icon

Foundation Models for Remote Sensing and Earth Observation: A Survey

Add code
Oct 22, 2024
Viaarxiv icon

Mitigating Object Hallucination via Concentric Causal Attention

Add code
Oct 21, 2024
Viaarxiv icon

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Add code
Oct 16, 2024
Figure 1 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 2 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 3 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 4 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Viaarxiv icon

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Add code
Oct 15, 2024
Figure 1 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 2 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 3 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 4 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Viaarxiv icon