Picture for Jiahao Nie

Jiahao Nie

Unleashing the Potential of Model Bias for Generalized Category Discovery

Add code
Dec 17, 2024
Viaarxiv icon

VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking

Add code
Aug 05, 2024
Viaarxiv icon

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Add code
Jul 22, 2024
Figure 1 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 2 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 3 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 4 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Viaarxiv icon

P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds

Add code
Jul 09, 2024
Viaarxiv icon

Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models

Add code
Jun 27, 2024
Figure 1 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 2 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 3 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 4 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Viaarxiv icon

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Add code
Jun 18, 2024
Figure 1 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 2 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 3 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 4 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Viaarxiv icon

MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

Add code
Jun 13, 2024
Figure 1 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 2 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 3 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 4 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Viaarxiv icon

Color Space Learning for Cross-Color Person Re-Identification

Add code
May 15, 2024
Viaarxiv icon

Towards Category Unification of 3D Single Object Tracking on Point Clouds

Add code
Jan 20, 2024
Viaarxiv icon

Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

Add code
Jan 16, 2024
Figure 1 for Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
Figure 2 for Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
Figure 3 for Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
Figure 4 for Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
Viaarxiv icon