Picture for Shunping Ji

Shunping Ji

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Add code
Jan 08, 2025
Viaarxiv icon

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Add code
Jan 07, 2025
Figure 1 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 2 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 3 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 4 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Viaarxiv icon

A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images

Add code
Dec 31, 2024
Figure 1 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 2 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 3 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 4 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Viaarxiv icon

Online Temporal Fusion for Vectorized Map Construction in Mapless Autonomous Driving

Add code
Sep 01, 2024
Viaarxiv icon

3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images

Add code
Aug 31, 2024
Figure 1 for 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images
Figure 2 for 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images
Figure 3 for 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images
Figure 4 for 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images
Viaarxiv icon

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Add code
Jun 27, 2024
Viaarxiv icon

P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images

Add code
Jun 05, 2024
Viaarxiv icon

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Add code
Apr 05, 2024
Viaarxiv icon

Point Could Mamba: Point Cloud Learning via State Space Model

Add code
Mar 01, 2024
Viaarxiv icon

DVIS++: Improved Decoupled Framework for Universal Video Segmentation

Add code
Dec 20, 2023
Viaarxiv icon