Jianbin Jiao

Recent Advances in Attack and Defense Approaches of Large Language Models

Sep 05, 2024
Via arXiv

Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input

Aug 28, 2024
Via arXiv

Depth-guided Texture Diffusion for Image Semantic Segmentation

Aug 17, 2024
Via arXiv

Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS

Aug 16, 2024
Via arXiv

Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian

May 30, 2024
Via arXiv

Position: Foundation Agents as the Paradigm Shift for Decision Making

May 29, 2024
Via arXiv

Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers

Jan 30, 2024
Via arXiv

CPR++: Object Localization via Single Coarse Point Supervision

Jan 30, 2024
Via arXiv

ChatterBox: Multi-round Multimodal Referring and Grounding

Jan 24, 2024
Via arXiv

P2Seg: Pointly-supervised Segmentation via Mutual Distillation

Jan 18, 2024
Via arXiv