Picture for Jiaying Lin

Jiaying Lin

Do Multimodal Large Language Models See Like Humans?

Add code
Dec 12, 2024
Figure 1 for Do Multimodal Large Language Models See Like Humans?
Figure 2 for Do Multimodal Large Language Models See Like Humans?
Figure 3 for Do Multimodal Large Language Models See Like Humans?
Figure 4 for Do Multimodal Large Language Models See Like Humans?
Viaarxiv icon

Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension

Add code
Oct 02, 2024
Figure 1 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 2 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 3 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Figure 4 for Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Viaarxiv icon

OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding

Add code
Aug 20, 2024
Figure 1 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 2 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 3 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Figure 4 for OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Viaarxiv icon

SemiPL: A Semi-supervised Method for Event Sound Source Localization

Add code
Apr 30, 2024
Figure 1 for SemiPL: A Semi-supervised Method for Event Sound Source Localization
Figure 2 for SemiPL: A Semi-supervised Method for Event Sound Source Localization
Figure 3 for SemiPL: A Semi-supervised Method for Event Sound Source Localization
Figure 4 for SemiPL: A Semi-supervised Method for Event Sound Source Localization
Viaarxiv icon

HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition

Add code
Apr 25, 2024
Figure 1 for HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition
Figure 2 for HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition
Figure 3 for HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition
Figure 4 for HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition
Viaarxiv icon

SFMViT: SlowFast Meet ViT in Chaotic World

Add code
Apr 25, 2024
Viaarxiv icon

Efficient Mirror Detection via Multi-level Heterogeneous Learning

Add code
Nov 28, 2022
Figure 1 for Efficient Mirror Detection via Multi-level Heterogeneous Learning
Figure 2 for Efficient Mirror Detection via Multi-level Heterogeneous Learning
Figure 3 for Efficient Mirror Detection via Multi-level Heterogeneous Learning
Figure 4 for Efficient Mirror Detection via Multi-level Heterogeneous Learning
Viaarxiv icon

Weakly-Supervised Camouflaged Object Detection with Scribble Annotations

Add code
Jul 28, 2022
Figure 1 for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations
Figure 2 for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations
Figure 3 for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations
Figure 4 for Weakly-Supervised Camouflaged Object Detection with Scribble Annotations
Viaarxiv icon

Symmetry-Aware Transformer-based Mirror Detection

Add code
Jul 13, 2022
Figure 1 for Symmetry-Aware Transformer-based Mirror Detection
Figure 2 for Symmetry-Aware Transformer-based Mirror Detection
Figure 3 for Symmetry-Aware Transformer-based Mirror Detection
Figure 4 for Symmetry-Aware Transformer-based Mirror Detection
Viaarxiv icon

Depth-aware Glass Surface Detection with Cross-modal Context Mining

Add code
Jun 22, 2022
Figure 1 for Depth-aware Glass Surface Detection with Cross-modal Context Mining
Figure 2 for Depth-aware Glass Surface Detection with Cross-modal Context Mining
Figure 3 for Depth-aware Glass Surface Detection with Cross-modal Context Mining
Figure 4 for Depth-aware Glass Surface Detection with Cross-modal Context Mining
Viaarxiv icon