Shuquan Ye

Do Multimodal Large Language Models See Like Humans?

Dec 12, 2024
Via arXiv

OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding

Aug 20, 2024
Via arXiv

Robust Point Cloud Segmentation with Noisy Annotations

Dec 06, 2022
Via arXiv

Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles

Nov 29, 2022
Via arXiv

3D Question Answering

Dec 15, 2021
Via arXiv

Learning with Noisy Labels for Robust Point Cloud Segmentation

Aug 05, 2021
Via arXiv

Exemplar-Based 3D Portrait Stylization

Apr 29, 2021
Via arXiv