Picture for Xinyu Chen

Xinyu Chen

DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild

Add code
Nov 20, 2024
Figure 1 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 2 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 3 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 4 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Viaarxiv icon

Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review

Add code
Oct 29, 2024
Figure 1 for Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review
Figure 2 for Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review
Figure 3 for Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review
Figure 4 for Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review
Viaarxiv icon

Correlating Time Series with Interpretable Convolutional Kernels

Add code
Sep 02, 2024
Viaarxiv icon

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Add code
Jun 17, 2024
Figure 1 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 2 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 3 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 4 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Viaarxiv icon

On Unified Prompt Tuning for Request Quality Assurance in Public Code Review

Add code
Apr 11, 2024
Viaarxiv icon

LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs

Add code
Feb 21, 2024
Figure 1 for LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Figure 2 for LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Figure 3 for LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Figure 4 for LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Viaarxiv icon

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment

Add code
Feb 21, 2024
Figure 1 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 2 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 3 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 4 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Viaarxiv icon

A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering

Add code
Nov 13, 2023
Viaarxiv icon

QwenGrasp: A Usage of Large Vision-Language Model for Target-Oriented Grasping

Add code
Oct 08, 2023
Viaarxiv icon

Understanding Deep Neural Networks via Linear Separability of Hidden Layers

Add code
Jul 26, 2023
Viaarxiv icon