Xinyu Chen

DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild

Nov 20, 2024

Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review

Oct 29, 2024

Correlating Time Series with Interpretable Convolutional Kernels

Sep 02, 2024

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Jun 17, 2024

On Unified Prompt Tuning for Request Quality Assurance in Public Code Review

Apr 11, 2024

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment

Feb 21, 2024

LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs

Feb 21, 2024

A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering

Nov 13, 2023

QwenGrasp: A Usage of Large Vision-Language Model for Target-Oriented Grasping

Oct 08, 2023

Understanding Deep Neural Networks via Linear Separability of Hidden Layers

Jul 26, 2023