Picture for Haoyuan Shi

Haoyuan Shi

Controllable Edge-Type-Specific Interpretation in Multi-Relational Graph Neural Networks for Drug Response Prediction

Add code
Sep 03, 2024
Viaarxiv icon

DRExplainer: Quantifiable Interpretability in Drug Response Prediction with Directed Graph Convolutional Network

Add code
Aug 22, 2024
Viaarxiv icon

Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation

Add code
Aug 19, 2024
Figure 1 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 2 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 3 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Figure 4 for Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Viaarxiv icon

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning

Add code
Jun 17, 2024
Figure 1 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 2 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 3 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Figure 4 for VideoVista: A Versatile Benchmark for Video Understanding and Reasoning
Viaarxiv icon

TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction

Add code
May 27, 2024
Viaarxiv icon

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Add code
May 08, 2024
Viaarxiv icon

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment

Add code
Feb 21, 2024
Figure 1 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 2 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 3 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Figure 4 for Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Viaarxiv icon

Toward Moiré-Free and Detail-Preserving Demosaicking

Add code
May 15, 2023
Figure 1 for Toward Moiré-Free and Detail-Preserving Demosaicking
Figure 2 for Toward Moiré-Free and Detail-Preserving Demosaicking
Figure 3 for Toward Moiré-Free and Detail-Preserving Demosaicking
Figure 4 for Toward Moiré-Free and Detail-Preserving Demosaicking
Viaarxiv icon