Picture for Di Lin

Di Lin

Tianjin University

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

Add code
Dec 14, 2025
Viaarxiv icon

CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

Add code
Nov 17, 2025
Viaarxiv icon

Zero-Shot Learning with Subsequence Reordering Pretraining for Compound-Protein Interaction

Add code
Jul 28, 2025
Viaarxiv icon

IMPA-HGAE:Intra-Meta-Path Augmented Heterogeneous Graph Autoencoder

Add code
Jun 07, 2025
Viaarxiv icon

Learning Robust Heterogeneous Graph Representations via Contrastive-Reconstruction under Sparse Semantics

Add code
Jun 07, 2025
Viaarxiv icon

Light as Deception: GPT-driven Natural Relighting Against Vision-Language Pre-training Models

Add code
May 30, 2025
Viaarxiv icon

NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration

Add code
Apr 25, 2025
Viaarxiv icon

Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning

Add code
Apr 23, 2025
Viaarxiv icon

Casual Inference via Style Bias Deconfounding for Domain Generalization

Add code
Mar 21, 2025
Figure 1 for Casual Inference via Style Bias Deconfounding for Domain Generalization
Figure 2 for Casual Inference via Style Bias Deconfounding for Domain Generalization
Figure 3 for Casual Inference via Style Bias Deconfounding for Domain Generalization
Figure 4 for Casual Inference via Style Bias Deconfounding for Domain Generalization
Viaarxiv icon

SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments

Add code
Nov 28, 2024
Figure 1 for SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
Figure 2 for SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
Figure 3 for SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
Figure 4 for SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
Viaarxiv icon