Qi Song

Envision4D: Envisioning Visual Futures via Feed-forward 4D Gaussian Splatting for Autonomous Driving

Jun 09, 2026

TIGER: Text-Informed Generalized Enzyme-Reaction Retrieval

May 23, 2026

Case-Aware Medical Image Classification with Multimodal Knowledge Graphs and Reliability-Guided Refinement

May 21, 2026

Ray-Aware Pointer Memory with Adaptive Updates for Streaming 3D Reconstruction

May 07, 2026

Physics-informed Deep Mixture-of-Koopmans Vehicle Dynamics Model with Dual-branch Encoder for Distributed Electric-drive Trucks

Mar 18, 2026

SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering

Jan 06, 2026

Towards Long-window Anchoring in Vision-Language Model Distillation

Dec 25, 2025

CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning

Dec 19, 2025

GraphIF: Enhancing Multi-Turn Instruction Following for Large Language Models with Relation Graph Prompt

Nov 13, 2025

Align 3D Representation and Text Embedding for 3D Content Personalization

Aug 23, 2025