Chaoyang Wang

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Feb 05, 2026

Revisiting Diffusion Model Predictions Through Dimensionality

Jan 29, 2026

A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice

Dec 23, 2025

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Dec 19, 2025

EasyV2V: A High-quality Instruction-based Video Editing Framework

Dec 18, 2025

OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

Dec 11, 2025

OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research

Oct 30, 2025

Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning

Jun 07, 2025

Grounding Chest X-Ray Visual Question Answering with Generated Radiology Reports

May 22, 2025

Conditional Panoramic Image Generation via Masked Autoregressive Modeling

May 22, 2025