Shan Jiang

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Dec 12, 2025
Via arXiv

Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities

Nov 18, 2025
Via arXiv

CASCADE: LLM-Powered JavaScript Deobfuscator at Google

Jul 23, 2025
Via arXiv

FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks

May 26, 2025
Via arXiv

High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update

Apr 01, 2025
Via arXiv

Target-aware Bidirectional Fusion Transformer for Aerial Object Tracking

Mar 13, 2025
Via arXiv

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Feb 22, 2025
Via arXiv

Real-Time Neural-Enhancement for Online Cloud Gaming

Jan 12, 2025
Via arXiv

Distributed satellite information networks: Architecture, enabling technologies, and trends

Dec 17, 2024
Via arXiv

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Jun 27, 2024
Via arXiv