Tianyi Wang

Zhejiang University, Hangzhou, China

PT-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Papers

Feb 14, 2026
Via arXiv

Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

Feb 11, 2026
Via arXiv

Rethinking Latency Denial-of-Service: Attacking the LLM Serving Framework, Not the Model

Feb 08, 2026
Via arXiv

D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning

Feb 08, 2026
Via arXiv

Anchored Policy Optimization: Mitigating Exploration Collapse Via Support-Constrained Rectification

Feb 05, 2026
Via arXiv

PILD: Physics-Informed Learning via Diffusion

Jan 29, 2026
Via arXiv

ScenePilot-Bench: A Large-Scale Dataset and Benchmark for Evaluation of Vision-Language Models in Autonomous Driving

Jan 27, 2026
Via arXiv

Do VLMs Have a Moral Backbone? A Study on the Fragile Morality of Vision-Language Models

Jan 23, 2026
Via arXiv

WAVE: Learning Unified & Versatile Audio-Visual Embeddings with Multimodal LLM

Sep 26, 2025
Via arXiv

UAV-Based Intelligent Traffic Surveillance System: Real-Time Vehicle Detection, Classification, Tracking, and Behavioral Analysis

Sep 04, 2025
Via arXiv