Xinyu Tian

Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

Dec 26, 2025
Via arXiv

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Oct 02, 2025
Via arXiv

Simple Radiology VLLM Test-time Scaling with Thought Graph Traversal

Jun 13, 2025
Via arXiv

Conditional Data Synthesis Augmentation

Apr 10, 2025
Via arXiv

Identifying and Mitigating Position Bias of Multi-image Vision-Language Models

Mar 18, 2025
Via arXiv

Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition

Feb 19, 2025
Via arXiv

Generative Distribution Prediction: A Unified Approach to Multimodal Learning

Feb 10, 2025
Via arXiv

Speech Translation Refinement using Large Language Models

Jan 25, 2025
Via arXiv

SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models

Jan 20, 2025
Via arXiv

Continuous-Time Digital Twin with Analogue Memristive Neural Ordinary Differential Equation Solver

Jun 12, 2024
Via arXiv