Wenxuan Wang

Vehicle Acceleration Prediction Considering Environmental Influence and Individual Driving Behavior

Apr 05, 2025

Image Difference Grounding with Natural Language

Apr 02, 2025

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities

Apr 02, 2025

TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM

Mar 17, 2025

VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models

Mar 10, 2025

SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks

Mar 10, 2025

VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models

Feb 23, 2025

Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation

Feb 21, 2025

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Feb 10, 2025

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries

Feb 09, 2025