Wenxuan Wang

TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM

Mar 17, 2025
Via arXiv

VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models

Mar 10, 2025
Via arXiv

SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks

Mar 10, 2025
Via arXiv

VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models

Feb 23, 2025
Via arXiv

Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation

Feb 21, 2025
Via arXiv

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Feb 10, 2025
Via arXiv

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries

Feb 09, 2025
Via arXiv

How Should I Build A Benchmark?

Jan 18, 2025
Via arXiv

MRWeb: An Exploration of Generating Multi-Page Resource-Aware Web Code from UI Designs

Dec 19, 2024
Via arXiv

Sustainable Self-evolution Adversarial Training

Dec 03, 2024
Via arXiv