Picture for Yu Yao

Yu Yao

DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection

Add code
Nov 18, 2025
Viaarxiv icon

VitalBench: A Rigorous Multi-Center Benchmark for Long-Term Vital Sign Prediction in Intraoperative Care

Add code
Nov 14, 2025
Viaarxiv icon

SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

Add code
Jul 29, 2025
Figure 1 for SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Figure 2 for SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Figure 3 for SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Figure 4 for SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Viaarxiv icon

A Sample Efficient Conditional Independence Test in the Presence of Discretization

Add code
Jun 10, 2025
Viaarxiv icon

Beyond Optimal Transport: Model-Aligned Coupling for Flow Matching

Add code
May 29, 2025
Viaarxiv icon

SP2RINT: Spatially-Decoupled Physics-Inspired Progressive Inverse Optimization for Scalable, PDE-Constrained Meta-Optical Neural Network Training

Add code
May 23, 2025
Viaarxiv icon

Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space

Add code
Mar 13, 2025
Viaarxiv icon

Semantic Data Augmentation Enhanced Invariant Risk Minimization for Medical Image Domain Generalization

Add code
Feb 08, 2025
Viaarxiv icon

Flow: A Modular Approach to Automated Agentic Workflow Generation

Add code
Jan 14, 2025
Figure 1 for Flow: A Modular Approach to Automated Agentic Workflow Generation
Figure 2 for Flow: A Modular Approach to Automated Agentic Workflow Generation
Figure 3 for Flow: A Modular Approach to Automated Agentic Workflow Generation
Figure 4 for Flow: A Modular Approach to Automated Agentic Workflow Generation
Viaarxiv icon

Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels

Add code
Dec 09, 2024
Figure 1 for Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels
Figure 2 for Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels
Figure 3 for Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels
Figure 4 for Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels
Viaarxiv icon