Ye Xia

Gemini Embedding: Generalizable Embeddings from Gemini

Mar 10, 2025
Via arXiv

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Feb 20, 2025
Via arXiv

Learning Visual Composition through Improved Semantic Guidance

Dec 19, 2024
Via arXiv

Transferring self-supervised pre-trained models for SHM data anomaly detection with scarce labeled data

Dec 05, 2024
Via arXiv

TIPS: Text-Image Pretraining with Spatial Awareness

Oct 21, 2024
Via arXiv

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Feb 11, 2021
Via arXiv

Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention

Mar 24, 2019
Via arXiv

Predicting Driver Attention in Critical Situations

Aug 16, 2018
Via arXiv