Picture for Yuan Gao

Yuan Gao

Department of Information Technology, Uppsala University, Uppsala, Sweden

Risk-Aware Driving Scenario Analysis with Large Language Models

Add code
Feb 04, 2025
Viaarxiv icon

AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges

Add code
Jan 24, 2025
Viaarxiv icon

Valley2: Exploring Multimodal Models with Scalable Vision-Language Design

Add code
Jan 13, 2025
Figure 1 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 2 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 3 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Figure 4 for Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Viaarxiv icon

Reach Measurement, Optimization and Frequency Capping In Targeted Online Advertising Under k-Anonymity

Add code
Jan 08, 2025
Viaarxiv icon

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Add code
Dec 24, 2024
Viaarxiv icon

Improved Forecasts of Global Extreme Marine Heatwaves Through a Physics-guided Data-driven Approach

Add code
Dec 20, 2024
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Viaarxiv icon

Self-test loss functions for learning weak-form operators and gradient flows

Add code
Dec 04, 2024
Viaarxiv icon

GrokFormer: Graph Fourier Kolmogorov-Arnold Transformers

Add code
Nov 26, 2024
Viaarxiv icon

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Add code
Nov 21, 2024
Figure 1 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 2 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 3 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 4 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Viaarxiv icon