Nan Wang

University of California, Santa Cruz

A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization

Feb 18, 2025
Via arXiv

PUGS: Zero-shot Physical Understanding with Gaussian Splatting

Feb 17, 2025
Via arXiv

A Safe Hybrid Control Framework for Car-like Robot with Guaranteed Global Path-Invariance using a Control Barrier Function

Feb 11, 2025
Via arXiv

XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications

Feb 03, 2025
Via arXiv

StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation

Jan 10, 2025
Via arXiv

GenX: Mastering Code and Test Generation with Execution Feedback

Dec 18, 2024
Via arXiv

AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark

Dec 17, 2024
Via arXiv

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images

Dec 11, 2024
Via arXiv

ACE-Net: AutofoCus-Enhanced Convolutional Network for Field Imperfection Estimation with application to high b-value spiral Diffusion MRI

Nov 21, 2024
Via arXiv

cHyRRT and cHySST: Two Motion Planning Tools for Hybrid Dynamical Systems

Nov 18, 2024
Via arXiv