Zhou Yu

University of California, Davis

Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation

Mar 26, 2025

Growing a Twig to Accelerate Large Vision-Language Models

Mar 18, 2025

Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs

Feb 28, 2025

Program Synthesis Dialog Agents for Interactive Decision-Making

Feb 26, 2025

Fréchet Cumulative Covariance Net for Deep Nonlinear Sufficient Dimension Reduction with Random Objects

Feb 21, 2025

ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Feb 19, 2025

A General Framework for Inference-time Scaling and Steering of Diffusion Models

Jan 16, 2025

Neural Networks Perform Sufficient Dimension Reduction

Dec 26, 2024

Probability-density-aware Semi-supervised Learning

Dec 23, 2024

PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles

Oct 22, 2024