Picture for Yu Gu

Yu Gu

Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search

Add code
Feb 18, 2025
Viaarxiv icon

Magma: A Foundation Model for Multimodal AI Agents

Add code
Feb 18, 2025
Viaarxiv icon

SimSort: A Powerful Framework for Spike Sorting by Large-Scale Electrophysiology Simulation

Add code
Feb 05, 2025
Viaarxiv icon

Universal Abstraction: Harnessing Frontier Models to Structure Real-World Data at Scale

Add code
Feb 02, 2025
Viaarxiv icon

CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder

Add code
Dec 12, 2024
Figure 1 for CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
Figure 2 for CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
Figure 3 for CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
Figure 4 for CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
Viaarxiv icon

XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference

Add code
Dec 08, 2024
Figure 1 for XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference
Figure 2 for XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference
Figure 3 for XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference
Figure 4 for XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference
Viaarxiv icon

Improving Accuracy and Generalization for Efficient Visual Tracking

Add code
Nov 28, 2024
Figure 1 for Improving Accuracy and Generalization for Efficient Visual Tracking
Figure 2 for Improving Accuracy and Generalization for Efficient Visual Tracking
Figure 3 for Improving Accuracy and Generalization for Efficient Visual Tracking
Figure 4 for Improving Accuracy and Generalization for Efficient Visual Tracking
Viaarxiv icon

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Add code
Nov 10, 2024
Figure 1 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 2 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 3 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 4 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Viaarxiv icon

Building A Coding Assistant via the Retrieval-Augmented Language Model

Add code
Oct 21, 2024
Figure 1 for Building A Coding Assistant via the Retrieval-Augmented Language Model
Figure 2 for Building A Coding Assistant via the Retrieval-Augmented Language Model
Figure 3 for Building A Coding Assistant via the Retrieval-Augmented Language Model
Figure 4 for Building A Coding Assistant via the Retrieval-Augmented Language Model
Viaarxiv icon

DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis

Add code
Oct 17, 2024
Viaarxiv icon