Yue Yu

Revealing the Implicit Noise-based Imprint of Generative Models

Mar 12, 2025

An optimal Petrov-Galerkin framework for operator networks

Mar 06, 2025

Accurate Expert Predictions in MoE Inference via Cross-Layer Gate

Feb 17, 2025

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline

Feb 09, 2025

Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data

Jan 19, 2025

MiniMax-01: Scaling Foundation Models with Lightning Attention

Jan 14, 2025

Continuous Knowledge-Preserving Decomposition for Few-Shot Continual Learning

Jan 09, 2025

Correcting Large Language Model Behavior via Influence Function

Dec 21, 2024

GraphLoRA: Empowering LLMs Fine-Tuning via Graph Collaboration of MoE

Dec 18, 2024

Centaur: Bridging the Impossible Trinity of Privacy, Efficiency, and Performance in Privacy-Preserving Transformer Inference

Dec 14, 2024