Yuwei Hu

Unifying KV Cache Compression for Large Language Models with LeanKV

Dec 04, 2024
Via arXiv

Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents

Oct 07, 2024
Via arXiv

RAG-Optimized Tibetan Tourism LLMs: Enhancing Accuracy and Personalization

Aug 21, 2024
Via arXiv

Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning

Jun 24, 2024
Via arXiv

Intruding with Words: Towards Understanding Graph Injection Attacks at the Text Level

May 26, 2024
Via arXiv

An Asynchronous Updating Reinforcement Learning Framework for Task-oriented Dialog System

May 04, 2023
Via arXiv

Benchmarking GNN-Based Recommender Systems on Intel Optane Persistent Memory

Jul 25, 2022
Via arXiv

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots

Mar 21, 2022
Via arXiv

Dense Pruning of Pointwise Convolutions in the Frequency Domain

Sep 16, 2021
Via arXiv

FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems

Sep 29, 2020
Via arXiv