Picture for Zhaozhuo Xu

Zhaozhuo Xu

Fox-1 Technical Report

Add code
Nov 08, 2024
Viaarxiv icon

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs

Add code
Nov 07, 2024
Viaarxiv icon

Do LLMs Know to Respect Copyright Notice?

Add code
Nov 02, 2024
Viaarxiv icon

Weighted Diversified Sampling for Efficient Data-Driven Single-Cell Gene-Gene Interaction Discovery

Add code
Oct 21, 2024
Viaarxiv icon

SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching

Add code
Oct 08, 2024
Viaarxiv icon

Measuring Copyright Risks of Large Language Model via Partial Information Probing

Add code
Sep 20, 2024
Viaarxiv icon

Sirius: Contextual Sparsity with Correction for Efficient LLMs

Add code
Sep 05, 2024
Viaarxiv icon

PolyRouter: A Multi-LLM Querying System

Add code
Aug 26, 2024
Figure 1 for PolyRouter: A Multi-LLM Querying System
Figure 2 for PolyRouter: A Multi-LLM Querying System
Figure 3 for PolyRouter: A Multi-LLM Querying System
Figure 4 for PolyRouter: A Multi-LLM Querying System
Viaarxiv icon

ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency

Add code
Jul 23, 2024
Viaarxiv icon

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

Add code
Jul 01, 2024
Viaarxiv icon