Zhaozhuo Xu

Fox-1 Technical Report

Nov 08, 2024

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs

Nov 07, 2024

Do LLMs Know to Respect Copyright Notice?

Nov 02, 2024

Weighted Diversified Sampling for Efficient Data-Driven Single-Cell Gene-Gene Interaction Discovery

Oct 21, 2024

SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching

Oct 08, 2024

Measuring Copyright Risks of Large Language Model via Partial Information Probing

Sep 20, 2024

Sirius: Contextual Sparsity with Correction for Efficient LLMs

Sep 05, 2024

PolyRouter: A Multi-LLM Querying System

Aug 26, 2024

ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency

Jul 23, 2024

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

Jul 01, 2024