Picture for Yan Gao

Yan Gao

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Add code
Dec 12, 2024
Figure 1 for ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty
Figure 2 for ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty
Figure 3 for ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty
Figure 4 for ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty
Viaarxiv icon

ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval

Add code
Nov 24, 2024
Figure 1 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 2 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 3 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 4 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Viaarxiv icon

Photon: Federated LLM Pre-Training

Add code
Nov 05, 2024
Figure 1 for Photon: Federated LLM Pre-Training
Figure 2 for Photon: Federated LLM Pre-Training
Figure 3 for Photon: Federated LLM Pre-Training
Figure 4 for Photon: Federated LLM Pre-Training
Viaarxiv icon

MoDification: Mixture of Depths Made Easy

Add code
Oct 18, 2024
Figure 1 for MoDification: Mixture of Depths Made Easy
Figure 2 for MoDification: Mixture of Depths Made Easy
Figure 3 for MoDification: Mixture of Depths Made Easy
Figure 4 for MoDification: Mixture of Depths Made Easy
Viaarxiv icon

StraGo: Harnessing Strategic Guidance for Prompt Optimization

Add code
Oct 11, 2024
Figure 1 for StraGo: Harnessing Strategic Guidance for Prompt Optimization
Figure 2 for StraGo: Harnessing Strategic Guidance for Prompt Optimization
Figure 3 for StraGo: Harnessing Strategic Guidance for Prompt Optimization
Figure 4 for StraGo: Harnessing Strategic Guidance for Prompt Optimization
Viaarxiv icon

AMPO: Automatic Multi-Branched Prompt Optimization

Add code
Oct 11, 2024
Figure 1 for AMPO: Automatic Multi-Branched Prompt Optimization
Figure 2 for AMPO: Automatic Multi-Branched Prompt Optimization
Figure 3 for AMPO: Automatic Multi-Branched Prompt Optimization
Figure 4 for AMPO: Automatic Multi-Branched Prompt Optimization
Viaarxiv icon

DEPT: Decoupled Embeddings for Pre-training Language Models

Add code
Oct 07, 2024
Figure 1 for DEPT: Decoupled Embeddings for Pre-training Language Models
Figure 2 for DEPT: Decoupled Embeddings for Pre-training Language Models
Figure 3 for DEPT: Decoupled Embeddings for Pre-training Language Models
Figure 4 for DEPT: Decoupled Embeddings for Pre-training Language Models
Viaarxiv icon

VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation

Add code
Aug 29, 2024
Figure 1 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 2 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 3 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 4 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Viaarxiv icon

Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective

Add code
Jun 19, 2024
Figure 1 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 2 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 3 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 4 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Viaarxiv icon

DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?

Add code
Jun 18, 2024
Figure 1 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 2 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 3 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 4 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Viaarxiv icon