Picture for Jiahao Liu

Jiahao Liu

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Add code
Nov 05, 2024
Viaarxiv icon

FIRP: Faster LLM inference via future intermediate representation prediction

Add code
Oct 27, 2024
Viaarxiv icon

Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation

Add code
Oct 07, 2024
Figure 1 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 2 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 3 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Figure 4 for Constructing and Masking Preference Profile with LLMs for Filtering Discomforting Recommendation
Viaarxiv icon

ReMamba: Equip Mamba with Effective Long-Sequence Modeling

Add code
Sep 01, 2024
Viaarxiv icon

Graph-Structured Speculative Decoding

Add code
Jul 23, 2024
Figure 1 for Graph-Structured Speculative Decoding
Figure 2 for Graph-Structured Speculative Decoding
Figure 3 for Graph-Structured Speculative Decoding
Figure 4 for Graph-Structured Speculative Decoding
Viaarxiv icon

EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction

Add code
Jun 10, 2024
Viaarxiv icon

Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism

Add code
Jun 06, 2024
Viaarxiv icon

Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

Add code
Apr 18, 2024
Viaarxiv icon

What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation

Add code
Mar 11, 2024
Viaarxiv icon

C-ICL: Contrastive In-context Learning for Information Extraction

Add code
Feb 17, 2024
Viaarxiv icon