Heming Xia

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Oct 09, 2024

Enhancing Tool Retrieval with Iterative Feedback from Large Language Models

Jun 25, 2024

Taking a Deep Breath: Enhancing Language Modeling of Large Language Models with Sentinel Tokens

Jun 16, 2024

Can Large Multimodal Models Uncover Deep Semantics Behind Images?

Feb 17, 2024

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

Jan 15, 2024

ImageNetVC: Zero-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories

May 24, 2023

Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization

May 24, 2023

Enhancing Continual Relation Extraction via Classifier Decomposition

May 08, 2023

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding

May 20, 2022

Lossless Speedup of Autoregressive Translation with Generalized Aggressive Decoding

Apr 02, 2022