Picture for Ngai Wong

Ngai Wong

MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers

Add code
Oct 23, 2024
Figure 1 for MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Figure 2 for MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Figure 3 for MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Figure 4 for MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Viaarxiv icon

UNComp: Uncertainty-Aware Long-Context Compressor for Efficient Large Language Model Inference

Add code
Oct 04, 2024
Viaarxiv icon

UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation

Add code
Oct 03, 2024
Viaarxiv icon

A Survey on the Honesty of Large Language Models

Add code
Sep 27, 2024
Figure 1 for A Survey on the Honesty of Large Language Models
Figure 2 for A Survey on the Honesty of Large Language Models
Figure 3 for A Survey on the Honesty of Large Language Models
Figure 4 for A Survey on the Honesty of Large Language Models
Viaarxiv icon

LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Add code
Jul 18, 2024
Viaarxiv icon

Mixture-of-Subspaces in Low-Rank Adaptation

Add code
Jun 16, 2024
Viaarxiv icon

ASMR: Activation-sharing Multi-resolution Coordinate Networks For Efficient Inference

Add code
May 20, 2024
Viaarxiv icon

Nonparametric Teaching of Implicit Neural Representations

Add code
May 17, 2024
Viaarxiv icon

Poisoning-based Backdoor Attacks for Arbitrary Target Label with Positive Triggers

Add code
May 09, 2024
Viaarxiv icon