Picture for Hai Zhao

Hai Zhao

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing

Add code
Oct 24, 2024
Viaarxiv icon

Instruction-Driven Game Engine: A Poker Case Study

Add code
Oct 17, 2024
Viaarxiv icon

Are LLMs Aware that Some Questions are not Open-ended?

Add code
Oct 01, 2024
Viaarxiv icon

VHASR: A Multimodal Speech Recognition System With Vision Hotwords

Add code
Oct 01, 2024
Figure 1 for VHASR: A Multimodal Speech Recognition System With Vision Hotwords
Figure 2 for VHASR: A Multimodal Speech Recognition System With Vision Hotwords
Figure 3 for VHASR: A Multimodal Speech Recognition System With Vision Hotwords
Figure 4 for VHASR: A Multimodal Speech Recognition System With Vision Hotwords
Viaarxiv icon

Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models

Add code
Sep 30, 2024
Figure 1 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 2 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 3 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 4 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Viaarxiv icon

A Coin Has Two Sides: A Novel Detector-Corrector Framework for Chinese Spelling Correction

Add code
Sep 06, 2024
Viaarxiv icon

Nothing in Excess: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering

Add code
Aug 21, 2024
Viaarxiv icon

MEGen: Generative Backdoor in Large Language Models via Model Editing

Add code
Aug 20, 2024
Figure 1 for MEGen: Generative Backdoor in Large Language Models via Model Editing
Figure 2 for MEGen: Generative Backdoor in Large Language Models via Model Editing
Figure 3 for MEGen: Generative Backdoor in Large Language Models via Model Editing
Figure 4 for MEGen: Generative Backdoor in Large Language Models via Model Editing
Viaarxiv icon

BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction

Add code
Aug 19, 2024
Figure 1 for BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Figure 2 for BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Figure 3 for BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Figure 4 for BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Viaarxiv icon

Self-Directed Turing Test for Large Language Models

Add code
Aug 19, 2024
Viaarxiv icon