Picture for Lixing Shen

Lixing Shen

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Add code
Feb 20, 2025
Viaarxiv icon

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Add code
Jan 12, 2024
Figure 1 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 2 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 3 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Figure 4 for Secrets of RLHF in Large Language Models Part II: Reward Modeling
Viaarxiv icon

Two Step Joint Model for Drug Drug Interaction Extraction

Add code
Aug 28, 2020
Figure 1 for Two Step Joint Model for Drug Drug Interaction Extraction
Figure 2 for Two Step Joint Model for Drug Drug Interaction Extraction
Figure 3 for Two Step Joint Model for Drug Drug Interaction Extraction
Figure 4 for Two Step Joint Model for Drug Drug Interaction Extraction
Viaarxiv icon