Picture for Qingqing Mao

Qingqing Mao

Application-Driven Pedagogical Knowledge Optimization of Open-Source LLMs via Reinforcement Learning and Supervised Fine-Tuning

Add code
Apr 07, 2026
Viaarxiv icon

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

Add code
Apr 07, 2026
Viaarxiv icon

OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

Add code
Feb 29, 2024
Viaarxiv icon

Optimal discharge of patients from intensive care via a data-driven policy learning framework

Add code
Dec 17, 2021
Figure 1 for Optimal discharge of patients from intensive care via a data-driven policy learning framework
Figure 2 for Optimal discharge of patients from intensive care via a data-driven policy learning framework
Figure 3 for Optimal discharge of patients from intensive care via a data-driven policy learning framework
Figure 4 for Optimal discharge of patients from intensive care via a data-driven policy learning framework
Viaarxiv icon