Picture for Yanru Chen

Yanru Chen

Muon is Scalable for LLM Training

Add code
Feb 24, 2025
Viaarxiv icon

MoBA: Mixture of Block Attention for Long-Context LLMs

Add code
Feb 18, 2025
Viaarxiv icon

A Comparison of Self-Supervised Pretraining Approaches for Predicting Disease Risk from Chest Radiograph Images

Add code
Jun 15, 2023
Viaarxiv icon

Prompt-Based Metric Learning for Few-Shot NER

Add code
Nov 08, 2022
Viaarxiv icon