Mingyu Xu

Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement

Jan 08, 2026

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Dec 31, 2025

Understanding Transformer from the Perspective of Associative Memory

May 26, 2025

Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN

May 22, 2025

Baichuan-M1: Pushing the Medical Capability of Large Language Models

Feb 18, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation

Feb 11, 2025

KV Shifting Attention Enhances Language Modeling

Nov 29, 2024

Class-balanced Open-set Semi-supervised Object Detection for Medical Images

Aug 22, 2024

Base of RoPE Bounds Context Length

May 23, 2024

Multimodal Fusion with Pre-Trained Model Features in Affective Behaviour Analysis In-the-wild

Mar 22, 2024