Picture for Xin Men

Xin Men

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

Kimi Linear: An Expressive, Efficient Attention Architecture

Add code
Oct 30, 2025
Viaarxiv icon

Kimi K2: Open Agentic Intelligence

Add code
Jul 28, 2025
Figure 1 for Kimi K2: Open Agentic Intelligence
Figure 2 for Kimi K2: Open Agentic Intelligence
Figure 3 for Kimi K2: Open Agentic Intelligence
Figure 4 for Kimi K2: Open Agentic Intelligence
Viaarxiv icon

Baichuan-M1: Pushing the Medical Capability of Large Language Models

Add code
Feb 18, 2025
Figure 1 for Baichuan-M1: Pushing the Medical Capability of Large Language Models
Figure 2 for Baichuan-M1: Pushing the Medical Capability of Large Language Models
Figure 3 for Baichuan-M1: Pushing the Medical Capability of Large Language Models
Figure 4 for Baichuan-M1: Pushing the Medical Capability of Large Language Models
Viaarxiv icon

Exploring Context Window of Large Language Models via Decomposed Positional Vectors

Add code
May 28, 2024
Figure 1 for Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Figure 2 for Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Figure 3 for Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Figure 4 for Exploring Context Window of Large Language Models via Decomposed Positional Vectors
Viaarxiv icon

Base of RoPE Bounds Context Length

Add code
May 23, 2024
Figure 1 for Base of RoPE Bounds Context Length
Figure 2 for Base of RoPE Bounds Context Length
Figure 3 for Base of RoPE Bounds Context Length
Figure 4 for Base of RoPE Bounds Context Length
Viaarxiv icon

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Add code
Mar 07, 2024
Figure 1 for ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Figure 2 for ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Figure 3 for ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Figure 4 for ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Viaarxiv icon

Baichuan 2: Open Large-scale Language Models

Add code
Sep 20, 2023
Viaarxiv icon