Maosong Sun

LLM×MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Apr 08, 2025
Via arXiv

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Apr 04, 2025
Via arXiv

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

Mar 31, 2025
Via arXiv

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Mar 31, 2025
Via arXiv

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Mar 17, 2025
Via arXiv

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Mar 17, 2025
Via arXiv

Towards Self-Improving Systematic Cognition for Next-Generation Foundation MLLMs

Mar 16, 2025
Via arXiv

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Mar 12, 2025
Via arXiv

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

Feb 26, 2025
Via arXiv

Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models

Feb 26, 2025
Via arXiv