Picture for Maosong Sun

Maosong Sun

Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values

Add code
Jan 13, 2025
Viaarxiv icon

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Add code
Jan 13, 2025
Viaarxiv icon

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation

Add code
Jan 11, 2025
Viaarxiv icon

Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation

Add code
Dec 25, 2024
Viaarxiv icon

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Add code
Dec 18, 2024
Viaarxiv icon

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Add code
Dec 10, 2024
Viaarxiv icon

Densing Law of LLMs

Add code
Dec 05, 2024
Figure 1 for Densing Law of LLMs
Figure 2 for Densing Law of LLMs
Figure 3 for Densing Law of LLMs
Figure 4 for Densing Law of LLMs
Viaarxiv icon

A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs

Add code
Dec 02, 2024
Figure 1 for A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
Figure 2 for A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
Figure 3 for A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
Figure 4 for A Top-down Graph-based Tool for Modeling Classical Semantic Maps: A Crosslinguistic Case Study of Supplementary Adverbs
Viaarxiv icon

KBAlign: Efficient Self Adaptation on Specific Knowledge Bases

Add code
Nov 25, 2024
Figure 1 for KBAlign: Efficient Self Adaptation on Specific Knowledge Bases
Figure 2 for KBAlign: Efficient Self Adaptation on Specific Knowledge Bases
Figure 3 for KBAlign: Efficient Self Adaptation on Specific Knowledge Bases
Figure 4 for KBAlign: Efficient Self Adaptation on Specific Knowledge Bases
Viaarxiv icon

Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

Add code
Nov 21, 2024
Viaarxiv icon