Picture for Maosong Sun

Maosong Sun

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

Add code
Nov 08, 2024
Viaarxiv icon

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Add code
Nov 06, 2024
Viaarxiv icon

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Add code
Nov 04, 2024
Viaarxiv icon

Exploring Tokenization Methods for Multitrack Sheet Music Generation

Add code
Oct 23, 2024
Viaarxiv icon

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement

Add code
Oct 21, 2024
Viaarxiv icon

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

Add code
Oct 17, 2024
Figure 1 for RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Figure 2 for RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Figure 3 for RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Figure 4 for RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Viaarxiv icon

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Add code
Oct 17, 2024
Figure 1 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 2 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 3 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Figure 4 for CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
Viaarxiv icon

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

Add code
Oct 16, 2024
Figure 1 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 2 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 3 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 4 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Viaarxiv icon

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Add code
Oct 14, 2024
Figure 1 for VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Figure 2 for VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Figure 3 for VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Figure 4 for VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
Viaarxiv icon

LLM$\times$MapReduce: Simplified Long-Sequence Processing using Large Language Models

Add code
Oct 12, 2024
Viaarxiv icon