Picture for Ming Yan

Ming Yan

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Add code
Mar 31, 2025
Viaarxiv icon

ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate

Add code
Mar 27, 2025
Viaarxiv icon

WritingBench: A Comprehensive Benchmark for Generative Writing

Add code
Mar 07, 2025
Viaarxiv icon

MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio

Add code
Mar 07, 2025
Viaarxiv icon

Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration

Add code
Feb 25, 2025
Viaarxiv icon

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Add code
Feb 21, 2025
Viaarxiv icon

A Training-free LLM-based Approach to General Chinese Character Error Correction

Add code
Feb 21, 2025
Viaarxiv icon

Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization

Add code
Feb 20, 2025
Viaarxiv icon

Complex Physics-Informed Neural Network

Add code
Feb 07, 2025
Viaarxiv icon

Dark Distillation: Backdooring Distilled Datasets without Accessing Raw Data

Add code
Feb 06, 2025
Viaarxiv icon