Picture for Weisen Jiang

Weisen Jiang

RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models

Add code
Sep 30, 2024
Viaarxiv icon

Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

Add code
Aug 15, 2024
Viaarxiv icon

Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Add code
Jul 04, 2024
Viaarxiv icon

MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders

Add code
Jul 02, 2024
Viaarxiv icon

Rendering Graphs for Graph Reasoning in Multimodal Large Language Models

Add code
Feb 03, 2024
Viaarxiv icon

Large Language Models as Visual Cross-Domain Learners

Add code
Jan 06, 2024
Viaarxiv icon

Effective and Parameter-Efficient Reusing Fine-Tuned Models

Add code
Oct 04, 2023
Viaarxiv icon

Domain-Guided Conditional Diffusion Model for Unsupervised Domain Adaptation

Add code
Sep 23, 2023
Viaarxiv icon

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Add code
Sep 22, 2023
Viaarxiv icon

A Scale-Invariant Task Balancing Approach for Multi-Task Learning

Add code
Aug 23, 2023
Viaarxiv icon