Picture for Xing Wu

Xing Wu

FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection

Add code
Mar 14, 2025
Viaarxiv icon

LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models

Add code
Mar 04, 2025
Viaarxiv icon

SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Add code
Jan 26, 2025
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Add code
Oct 21, 2024
Viaarxiv icon

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Add code
Oct 03, 2024
Viaarxiv icon

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Add code
Aug 20, 2024
Viaarxiv icon

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Add code
Jul 13, 2024
Viaarxiv icon

Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Add code
May 30, 2024
Viaarxiv icon

Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Add code
Jan 20, 2024
Viaarxiv icon