Picture for Xing Wu

Xing Wu

SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Add code
Jan 26, 2025
Viaarxiv icon

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Add code
Jan 20, 2025
Viaarxiv icon

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Add code
Oct 21, 2024
Viaarxiv icon

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Add code
Oct 03, 2024
Viaarxiv icon

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Add code
Aug 20, 2024
Viaarxiv icon

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Add code
Jul 13, 2024
Viaarxiv icon

Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model

Add code
May 30, 2024
Viaarxiv icon

Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Add code
Jan 20, 2024
Viaarxiv icon

InfoEntropy Loss to Mitigate Bias of Learning Difficulties for Generative Language Models

Add code
Nov 01, 2023
Viaarxiv icon

HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus

Add code
Sep 06, 2023
Viaarxiv icon