Picture for Chenyan Xiong

Chenyan Xiong

Microsoft Research

AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning

Add code
Jun 18, 2025
Viaarxiv icon

Semi-structured LLM Reasoners Can Be Rigorously Audited

Add code
May 30, 2025
Viaarxiv icon

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling

Add code
May 28, 2025
Viaarxiv icon

FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models

Add code
May 26, 2025
Viaarxiv icon

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Add code
May 25, 2025
Viaarxiv icon

Aligning Web Query Generation with Ranking Objectives via Direct Preference Optimization

Add code
May 25, 2025
Viaarxiv icon

PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning

Add code
Feb 21, 2025
Viaarxiv icon

Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews

Add code
Feb 21, 2025
Viaarxiv icon

Data-Efficient Pretraining with Group-Level Data Influence Modeling

Add code
Feb 20, 2025
Viaarxiv icon

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Add code
Feb 19, 2025
Viaarxiv icon