Xiaohua Wang

BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation

Jan 30, 2026

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Jan 08, 2026

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Jan 07, 2026

Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling

Nov 13, 2025

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Oct 16, 2025

Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading

Oct 06, 2025

Enhancing Model Privacy in Federated Learning with Random Masking and Quantization

Aug 27, 2025

Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning

Jun 04, 2025

Improving Continual Pre-training Through Seamless Data Packing

May 29, 2025

RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data

May 25, 2025