Picture for Chaojie Wang

Chaojie Wang

Member, IEEE

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Add code
Jul 02, 2025
Viaarxiv icon

Skywork Open Reasoner 1 Technical Report

Add code
May 29, 2025
Viaarxiv icon

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization

Add code
Dec 24, 2024
Figure 1 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 2 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 3 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 4 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Viaarxiv icon

Mars-PO: Multi-Agent Reasoning System Preference Optimization

Add code
Nov 28, 2024
Figure 1 for Mars-PO: Multi-Agent Reasoning System Preference Optimization
Figure 2 for Mars-PO: Multi-Agent Reasoning System Preference Optimization
Figure 3 for Mars-PO: Multi-Agent Reasoning System Preference Optimization
Figure 4 for Mars-PO: Multi-Agent Reasoning System Preference Optimization
Viaarxiv icon

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Add code
Oct 24, 2024
Viaarxiv icon

Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks

Add code
Oct 13, 2024
Figure 1 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 2 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 3 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 4 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Viaarxiv icon

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

Add code
Sep 05, 2024
Viaarxiv icon

MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

Add code
Jun 20, 2024
Figure 1 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 2 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 3 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 4 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Viaarxiv icon

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Add code
Jun 20, 2024
Viaarxiv icon

Latent Logic Tree Extraction for Event Sequence Explanation from LLMs

Add code
Jun 04, 2024
Viaarxiv icon