Picture for Chaojie Wang

Chaojie Wang

Member, IEEE

Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization

Add code
Dec 24, 2024
Figure 1 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 2 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 3 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Figure 4 for Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization
Viaarxiv icon

Mars-PO: Multi-Agent Reasoning System Preference Optimization

Add code
Nov 28, 2024
Viaarxiv icon

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Add code
Oct 24, 2024
Viaarxiv icon

Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks

Add code
Oct 13, 2024
Figure 1 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 2 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 3 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Figure 4 for Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks
Viaarxiv icon

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

Add code
Sep 05, 2024
Viaarxiv icon

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Add code
Jun 20, 2024
Viaarxiv icon

MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

Add code
Jun 20, 2024
Figure 1 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 2 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 3 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Figure 4 for MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
Viaarxiv icon

Latent Logic Tree Extraction for Event Sequence Explanation from LLMs

Add code
Jun 04, 2024
Viaarxiv icon

keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM

Add code
Dec 31, 2023
Viaarxiv icon

Knowledge-Aware Bayesian Deep Topic Model

Add code
Sep 20, 2022
Figure 1 for Knowledge-Aware Bayesian Deep Topic Model
Figure 2 for Knowledge-Aware Bayesian Deep Topic Model
Figure 3 for Knowledge-Aware Bayesian Deep Topic Model
Figure 4 for Knowledge-Aware Bayesian Deep Topic Model
Viaarxiv icon