Picture for Dake Zhang

Dake Zhang

Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning

Add code
Jul 24, 2024
Viaarxiv icon

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Add code
Jul 10, 2024
Viaarxiv icon

ReadProbe: A Demo of Retrieval-Enhanced Large Language Models to Support Lateral Reading

Add code
Jun 13, 2023
Viaarxiv icon

A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems

Add code
Nov 21, 2014
Figure 1 for A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems
Figure 2 for A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems
Figure 3 for A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems
Figure 4 for A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems
Viaarxiv icon