Picture for Qiyuan Zhang

Qiyuan Zhang

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Add code
Oct 07, 2024
Figure 1 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 2 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 3 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Figure 4 for RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Viaarxiv icon

Collaborative Performance Prediction for Large Language Models

Add code
Jul 01, 2024
Figure 1 for Collaborative Performance Prediction for Large Language Models
Figure 2 for Collaborative Performance Prediction for Large Language Models
Figure 3 for Collaborative Performance Prediction for Large Language Models
Figure 4 for Collaborative Performance Prediction for Large Language Models
Viaarxiv icon

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

Add code
Oct 14, 2021
Figure 1 for NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset
Figure 2 for NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset
Figure 3 for NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset
Figure 4 for NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset
Viaarxiv icon

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

Add code
Sep 18, 2021
Figure 1 for MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
Figure 2 for MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
Figure 3 for MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
Figure 4 for MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers
Viaarxiv icon

Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning

Add code
Jun 07, 2021
Figure 1 for Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
Figure 2 for Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
Figure 3 for Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
Figure 4 for Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
Viaarxiv icon

Distributional Soft Actor Critic for Risk Sensitive Learning

Add code
Apr 30, 2020
Figure 1 for Distributional Soft Actor Critic for Risk Sensitive Learning
Figure 2 for Distributional Soft Actor Critic for Risk Sensitive Learning
Figure 3 for Distributional Soft Actor Critic for Risk Sensitive Learning
Figure 4 for Distributional Soft Actor Critic for Risk Sensitive Learning
Viaarxiv icon