Daoming Lyu

Auburn University

PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent

Feb 13, 2022

TOPS: Transition-based VOlatility-controlled Policy Search and its Global Convergence

Jan 24, 2022

TDM: Trustworthy Decision-Making via Interpretability Enhancement

Aug 13, 2021

Variance-Reduced Off-Policy Memory-Efficient Policy Search

Sep 14, 2020

Stable and Efficient Policy Evaluation

Jun 06, 2020

A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming

Sep 18, 2019

PACMAN: A Planner-Actor-Critic Architecture for Human-Centered Planning and Learning

Aug 01, 2019

Knowledge-Based Sequential Decision-Making Under Uncertainty

May 16, 2019

SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning

Nov 05, 2018

A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

Nov 01, 2018