Ziyu Shao

Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization

Oct 25, 2024

Learning to Schedule Online Tasks with Bandit Feedback

Feb 26, 2024

Service Chain Composition with Failures in NFV Systems: A Game-Theoretic Perspective

Aug 01, 2020

Online Task Scheduling for Fog Computing with Multi-Resource Fairness

Aug 01, 2020

Green Offloading in Fog-Assisted IoT Systems: An Online Perspective Integrating Learning and Control

Aug 01, 2020

Data-Driven Bandit Learning for Proactive Cache Placement in Fog-Assisted IoT Systems

Aug 01, 2020