Picture for Guanlin Liu

Guanlin Liu

Knowledge Distillation with Training Wheels

Add code
Feb 24, 2025
Viaarxiv icon

DNMDR: Dynamic Networks and Multi-view Drug Representations for Safe Medication Recommendation

Add code
Jan 15, 2025
Figure 1 for DNMDR: Dynamic Networks and Multi-view Drug Representations for Safe Medication Recommendation
Figure 2 for DNMDR: Dynamic Networks and Multi-view Drug Representations for Safe Medication Recommendation
Figure 3 for DNMDR: Dynamic Networks and Multi-view Drug Representations for Safe Medication Recommendation
Figure 4 for DNMDR: Dynamic Networks and Multi-view Drug Representations for Safe Medication Recommendation
Viaarxiv icon

Flaming-hot Initiation with Regular Execution Sampling for Large Language Models

Add code
Oct 28, 2024
Figure 1 for Flaming-hot Initiation with Regular Execution Sampling for Large Language Models
Figure 2 for Flaming-hot Initiation with Regular Execution Sampling for Large Language Models
Figure 3 for Flaming-hot Initiation with Regular Execution Sampling for Large Language Models
Figure 4 for Flaming-hot Initiation with Regular Execution Sampling for Large Language Models
Viaarxiv icon

Process Supervision-Guided Policy Optimization for Code Generation

Add code
Oct 23, 2024
Figure 1 for Process Supervision-Guided Policy Optimization for Code Generation
Figure 2 for Process Supervision-Guided Policy Optimization for Code Generation
Figure 3 for Process Supervision-Guided Policy Optimization for Code Generation
Figure 4 for Process Supervision-Guided Policy Optimization for Code Generation
Viaarxiv icon

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

Add code
Oct 11, 2024
Viaarxiv icon

Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems

Add code
Nov 01, 2023
Figure 1 for Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems
Figure 2 for Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems
Figure 3 for Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems
Viaarxiv icon

Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty

Add code
Jul 20, 2023
Viaarxiv icon

Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning

Add code
Jul 15, 2023
Viaarxiv icon

Efficient Action Poisoning Attacks on Linear Contextual Bandits

Add code
Dec 10, 2021
Figure 1 for Efficient Action Poisoning Attacks on Linear Contextual Bandits
Figure 2 for Efficient Action Poisoning Attacks on Linear Contextual Bandits
Figure 3 for Efficient Action Poisoning Attacks on Linear Contextual Bandits
Viaarxiv icon

Provably Efficient Black-Box Action Poisoning Attacks Against Reinforcement Learning

Add code
Oct 26, 2021
Figure 1 for Provably Efficient Black-Box Action Poisoning Attacks Against Reinforcement Learning
Viaarxiv icon