Yuhang Jiang

Doubly Mild Generalization for Offline Reinforcement Learning

Nov 13, 2024
Via arXiv

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Oct 03, 2024
Via arXiv

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks

Aug 20, 2024
Via arXiv

LLM-Empowered State Representation for Reinforcement Learning

Jul 18, 2024
Via arXiv

End-to-End $n$-ary Relation Extraction for Combination Drug Therapies

Mar 29, 2023
Via arXiv

COVID-19 event extraction from Twitter via extractive question answering with continuous prompts

Mar 22, 2023
Via arXiv

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

Oct 15, 2022
Via arXiv

Improved lightweight identification of agricultural diseases based on MobileNetV3

Jul 19, 2022
Via arXiv

Wasserstein Unsupervised Reinforcement Learning

Oct 15, 2021
Via arXiv

Reducing Conservativeness Oriented Offline Reinforcement Learning

Feb 27, 2021
Via arXiv