Picture for Shufan Wang

Shufan Wang

On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations

Add code
Nov 22, 2024
Viaarxiv icon

SODA: a Soft Origami Dynamic utensil for Assisted feeding

Add code
Oct 25, 2024
Viaarxiv icon

Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

Add code
Apr 25, 2024
Viaarxiv icon

Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints

Add code
Dec 22, 2023
Viaarxiv icon

Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing

Add code
May 24, 2023
Viaarxiv icon

KNN-LM Does Not Improve Open-ended Text Generation

Add code
May 24, 2023
Figure 1 for KNN-LM Does Not Improve Open-ended Text Generation
Figure 2 for KNN-LM Does Not Improve Open-ended Text Generation
Figure 3 for KNN-LM Does Not Improve Open-ended Text Generation
Figure 4 for KNN-LM Does Not Improve Open-ended Text Generation
Viaarxiv icon

You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

Add code
Oct 28, 2022
Viaarxiv icon

Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding

Add code
Oct 07, 2022
Figure 1 for Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding
Figure 2 for Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding
Figure 3 for Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding
Figure 4 for Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding
Viaarxiv icon

Modeling Exemplification in Long-form Question Answering via Retrieval

Add code
May 19, 2022
Figure 1 for Modeling Exemplification in Long-form Question Answering via Retrieval
Figure 2 for Modeling Exemplification in Long-form Question Answering via Retrieval
Figure 3 for Modeling Exemplification in Long-form Question Answering via Retrieval
Figure 4 for Modeling Exemplification in Long-form Question Answering via Retrieval
Viaarxiv icon

Model-free Reinforcement Learning for Content Caching at the Wireless Edge via Restless Bandits

Add code
Feb 26, 2022
Figure 1 for Model-free Reinforcement Learning for Content Caching at the Wireless Edge via Restless Bandits
Figure 2 for Model-free Reinforcement Learning for Content Caching at the Wireless Edge via Restless Bandits
Figure 3 for Model-free Reinforcement Learning for Content Caching at the Wireless Edge via Restless Bandits
Figure 4 for Model-free Reinforcement Learning for Content Caching at the Wireless Edge via Restless Bandits
Viaarxiv icon