Picture for Zhiyuan Sun

Zhiyuan Sun

When Less is More: The LLM Scaling Paradox in Context Compression

Add code
Feb 10, 2026
Viaarxiv icon

TRE: Encouraging Exploration in the Trust Region

Add code
Feb 03, 2026
Viaarxiv icon

Advancing General-Purpose Reasoning Models with Modular Gradient Surgery

Add code
Feb 02, 2026
Viaarxiv icon

H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic

Add code
Mar 13, 2025
Figure 1 for H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic
Figure 2 for H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic
Figure 3 for H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic
Figure 4 for H2-MARL: Multi-Agent Reinforcement Learning for Pareto Optimality in Hospital Capacity Strain and Human Mobility during Epidemic
Viaarxiv icon

Enhancing Agent Learning through World Dynamics Modeling

Add code
Jul 25, 2024
Figure 1 for Enhancing Agent Learning through World Dynamics Modeling
Figure 2 for Enhancing Agent Learning through World Dynamics Modeling
Figure 3 for Enhancing Agent Learning through World Dynamics Modeling
Figure 4 for Enhancing Agent Learning through World Dynamics Modeling
Viaarxiv icon

OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following

Add code
Mar 05, 2024
Figure 1 for OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Figure 2 for OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Figure 3 for OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Figure 4 for OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Viaarxiv icon

Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games

Add code
Dec 01, 2023
Figure 1 for Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Figure 2 for Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Figure 3 for Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Figure 4 for Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Viaarxiv icon

Augmented smartphone bilirubinometer enabled by a mobile app that turns smartphone into multispectral imager

Add code
Mar 04, 2023
Figure 1 for Augmented smartphone bilirubinometer enabled by a mobile app that turns smartphone into multispectral imager
Figure 2 for Augmented smartphone bilirubinometer enabled by a mobile app that turns smartphone into multispectral imager
Figure 3 for Augmented smartphone bilirubinometer enabled by a mobile app that turns smartphone into multispectral imager
Viaarxiv icon