Picture for Sunbowen Lee

Sunbowen Lee

Finding RELIEF: Shaping Reasoning Behavior without Reasoning Supervision via Belief Engineering

Add code
Jan 20, 2026
Viaarxiv icon

M-MRE: Extending the Mutual Reinforcement Effect to Multimodal Information Extraction

Add code
Apr 24, 2025
Viaarxiv icon

Counterfactual experience augmented off-policy reinforcement learning

Add code
Mar 18, 2025
Figure 1 for Counterfactual experience augmented off-policy reinforcement learning
Figure 2 for Counterfactual experience augmented off-policy reinforcement learning
Figure 3 for Counterfactual experience augmented off-policy reinforcement learning
Figure 4 for Counterfactual experience augmented off-policy reinforcement learning
Viaarxiv icon

xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking

Add code
Jan 30, 2025
Figure 1 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 2 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 3 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Figure 4 for xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking
Viaarxiv icon

MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control

Add code
Dec 24, 2024
Figure 1 for MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control
Figure 2 for MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control
Figure 3 for MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control
Figure 4 for MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control
Viaarxiv icon