Mingyi Hong

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

Mar 15, 2025
Via arXiv

Effectively Steer LLM To Follow Preference via Building Confident Directions

Mar 04, 2025
Via arXiv

LUME: LLM Unlearning with Multitask Evaluations

Feb 20, 2025
Via arXiv

Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs

Feb 13, 2025
Via arXiv

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Feb 13, 2025
Via arXiv

Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond

Feb 07, 2025
Via arXiv

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Jan 31, 2025
Via arXiv

Safeguarding Text-to-Image Generation via Inference-Time Prompt-Noise Optimization

Dec 05, 2024
Via arXiv

Downlink MIMO Channel Estimation from Bits: Recoverability and Algorithm

Nov 25, 2024
Via arXiv

Unraveling the Gradient Descent Dynamics of Transformers

Nov 12, 2024
Via arXiv