Picture for Ming Lin

Ming Lin

DRPO: Efficient Reasoning via Decoupled Reward Policy Optimization

Add code
Oct 06, 2025
Viaarxiv icon

HART: Human Aligned Reconstruction Transformer

Add code
Sep 30, 2025
Figure 1 for HART: Human Aligned Reconstruction Transformer
Figure 2 for HART: Human Aligned Reconstruction Transformer
Figure 3 for HART: Human Aligned Reconstruction Transformer
Figure 4 for HART: Human Aligned Reconstruction Transformer
Viaarxiv icon

Time-Aware World Model for Adaptive Prediction and Control

Add code
Jun 10, 2025
Viaarxiv icon

DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization

Add code
May 18, 2025
Viaarxiv icon

Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws

Add code
May 13, 2025
Viaarxiv icon

AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction

Add code
Feb 25, 2025
Viaarxiv icon

CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems

Add code
Feb 25, 2025
Viaarxiv icon

Search for Efficient Large Language Models

Add code
Sep 25, 2024
Figure 1 for Search for Efficient Large Language Models
Figure 2 for Search for Efficient Large Language Models
Figure 3 for Search for Efficient Large Language Models
Figure 4 for Search for Efficient Large Language Models
Viaarxiv icon

Prompt Mixing in Diffusion Models using the Black Scholes Algorithm

Add code
May 22, 2024
Figure 1 for Prompt Mixing in Diffusion Models using the Black Scholes Algorithm
Figure 2 for Prompt Mixing in Diffusion Models using the Black Scholes Algorithm
Figure 3 for Prompt Mixing in Diffusion Models using the Black Scholes Algorithm
Figure 4 for Prompt Mixing in Diffusion Models using the Black Scholes Algorithm
Viaarxiv icon

Merino: Entropy-driven Design for Generative Language Models on IoT Devices

Add code
Feb 28, 2024
Viaarxiv icon