Picture for Min Yang

Min Yang

RxSafeBench: Identifying Medication Safety Issues of Large Language Models in Simulated Consultation

Add code
Nov 06, 2025
Viaarxiv icon

One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

Add code
Oct 30, 2025
Viaarxiv icon

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Add code
Oct 23, 2025
Viaarxiv icon

Beyond Retrieval-Ranking: A Multi-Agent Cognitive Decision Framework for E-Commerce Search

Add code
Oct 23, 2025
Viaarxiv icon

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Add code
Aug 18, 2025
Figure 1 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 2 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 3 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 4 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Viaarxiv icon

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices

Add code
Aug 10, 2025
Viaarxiv icon

ReasoningGuard: Safeguarding Large Reasoning Models with Inference-time Safety Aha Moments

Add code
Aug 06, 2025
Viaarxiv icon

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Add code
Jun 11, 2025
Viaarxiv icon

Training Superior Sparse Autoencoders for Instruct Models

Add code
Jun 09, 2025
Viaarxiv icon

CLaSp: In-Context Layer Skip for Self-Speculative Decoding

Add code
May 30, 2025
Viaarxiv icon