Picture for Yuexing Hao

Yuexing Hao

Automating SKILL.md Generation for Computer-Using Agents via Interaction Trajectory Mining

Add code
Jun 18, 2026
Viaarxiv icon

UXBench: Measuring the Actionability of LLM-Generated UX Critiques

Add code
Jun 15, 2026
Viaarxiv icon

Exploration of Foundation Model-Based Robots in Patient and Elderly Care

Add code
Jun 08, 2026
Viaarxiv icon

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

Add code
May 07, 2026
Viaarxiv icon

ProbeLLM: Automating Principled Diagnosis of LLM Failures

Add code
Feb 13, 2026
Viaarxiv icon

PC2P: Multi-Agent Path Finding via Personalized-Enhanced Communication and Crowd Perception

Add code
Jan 06, 2026
Viaarxiv icon

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

Add code
May 29, 2025
Viaarxiv icon

MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models

Add code
May 16, 2025
Figure 1 for MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models
Figure 2 for MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models
Figure 3 for MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models
Figure 4 for MedGUIDE: Benchmarking Clinical Decision-Making in Large Language Models
Viaarxiv icon

AI-Based Teat Shape and Skin Condition Prediction for Dairy Management

Add code
Dec 22, 2024
Viaarxiv icon

Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams

Add code
Sep 26, 2024
Figure 1 for Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Figure 2 for Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Figure 3 for Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Figure 4 for Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Viaarxiv icon