Bingsheng Yao

Northeastern University, USA

DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans

Oct 16, 2025

SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis

Aug 21, 2025

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation

Jul 28, 2025

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Apr 13, 2025

AgentA/B: Automated and Scalable Web A/B Testing with Interactive LLM Agents

Apr 13, 2025

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design

Feb 18, 2025

Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents

Feb 18, 2025

WatchGuardian: Enabling User-Defined Personalized Just-in-Time Intervention on Smartwatch

Feb 09, 2025

RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care

Feb 09, 2025

"It Felt Like I Was Left in the Dark": Exploring Information Needs and Design Opportunities for Family Caregivers of Older Adult Patients in Critical Care Settings

Feb 07, 2025