Picture for Yuheng Huang

Yuheng Huang

LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation

Add code
Oct 07, 2024
Viaarxiv icon

LeCov: Multi-level Testing Criteria for Large Language Models

Add code
Aug 20, 2024
Figure 1 for LeCov: Multi-level Testing Criteria for Large Language Models
Figure 2 for LeCov: Multi-level Testing Criteria for Large Language Models
Figure 3 for LeCov: Multi-level Testing Criteria for Large Language Models
Figure 4 for LeCov: Multi-level Testing Criteria for Large Language Models
Viaarxiv icon

Active Testing of Large Language Model via Multi-Stage Sampling

Add code
Aug 07, 2024
Viaarxiv icon

Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture

Add code
Jul 10, 2024
Viaarxiv icon

Vortex under Ripplet: An Empirical Study of RAG-enabled Applications

Add code
Jul 06, 2024
Figure 1 for Vortex under Ripplet: An Empirical Study of RAG-enabled Applications
Figure 2 for Vortex under Ripplet: An Empirical Study of RAG-enabled Applications
Figure 3 for Vortex under Ripplet: An Empirical Study of RAG-enabled Applications
Figure 4 for Vortex under Ripplet: An Empirical Study of RAG-enabled Applications
Viaarxiv icon

Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing

Add code
Apr 14, 2024
Viaarxiv icon

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

Add code
Apr 12, 2024
Viaarxiv icon

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Add code
Mar 06, 2024
Viaarxiv icon

LUNA: A Model-Based Universal Analysis Framework for Large Language Models

Add code
Oct 22, 2023
Figure 1 for LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Figure 2 for LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Figure 3 for LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Figure 4 for LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Viaarxiv icon

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

Add code
Jul 16, 2023
Figure 1 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 2 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 3 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Figure 4 for Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Viaarxiv icon