Picture for Zhiyuan Fan

Zhiyuan Fan

CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering

Add code
Jan 30, 2025
Viaarxiv icon

SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Add code
Jan 26, 2025
Viaarxiv icon

Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent

Add code
Dec 07, 2024
Viaarxiv icon

From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents

Add code
Nov 12, 2024
Figure 1 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 2 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 3 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Figure 4 for From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents
Viaarxiv icon

On the Optimality of Dilated Entropy and Lower Bounds for Online Learning in Extensive-Form Games

Add code
Oct 30, 2024
Viaarxiv icon

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Add code
Jun 10, 2024
Figure 1 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 2 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 3 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 4 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Viaarxiv icon

Calibrated Self-Rewarding Vision Language Models

Add code
May 23, 2024
Viaarxiv icon

Settling Constant Regrets in Linear Markov Decision Processes

Add code
Apr 16, 2024
Viaarxiv icon

Efficient Data Learning for Open Information Extraction with Pre-trained Language Models

Add code
Oct 23, 2023
Viaarxiv icon

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

Add code
Mar 16, 2023
Figure 1 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 2 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 3 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Figure 4 for On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Viaarxiv icon