Zhiyuan Fan

Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive Adversaries

Apr 01, 2025

Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction

Mar 13, 2025

CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering

Jan 30, 2025

SedarEval: Automated Evaluation using Self-Adaptive Rubrics

Jan 26, 2025

Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent

Dec 07, 2024

From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents

Nov 12, 2024

On the Optimality of Dilated Entropy and Lower Bounds for Online Learning in Extensive-Form Games

Oct 30, 2024

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Jun 10, 2024

Calibrated Self-Rewarding Vision Language Models

May 23, 2024

Settling Constant Regrets in Linear Markov Decision Processes

Apr 16, 2024