Picture for Yong Dai

Yong Dai

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Add code
Feb 02, 2026
Viaarxiv icon

Optimizing Agentic Reasoning with Retrieval via Synthetic Semantic Information Gain Reward

Add code
Jan 31, 2026
Viaarxiv icon

Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test

Add code
Jan 07, 2026
Viaarxiv icon

Auto-US: An Ultrasound Video Diagnosis Agent Using Video Classification Framework and LLMs

Add code
Nov 11, 2025
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Add code
May 23, 2025
Figure 1 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 2 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 3 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 4 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Viaarxiv icon

MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

Add code
Nov 05, 2024
Figure 1 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 2 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 3 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 4 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Viaarxiv icon

TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting

Add code
Oct 07, 2024
Figure 1 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 2 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 3 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Figure 4 for TimeCNN: Refining Cross-Variable Interaction on Time Point for Time Series Forecasting
Viaarxiv icon

IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation

Add code
Sep 27, 2024
Figure 1 for IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
Figure 2 for IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
Figure 3 for IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
Figure 4 for IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
Viaarxiv icon

Prompt Customization for Continual Learning

Add code
Apr 28, 2024
Figure 1 for Prompt Customization for Continual Learning
Figure 2 for Prompt Customization for Continual Learning
Figure 3 for Prompt Customization for Continual Learning
Figure 4 for Prompt Customization for Continual Learning
Viaarxiv icon