Zihao Zhang

School of Cyber Science and Engineering, Southeast University

OpenHuEval: Evaluating Large Language Model on Hungarian Specifics

Mar 27, 2025

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

Mar 20, 2025

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Mar 13, 2025

Aesthetic Matters in Music Perception for Image Stylization: A Emotion-driven Music-to-Visual Manipulation

Jan 03, 2025

World-Consistent Data Generation for Vision-and-Language Navigation

Dec 09, 2024

HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective

Oct 10, 2024

MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework

Oct 02, 2024

MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry

Sep 14, 2024

Prompt-based Visual Alignment for Zero-shot Policy Transfer

Jun 05, 2024

Problematizing AI Omnipresence in Landscape Architecture

Jun 03, 2024