Zihao Zhang

School of Cyber Science and Engineering, Southeast University

World-Consistent Data Generation for Vision-and-Language Navigation

Dec 09, 2024
Via arXiv

HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective

Oct 10, 2024
Via arXiv

MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework

Oct 02, 2024
Via arXiv

MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry

Sep 14, 2024
Via arXiv

Prompt-based Visual Alignment for Zero-shot Policy Transfer

Jun 05, 2024
Via arXiv

Problematizing AI Omnipresence in Landscape Architecture

Jun 03, 2024
Via arXiv

MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion

May 30, 2024
Via arXiv

Luban: Building Open-Ended Creative Agents via Autonomous Embodied Verification

May 24, 2024
Via arXiv

LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data

Feb 21, 2024
Via arXiv

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

Nov 30, 2023
Via arXiv