Picture for Zicheng Zhang

Zicheng Zhang

Dual Alignment Maximin Optimization for Offline Model-based RL

Add code
Feb 02, 2025
Viaarxiv icon

VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes

Add code
Jan 14, 2025
Figure 1 for VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
Figure 2 for VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
Figure 3 for VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
Figure 4 for VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
Viaarxiv icon

IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models

Add code
Jan 01, 2025
Figure 1 for IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
Figure 2 for IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
Figure 3 for IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
Figure 4 for IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models
Viaarxiv icon

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis

Add code
Nov 29, 2024
Figure 1 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 2 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 3 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 4 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Viaarxiv icon

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

Add code
Nov 25, 2024
Figure 1 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 2 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 3 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Figure 4 for Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Viaarxiv icon

MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis

Add code
Nov 18, 2024
Viaarxiv icon

DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration

Add code
Nov 15, 2024
Figure 1 for DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration
Figure 2 for DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration
Figure 3 for DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration
Figure 4 for DR-BFR: Degradation Representation with Diffusion Models for Blind Face Restoration
Viaarxiv icon

VQA$^2$:Visual Question Answering for Video Quality Assessment

Add code
Nov 06, 2024
Figure 1 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 2 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 3 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Figure 4 for VQA$^2$:Visual Question Answering for Video Quality Assessment
Viaarxiv icon

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Add code
Oct 07, 2024
Figure 1 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 2 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 3 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Figure 4 for R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Viaarxiv icon

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Add code
Sep 30, 2024
Figure 1 for Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Figure 2 for Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Figure 3 for Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Figure 4 for Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
Viaarxiv icon