Picture for Pengfei Zhou

Pengfei Zhou

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Add code
Mar 16, 2025
Viaarxiv icon

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Add code
Mar 16, 2025
Viaarxiv icon

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Add code
Mar 09, 2025
Viaarxiv icon

Text-driven 3D Human Generation via Contrastive Preference Optimization

Add code
Feb 13, 2025
Viaarxiv icon

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Add code
Jan 03, 2025
Figure 1 for EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Figure 2 for EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Figure 3 for EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Figure 4 for EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation
Viaarxiv icon

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation

Add code
Dec 01, 2024
Viaarxiv icon

Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles

Add code
Jul 10, 2024
Figure 1 for Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles
Figure 2 for Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles
Figure 3 for Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles
Figure 4 for Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles
Viaarxiv icon

FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

Add code
Jun 11, 2024
Figure 1 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 2 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 3 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 4 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Viaarxiv icon

Unleashing the Power of Unlabeled Data: A Self-supervised Learning Framework for Cyber Attack Detection in Smart Grids

Add code
May 22, 2024
Figure 1 for Unleashing the Power of Unlabeled Data: A Self-supervised Learning Framework for Cyber Attack Detection in Smart Grids
Figure 2 for Unleashing the Power of Unlabeled Data: A Self-supervised Learning Framework for Cyber Attack Detection in Smart Grids
Figure 3 for Unleashing the Power of Unlabeled Data: A Self-supervised Learning Framework for Cyber Attack Detection in Smart Grids
Figure 4 for Unleashing the Power of Unlabeled Data: A Self-supervised Learning Framework for Cyber Attack Detection in Smart Grids
Viaarxiv icon

DiffHarmony: Latent Diffusion Model Meets Image Harmonization

Add code
Apr 09, 2024
Figure 1 for DiffHarmony: Latent Diffusion Model Meets Image Harmonization
Figure 2 for DiffHarmony: Latent Diffusion Model Meets Image Harmonization
Figure 3 for DiffHarmony: Latent Diffusion Model Meets Image Harmonization
Figure 4 for DiffHarmony: Latent Diffusion Model Meets Image Harmonization
Viaarxiv icon