Picture for Ming Ding

Ming Ding

From Principles to Practice: A Deep Dive into AI Ethics and Regulations

Add code
Dec 06, 2024
Viaarxiv icon

Face De-identification: State-of-the-art Methods and Comparative Studies

Add code
Nov 15, 2024
Figure 1 for Face De-identification: State-of-the-art Methods and Comparative Studies
Figure 2 for Face De-identification: State-of-the-art Methods and Comparative Studies
Figure 3 for Face De-identification: State-of-the-art Methods and Comparative Studies
Figure 4 for Face De-identification: State-of-the-art Methods and Comparative Studies
Viaarxiv icon

DreamPolish: Domain Score Distillation With Progressive Geometry Generation

Add code
Nov 03, 2024
Figure 1 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 2 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 3 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 4 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Viaarxiv icon

MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction

Add code
Sep 14, 2024
Figure 1 for MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
Figure 2 for MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
Figure 3 for MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
Figure 4 for MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction
Viaarxiv icon

CogVLM2: Visual Language Models for Image and Video Understanding

Add code
Aug 29, 2024
Figure 1 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 2 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 3 for CogVLM2: Visual Language Models for Image and Video Understanding
Figure 4 for CogVLM2: Visual Language Models for Image and Video Understanding
Viaarxiv icon

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Add code
Aug 12, 2024
Viaarxiv icon

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Add code
Aug 12, 2024
Figure 1 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 2 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 3 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 4 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Viaarxiv icon

AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models

Add code
Jun 14, 2024
Viaarxiv icon

LVBench: An Extreme Long Video Understanding Benchmark

Add code
Jun 12, 2024
Figure 1 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 2 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 3 for LVBench: An Extreme Long Video Understanding Benchmark
Figure 4 for LVBench: An Extreme Long Video Understanding Benchmark
Viaarxiv icon

Memorization in deep learning: A survey

Add code
Jun 06, 2024
Figure 1 for Memorization in deep learning: A survey
Figure 2 for Memorization in deep learning: A survey
Figure 3 for Memorization in deep learning: A survey
Figure 4 for Memorization in deep learning: A survey
Viaarxiv icon