Weiming Ren

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Dec 01, 2024
Via arXiv

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Nov 11, 2024
Via arXiv

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Jun 04, 2024
Via arXiv

Video Diffusion Models: A Survey

May 06, 2024
Via arXiv

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Mar 22, 2024
Via arXiv

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Feb 28, 2024
Via arXiv

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Feb 06, 2024
Via arXiv

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Nov 27, 2023
Via arXiv

HiCu: Leveraging Hierarchy for Curriculum Learning in Automated ICD Coding

Aug 03, 2022
Via arXiv