Picture for Yongdong Zhang

Yongdong Zhang

OmniPrism: Learning Disentangled Visual Concept for Image Generation

Add code
Dec 16, 2024
Viaarxiv icon

LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation

Add code
Dec 13, 2024
Figure 1 for LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation
Figure 2 for LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation
Figure 3 for LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation
Figure 4 for LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation
Viaarxiv icon

A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions

Add code
Dec 12, 2024
Viaarxiv icon

T-SVG: Text-Driven Stereoscopic Video Generation

Add code
Dec 12, 2024
Viaarxiv icon

Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing

Add code
Nov 23, 2024
Figure 1 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 2 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 3 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 4 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Viaarxiv icon

It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment

Add code
Nov 16, 2024
Figure 1 for It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment
Figure 2 for It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment
Figure 3 for It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment
Figure 4 for It Takes Two: Accurate Gait Recognition in the Wild via Cross-granularity Alignment
Viaarxiv icon

MILP-StuDio: MILP Instance Generation via Block Structure Decomposition

Add code
Oct 31, 2024
Viaarxiv icon

Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models

Add code
Oct 19, 2024
Figure 1 for Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Figure 2 for Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Figure 3 for Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Figure 4 for Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models
Viaarxiv icon

MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting

Add code
Oct 10, 2024
Figure 1 for MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Figure 2 for MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Figure 3 for MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Figure 4 for MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Viaarxiv icon

Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation

Add code
Aug 25, 2024
Figure 1 for Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation
Figure 2 for Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation
Figure 3 for Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation
Figure 4 for Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation
Viaarxiv icon