Picture for Jun He

Jun He

ByteDance

Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation

Add code
Feb 02, 2026
Viaarxiv icon

Large-scale EM Benchmark for Multi-Organelle Instance Segmentation in the Wild

Add code
Jan 18, 2026
Viaarxiv icon

ActAvatar: Temporally-Aware Precise Action Control for Talking Avatars

Add code
Dec 22, 2025
Viaarxiv icon

MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Add code
Dec 20, 2025
Figure 1 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 2 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 3 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 4 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Viaarxiv icon

UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective

Add code
Sep 26, 2025
Figure 1 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 2 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 3 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 4 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Viaarxiv icon

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation

Add code
Aug 13, 2025
Figure 1 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 2 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 3 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 4 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Viaarxiv icon

Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations

Add code
Jul 16, 2025
Figure 1 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 2 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 3 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Figure 4 for Quantize More, Lose Less: Autoregressive Generation from Residually Quantized Speech Representations
Viaarxiv icon

Estimate Hitting Time by Hitting Probability for Elitist Evolutionary Algorithms

Add code
Jun 18, 2025
Figure 1 for Estimate Hitting Time by Hitting Probability for Elitist Evolutionary Algorithms
Figure 2 for Estimate Hitting Time by Hitting Probability for Elitist Evolutionary Algorithms
Figure 3 for Estimate Hitting Time by Hitting Probability for Elitist Evolutionary Algorithms
Figure 4 for Estimate Hitting Time by Hitting Probability for Elitist Evolutionary Algorithms
Viaarxiv icon

SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting

Add code
Jun 17, 2025
Figure 1 for SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting
Figure 2 for SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting
Figure 3 for SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting
Figure 4 for SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting
Viaarxiv icon

OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers

Add code
May 27, 2025
Viaarxiv icon