Picture for Haoyu Ma

Haoyu Ma

Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs

Add code
Jan 28, 2026
Viaarxiv icon

SurfSLAM: Sim-to-Real Underwater Stereo Reconstruction For Real-Time SLAM

Add code
Jan 15, 2026
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

MoCha: Towards Movie-Grade Talking Character Synthesis

Add code
Mar 30, 2025
Figure 1 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 2 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 3 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 4 for MoCha: Towards Movie-Grade Talking Character Synthesis
Viaarxiv icon

HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Add code
Mar 24, 2025
Figure 1 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 2 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 3 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 4 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Viaarxiv icon

OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework

Add code
Mar 03, 2025
Viaarxiv icon

DirectorLLM for Human-Centric Video Generation

Add code
Dec 19, 2024
Figure 1 for DirectorLLM for Human-Centric Video Generation
Figure 2 for DirectorLLM for Human-Centric Video Generation
Figure 3 for DirectorLLM for Human-Centric Video Generation
Figure 4 for DirectorLLM for Human-Centric Video Generation
Viaarxiv icon

A Mamba Foundation Model for Time Series Forecasting

Add code
Nov 05, 2024
Figure 1 for A Mamba Foundation Model for Time Series Forecasting
Figure 2 for A Mamba Foundation Model for Time Series Forecasting
Figure 3 for A Mamba Foundation Model for Time Series Forecasting
Figure 4 for A Mamba Foundation Model for Time Series Forecasting
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Add code
Dec 19, 2023
Figure 1 for MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Figure 2 for MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Figure 3 for MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Figure 4 for MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Viaarxiv icon