Picture for Hanghang Ma

Hanghang Ma

Forge-and-Quench: Enhancing Image Generation for Higher Fidelity in Unified Multimodal Models

Add code
Jan 08, 2026
Viaarxiv icon

LongCat-Image Technical Report

Add code
Dec 08, 2025
Figure 1 for LongCat-Image Technical Report
Figure 2 for LongCat-Image Technical Report
Figure 3 for LongCat-Image Technical Report
Figure 4 for LongCat-Image Technical Report
Viaarxiv icon

Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input

Add code
Aug 28, 2024
Figure 1 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 2 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 3 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 4 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Viaarxiv icon