Picture for Ji Hou

Ji Hou

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Add code
Dec 13, 2024
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Pixel-Space Post-Training of Latent Diffusion Models

Add code
Sep 26, 2024
Viaarxiv icon

UniPlane: Unified Plane Detection and Reconstruction from Posed Monocular Videos

Add code
Jul 04, 2024
Viaarxiv icon

ControlRoom3D: Room Generation using Semantic Proxy Rooms

Add code
Dec 08, 2023
Figure 1 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 2 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 3 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 4 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Viaarxiv icon

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Add code
Dec 06, 2023
Figure 1 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 2 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 3 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 4 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Viaarxiv icon

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Add code
Sep 27, 2023
Viaarxiv icon

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Add code
Jul 27, 2023
Figure 1 for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Figure 2 for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Figure 3 for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Figure 4 for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Viaarxiv icon

Rotation-Invariant Transformer for Point Cloud Matching

Add code
Mar 25, 2023
Viaarxiv icon

Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors

Add code
Feb 28, 2023
Viaarxiv icon