Jie Zhou

LaCo: Efficient Layer-wise Compression of Visual Tokens for Multimodal Large Language Models

Jul 03, 2025
Via arXiv

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

Jul 03, 2025
Via arXiv

GenWorld: Towards Detecting AI-generated Real-world Simulation Videos

Jun 12, 2025
Via arXiv

SpectralAR: Spectral Autoregressive Visual Generation

Jun 12, 2025
Via arXiv

Vision Generalist Model: A Survey

Jun 11, 2025
Via arXiv

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

Jun 11, 2025
Via arXiv

Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings

Jun 10, 2025
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

Jun 06, 2025
Via arXiv

Dissecting Long Reasoning Models: An Empirical Study

Jun 05, 2025
Via arXiv