Jiajun Wu

Re-thinking Temporal Search for Long-Form Video Understanding

Apr 03, 2025
Via arXiv

X-Capture: An Open-Source Portable Device for Multi-Sensory Learning

Apr 03, 2025
Via arXiv

WorldScore: A Unified Evaluation Benchmark for World Generation

Apr 01, 2025
Via arXiv

Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals

Mar 25, 2025
Via arXiv

Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization

Mar 14, 2025
Via arXiv

BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities

Mar 07, 2025
Via arXiv

FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video

Mar 06, 2025
Via arXiv

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

Mar 06, 2025
Via arXiv

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Mar 04, 2025
Via arXiv

Predicate Hierarchies Improve Few-Shot State Classification

Feb 18, 2025
Via arXiv