Picture for Cordelia Schmid

Cordelia Schmid

Thoth

Memory-Modular Classification: Learning to Generalize with Memory Replacement

Add code
Apr 08, 2025
Viaarxiv icon

InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Add code
Apr 07, 2025
Viaarxiv icon

Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs

Add code
Mar 31, 2025
Viaarxiv icon

HORT: Monocular Hand-held Objects Reconstruction with Transformers

Add code
Mar 27, 2025
Viaarxiv icon

Online 3D Scene Reconstruction Using Neural Object Priors

Add code
Mar 24, 2025
Viaarxiv icon

Large-scale Pre-training for Grounded Video Caption Generation

Add code
Mar 13, 2025
Viaarxiv icon

FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

Add code
Mar 06, 2025
Viaarxiv icon

What Are You Doing? A Closer Look at Controllable Human Video Generation

Add code
Mar 06, 2025
Viaarxiv icon

Causal Lifting of Neural Representations: Zero-Shot Generalization for Causal Inferences

Add code
Feb 10, 2025
Viaarxiv icon

Neptune: The Long Orbit to Benchmarking Long Video Understanding

Add code
Dec 12, 2024
Figure 1 for Neptune: The Long Orbit to Benchmarking Long Video Understanding
Figure 2 for Neptune: The Long Orbit to Benchmarking Long Video Understanding
Figure 3 for Neptune: The Long Orbit to Benchmarking Long Video Understanding
Figure 4 for Neptune: The Long Orbit to Benchmarking Long Video Understanding
Viaarxiv icon