Zelun Luo

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Jun 03, 2026
Via arXiv

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Dec 04, 2024
Via arXiv

Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)

Jun 03, 2024
Via arXiv

Differentially Private Video Activity Recognition

Jun 27, 2023
Via arXiv

Vision-Based Gait Analysis for Senior Care

Dec 01, 2018
Via arXiv

DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency

Sep 05, 2018
Via arXiv

Graph Distillation for Action Detection with Privileged Modalities

Jul 27, 2018
Via arXiv

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance

Apr 24, 2018
Via arXiv

Label Efficient Learning of Transferable Representations across Domains and Tasks

Nov 30, 2017
Via arXiv

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Apr 11, 2017
Via arXiv