Andong Deng

Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

Nov 25, 2024
Via arXiv

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level

Nov 15, 2024
Via arXiv

Order-aware Interactive Segmentation

Oct 17, 2024
Via arXiv

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

Jan 07, 2024
Via arXiv

Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

Apr 27, 2023
Via arXiv

A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition

Mar 23, 2023
Via arXiv

Problem Behaviors Recognition in Videos using Language-Assisted Deep Learning Model for Children with Autism

Nov 17, 2022
Via arXiv

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Mar 29, 2022
Via arXiv

Inadequately Pre-trained Models are Better Feature Extractors

Mar 09, 2022
Via arXiv

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Dec 08, 2021
Via arXiv