Picture for Donghuo Zeng

Donghuo Zeng

Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning

Add code
Jan 16, 2025
Viaarxiv icon

Top-down Activity Representation Learning for Video Question Answering

Add code
Sep 12, 2024
Viaarxiv icon

Multi-object event graph representation learning for Video Question Answering

Add code
Sep 12, 2024
Viaarxiv icon

Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome

Add code
Apr 21, 2024
Viaarxiv icon

Anchor-aware Deep Metric Learning for Audio-visual Retrieval

Add code
Apr 21, 2024
Viaarxiv icon

Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval

Add code
Oct 20, 2023
Viaarxiv icon

VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning

Add code
Sep 27, 2023
Viaarxiv icon

Topic-switch adapted Japanese Dialogue System based on PLATO-2

Add code
Feb 22, 2023
Viaarxiv icon

Do I Have Your Attention: A Large Scale Engagement Prediction Dataset and Baselines

Add code
Feb 01, 2023
Viaarxiv icon

Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval

Add code
Nov 07, 2022
Viaarxiv icon