Picture for Anton van den Hengel

Anton van den Hengel

the University of Adelaide

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

Add code
Mar 31, 2025
Viaarxiv icon

MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams

Add code
Mar 26, 2025
Viaarxiv icon

Analytic DAG Constraints for Differentiable DAG Learning

Add code
Mar 24, 2025
Viaarxiv icon

Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction

Add code
Mar 17, 2025
Viaarxiv icon

Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing

Add code
Mar 15, 2025
Viaarxiv icon

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

Add code
Mar 12, 2025
Viaarxiv icon

Interactive Medical Image Analysis with Concept-based Similarity Reasoning

Add code
Mar 11, 2025
Viaarxiv icon

RandLoRA: Full-rank parameter-efficient fine-tuning of large models

Add code
Feb 03, 2025
Viaarxiv icon

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs

Add code
Jan 11, 2025
Viaarxiv icon

EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing

Add code
Dec 12, 2024
Figure 1 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 2 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 3 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 4 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Viaarxiv icon