Picture for Tat-Seng Chua

Tat-Seng Chua

Extending Visual Dynamics for Video-to-Music Generation

Add code
Apr 10, 2025
Viaarxiv icon

Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

Add code
Apr 09, 2025
Viaarxiv icon

LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph

Add code
Apr 04, 2025
Viaarxiv icon

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Add code
Mar 31, 2025
Viaarxiv icon

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Add code
Mar 30, 2025
Viaarxiv icon

Exploring Training and Inference Scaling Laws in Generative Retrieval

Add code
Mar 24, 2025
Viaarxiv icon

Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark

Add code
Mar 24, 2025
Viaarxiv icon

Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling

Add code
Mar 19, 2025
Viaarxiv icon

Continual Multimodal Contrastive Learning

Add code
Mar 19, 2025
Viaarxiv icon

Universal Scene Graph Generation

Add code
Mar 19, 2025
Viaarxiv icon