Manjie Xu

SRC-gAudio: Sampling-Rate-Controlled Audio Generation

Oct 09, 2024
Via arXiv

Towards Diverse and Efficient Audio Captioning via Diffusion Models

Sep 14, 2024
Via arXiv

STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment

Sep 13, 2024
Via arXiv

Video-to-Audio Generation with Hidden Alignment

Jul 10, 2024
Via arXiv

Active Reasoning in an Open-World Environment

Nov 03, 2023
Via arXiv

MEWL: Few-shot multimodal word learning with referential uncertainty

Jun 01, 2023
Via arXiv

To think inside the box, or to think out of the box? Scientific discovery via the reciprocation of insights and concepts

Dec 04, 2022
Via arXiv

On the Complexity of Bayesian Generalization

Nov 26, 2022
Via arXiv

EST: Evaluating Scientific Thinking in Artificial Agents

Jun 18, 2022
Via arXiv