Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qihui Yang

PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching

Oct 25, 2025

Ali Vosoughi, Yongyi Zang, Qihui Yang, Nathan Peak, Randal Leistikow, Chenliang Xu

Abstract:Room impulse response (RIR) generation remains a critical challenge for creating immersive virtual acoustic environments. Current methods suffer from two fundamental limitations: the scarcity of full-band RIR datasets and the inability of existing models to generate acoustically accurate responses from diverse input modalities. We present PromptReverb, a two-stage generative framework that addresses these challenges. Our approach combines a variational autoencoder that upsamples band-limited RIRs to full-band quality (48 kHz), and a conditional diffusion transformer model based on rectified flow matching that generates RIRs from descriptions in natural language. Empirical evaluation demonstrates that PromptReverb produces RIRs with superior perceptual quality and acoustic accuracy compared to existing methods, achieving 8.8% mean RT60 error compared to -37% for widely used baselines and yielding more realistic room-acoustic parameters. Our method enables practical applications in virtual reality, architectural acoustics, and audio production where flexible, high-quality RIR synthesis is essential.

* 9 pages, 2 figures, 4 tables

Via

Access Paper or Ask Questions

Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Oct 31, 2024

Xuyuan Liu, Yinghao Cai, Qihui Yang, Yujun Yan

Figure 1 for Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Figure 2 for Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Figure 3 for Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Figure 4 for Exploring Consistency in Graph Representations:from Graph Kernels to Graph Neural Networks

Abstract:Graph Neural Networks (GNNs) have emerged as a dominant approach in graph representation learning, yet they often struggle to capture consistent similarity relationships among graphs. While graph kernel methods such as the Weisfeiler-Lehman subtree (WL-subtree) and Weisfeiler-Lehman optimal assignment (WLOA) kernels are effective in capturing similarity relationships, they rely heavily on predefined kernels and lack sufficient non-linearity for more complex data patterns. Our work aims to bridge the gap between neural network methods and kernel approaches by enabling GNNs to consistently capture relational structures in their learned representations. Given the analogy between the message-passing process of GNNs and WL algorithms, we thoroughly compare and analyze the properties of WL-subtree and WLOA kernels. We find that the similarities captured by WLOA at different iterations are asymptotically consistent, ensuring that similar graphs remain similar in subsequent iterations, thereby leading to superior performance over the WL-subtree kernel. Inspired by these findings, we conjecture that the consistency in the similarities of graph representations across GNN layers is crucial in capturing relational structures and enhancing graph classification performance. Thus, we propose a loss to enforce the similarity of graph representations to be consistent across different layers. Our empirical analysis verifies our conjecture and shows that our proposed consistency loss can significantly enhance graph classification performance across several GNN backbones on various datasets.

* NeurIPS 2024

Via

Access Paper or Ask Questions

Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Jun 07, 2024

Zheng Huang, Qihui Yang, Dawei Zhou, Yujun Yan

Figure 1 for Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Figure 2 for Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Figure 3 for Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Figure 4 for Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning

Abstract:Although most graph neural networks (GNNs) can operate on graphs of any size, their classification performance often declines on graphs larger than those encountered during training. Existing methods insufficiently address the removal of size information from graph representations, resulting in sub-optimal performance and reliance on backbone models. In response, we propose DISGEN, a novel and model-agnostic framework designed to disentangle size factors from graph representations. DISGEN employs size- and task-invariant augmentations and introduces a decoupling loss that minimizes shared information in hidden representations, with theoretical guarantees for its effectiveness. Our empirical results show that DISGEN outperforms the state-of-the-art models by up to 6% on real-world datasets, underscoring its effectiveness in enhancing the size generalizability of GNNs. Our codes are available at: https://github.com/GraphmindDartmouth/DISGEN.

Via

Access Paper or Ask Questions