Abstract:Code-switching is a widely prevalent linguistic phenomenon in multilingual societies like India. Building speech-to-text models for code-switched speech is challenging due to limited availability of datasets. In this work, we focus on the problem of spoken translation (ST) of code-switched speech in Indian languages to English text. We present a new end-to-end model architecture COSTA that scaffolds on pretrained automatic speech recognition (ASR) and machine translation (MT) modules (that are more widely available for many languages). Speech and ASR text representations are fused using an aligned interleaving scheme and are fed further as input to a pretrained MT module; the whole pipeline is then trained end-to-end for spoken translation using synthetically created ST data. We also release a new evaluation benchmark for code-switched Bengali-English, Hindi-English, Marathi-English and Telugu- English speech to English text. COSTA significantly outperforms many competitive cascaded and end-to-end multimodal baselines by up to 3.5 BLEU points.
Abstract:The problem of sparse array design for dual-function radar-communications is investigated. Our goal is to design a sparse array which can simultaneously shape desired beam responses and serve multiple downlink users with the required signal-to-interference-plus-noise ratio levels. Besides, we also take into account the limitation of the radiated power by each antenna. The problem is formulated as a quadratically constrained quadratic program with a joint-sparsity-promoting regularization, which is NP-hard. The resulting problem is solved by the consensus alternating direction method of multipliers, which enjoys parallel implementation. Numerical simulations exhibit the effectiveness and superiority of the proposed method which leads to a more power-efficient solution.
Abstract:As a promising technology in beyond-5G (B5G) and 6G, dual-function radar-communication (DFRC) aims to ensure both radar sensing and communication on a single integrated platform with unified signaling schemes. To achieve accurate sensing and reliable communication, large-scale arrays are anticipated to be implemented in such systems, which brings out the prominent issues on hardware cost and power consumption. To address these issues, hybrid beamforming (HBF), beyond its successful deployment in communication-only systems, could be a promising approach in the emerging DFRC ones. In this article, we investigate the development of the HBF techniques on the DFRC system in a self-contained manner. Specifically, we first introduce the basics of the HBF based DFRC system, where the system model and different receive modes are discussed with focus. Then we illustrate the corresponding design principles, which span from the performance metrics and optimization formulations to the design approaches and our preliminary results. Finally, potential extension and key research opportunities, such as the combination with the reconfigurable intelligent surface, are discussed concisely.