Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nirvana Meratnia

Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Feb 10, 2025

Timo Fudala, Vasileios Tsouvalas, Nirvana Meratnia

Figure 1 for Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Figure 2 for Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Figure 3 for Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Figure 4 for Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Abstract:Multimodal transformers integrate diverse data types like images, audio, and text, advancing tasks such as audio-visual understanding and image-text retrieval; yet their high parameterization limits deployment on resource-constrained edge devices. Split Learning (SL), which partitions models at a designated cut-layer to offload compute-intensive operations to the server, offers a promising approach for distributed training of multimodal transformers, though its application remains underexplored. We present MPSL, a parallel SL approach for computational efficient fine-tuning of multimodal transformers in a distributed manner, while eliminating label sharing, client synchronization, and per-client sub-model management. MPSL employs lightweight client-side tokenizers and a unified modality-agnostic encoder, allowing flexible adaptation to task-specific needs. Our evaluation across 7 multimodal datasets demonstrates that MPSL matches or outperforms Federated Learning, reduces client-side computations by 250x, and achieves superior scalability in communication cost with model growth. Through extensive analysis, we highlight task suitability, trade-offs, and scenarios where MPSL excels, inspiring further exploration.

* 10 pages, 4 figures, submitted to IJCAI 2025

Via

Access Paper or Ask Questions

Many-Task Federated Fine-Tuning via Unified Task Vectors

Feb 10, 2025

Vasileios Tsouvalas, Tanir Ozcelebi, Nirvana Meratnia

Figure 1 for Many-Task Federated Fine-Tuning via Unified Task Vectors

Figure 2 for Many-Task Federated Fine-Tuning via Unified Task Vectors

Figure 3 for Many-Task Federated Fine-Tuning via Unified Task Vectors

Figure 4 for Many-Task Federated Fine-Tuning via Unified Task Vectors

Abstract:Federated Learning (FL) traditionally assumes homogeneous client tasks; however, in real-world scenarios, clients often specialize in diverse tasks, introducing task heterogeneity. To address this challenge, Many-Task FL (MaT-FL) has emerged, enabling clients to collaborate effectively despite task diversity. Existing MaT-FL approaches rely on client grouping or personalized layers, requiring the server to manage individual models and failing to account for clients handling multiple tasks. We propose MaTU, a MaT-FL approach that enables joint learning of task vectors across clients, eliminating the need for clustering or client-specific weight storage at the server. Our method introduces a novel aggregation mechanism that determines task similarity based on the direction of clients task vectors and constructs a unified task vector encapsulating all tasks. To address task-specific requirements, we augment the unified task vector with lightweight modulators that facilitate knowledge transfer among related tasks while disentangling dissimilar ones. Evaluated across 30 datasets, MaTU achieves superior performance over state-of-the-art MaT-FL approaches, with results comparable to per-task fine-tuning, while delivering significant communication savings.

* 10 pages, 6 figures, submitted in IJCAI 2025

Via

Access Paper or Ask Questions

EncCluster: Scalable Functional Encryption in Federated Learning through Weight Clustering and Probabilistic Filters

Jun 13, 2024

Vasileios Tsouvalas, Samaneh Mohammadi, Ali Balador, Tanir Ozcelebi, Francesco Flammini, Nirvana Meratnia

Abstract:Federated Learning (FL) enables model training across decentralized devices by communicating solely local model updates to an aggregation server. Although such limited data sharing makes FL more secure than centralized approached, FL remains vulnerable to inference attacks during model update transmissions. Existing secure aggregation approaches rely on differential privacy or cryptographic schemes like Functional Encryption (FE) to safeguard individual client data. However, such strategies can reduce performance or introduce unacceptable computational and communication overheads on clients running on edge devices with limited resources. In this work, we present EncCluster, a novel method that integrates model compression through weight clustering with recent decentralized FE and privacy-enhancing data encoding using probabilistic filters to deliver strong privacy guarantees in FL without affecting model performance or adding unnecessary burdens to clients. We performed a comprehensive evaluation, spanning various datasets and architectures, to demonstrate EncCluster's scalability across encryption levels. Our findings reveal that EncCluster significantly reduces communication costs - below even conventional FedAvg - and accelerates encryption by more than four times over all baselines; at the same time, it maintains high model accuracy and enhanced privacy assurances.

* 21 pages, 4 figures

Via

Access Paper or Ask Questions

Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

Jan 29, 2024

Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi, Nirvana Meratnia

Figure 1 for Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

Figure 2 for Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

Figure 3 for Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

Figure 4 for Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation

Abstract:Federated Learning (FL) is a promising technique for the collaborative training of deep neural networks across multiple devices while preserving data privacy. Despite its potential benefits, FL is hindered by excessive communication costs due to repeated server-client communication during training. To address this challenge, model compression techniques, such as sparsification and weight clustering are applied, which often require modifying the underlying model aggregation schemes or involve cumbersome hyperparameter tuning, with the latter not only adjusts the model's compression rate but also limits model's potential for continuous improvement over growing data. In this paper, we propose FedCompress, a novel approach that combines dynamic weight clustering and server-side knowledge distillation to reduce communication costs while learning highly generalizable models. Through a comprehensive evaluation on diverse public datasets, we demonstrate the efficacy of our approach compared to baselines in terms of communication costs and inference speed. We will make our implementation public upon acceptance.

* 9 pages, 2 figures, Accepted on ICASSP 2024

Via

Access Paper or Ask Questions

FedCode: Communication-Efficient Federated Learning via Transferring Codebooks

Nov 15, 2023

Saeed Khalilian, Vasileios Tsouvalas, Tanir Ozcelebi, Nirvana Meratnia

Abstract:Federated Learning (FL) is a distributed machine learning paradigm that enables learning models from decentralized local data. While FL offers appealing properties for clients' data privacy, it imposes high communication burdens for exchanging model weights between a server and the clients. Existing approaches rely on model compression techniques, such as pruning and weight clustering to tackle this. However, transmitting the entire set of weight updates at each federated round, even in a compressed format, limits the potential for a substantial reduction in communication volume. We propose FedCode where clients transmit only codebooks, i.e., the cluster centers of updated model weight values. To ensure a smooth learning curve and proper calibration of clusters between the server and the clients, FedCode periodically transfers model weights after multiple rounds of solely communicating codebooks. This results in a significant reduction in communication volume between clients and the server in both directions, without imposing significant computational overhead on the clients or leading to major performance degradation of the models. We evaluate the effectiveness of FedCode using various publicly available datasets with ResNet-20 and MobileNet backbone model architectures. Our evaluations demonstrate a 12.2-fold data transmission reduction on average while maintaining a comparable model performance with an average accuracy loss of 1.3% compared to FedAvg. Further validation of FedCode performance under non-IID data distributions showcased an average accuracy loss of 2.0% compared to FedAvg while achieving approximately a 12.7-fold data transmission reduction.

Via

Access Paper or Ask Questions

Intelligent Blockage Recognition using Cellular mmWave Beamforming Data: Feasibility Study

Oct 30, 2022

Bram van Berlo, Yang Miao, Rizqi Hersyandika, Nirvana Meratnia, Tanir Ozcelebi, Andre Kokkeler, Sofie Pollin

Abstract:Joint Communication and Sensing (JCAS) is envisioned for 6G cellular networks, where sensing the operation environment, especially in presence of humans, is as important as the high-speed wireless connectivity. Sensing, and subsequently recognizing blockage types, is an initial step towards signal blockage avoidance. In this context, we investigate the feasibility of using human motion recognition as a surrogate task for blockage type recognition through a set of hypothesis validation experiments using both qualitative and quantitative analysis (visual inspection and hyperparameter tuning of deep learning (DL) models, respectively). A surrogate task is useful for DL model testing and/or pre-training, thereby requiring a low amount of data to be collected from the eventual JCAS environment. Therefore, we collect and use a small dataset from a 26 GHz cellular multi-user communication device with hybrid beamforming. The data is converted into Doppler Frequency Spectrum (DFS) and used for hypothesis validations. Our research shows that (i) the presence of domain shift between data used for learning and inference requires use of DL models that can successfully handle it, (ii) DFS input data dilution to increase dataset volume should be avoided, (iii) a small volume of input data is not enough for reasonable inference performance, (iv) higher sensing resolution, causing lower sensitivity, should be handled by doing more activities/gestures per frame and lowering sampling rate, and (v) a higher reported sampling rate to STFT during pre-processing may increase performance, but should always be tested on a per learning task basis.

* accepted for presentation at the IEEE GLOBECOM 2022 conference

Via

Access Paper or Ask Questions

Federated Learning with Noisy Labels

Aug 19, 2022

Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi, Nirvana Meratnia

Figure 1 for Federated Learning with Noisy Labels

Figure 2 for Federated Learning with Noisy Labels

Figure 3 for Federated Learning with Noisy Labels

Figure 4 for Federated Learning with Noisy Labels

Abstract:Federated Learning (FL) is a distributed machine learning paradigm that enables learning models from decentralized private datasets, where the labeling effort is entrusted to the clients. While most existing FL approaches assume high-quality labels are readily available on users' devices; in reality, label noise can naturally occur in FL and follows a non-i.i.d. distribution among clients. Due to the non-iid-ness challenges, existing state-of-the-art centralized approaches exhibit unsatisfactory performance, while previous FL studies rely on data exchange or repeated server-side aid to improve model's performance. Here, we propose FedLN, a framework to deal with label noise across different FL training stages; namely, FL initialization, on-device model training, and server model aggregation. Specifically, FedLN computes per-client noise-level estimation in a single federated round and improves the models' performance by correcting (or limiting the effect of) noisy samples. Extensive experiments on various publicly available vision and audio datasets demonstrate a 24% improvement on average compared to other existing methods for a label noise level of 70%. We further validate the efficiency of FedLN in human-annotated real-world noisy datasets and report a 9% increase on average in models' recognition rate, highlighting that FedLN can be useful for improving FL services provided to everyday users.

Via

Access Paper or Ask Questions

Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Feb 05, 2022

Vasileios Tsouvalas, Tanir Ozcelebi, Nirvana Meratnia

Figure 1 for Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Figure 2 for Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Figure 3 for Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Figure 4 for Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

Abstract:Speech Emotion Recognition (SER) refers to the recognition of human emotions from natural speech. If done accurately, it can offer a number of benefits in building human-centered context-aware intelligent systems. Existing SER approaches are largely centralized, without considering users' privacy. Federated Learning (FL) is a distributed machine learning paradigm dealing with decentralization of privacy-sensitive personal data. In this paper, we present a privacy-preserving and data-efficient SER approach by utilizing the concept of FL. To the best of our knowledge, this is the first federated SER approach, which utilizes self-training learning in conjunction with federated learning to exploit both labeled and unlabeled on-device data. Our experimental evaluations on the IEMOCAP dataset shows that our federated approach can learn generalizable SER models even under low availability of data labels and highly non-i.i.d. distributions. We show that our approach with as few as 10% labeled data, on average, can improve the recognition rate by 8.67% compared to the fully-supervised federated counterparts.

* arXiv admin note: text overlap with arXiv:2107.06877

Via

Access Paper or Ask Questions

Millimeter Wave Sensing: A Review of Application Pipelines and Building Blocks

Dec 26, 2020

Bram van Berlo, Amany Elkelany, Tanir Ozcelebi, Nirvana Meratnia

Figure 1 for Millimeter Wave Sensing: A Review of Application Pipelines and Building Blocks

Figure 2 for Millimeter Wave Sensing: A Review of Application Pipelines and Building Blocks

Figure 3 for Millimeter Wave Sensing: A Review of Application Pipelines and Building Blocks

Figure 4 for Millimeter Wave Sensing: A Review of Application Pipelines and Building Blocks

Abstract:The increasing bandwidth requirement of new wireless applications has lead to standardization of the millimeter wave spectrum for high-speed wireless communication. The millimeter wave spectrum is part of 5G and covers frequencies between 30 and 300 GHz corresponding to wavelengths ranging from 10 to 1 mm. Although millimeter wave is often considered as a communication medium, it has also proved to be an excellent 'sensor', thanks to its narrow beams, operation across a wide bandwidth, and interaction with atmospheric constituents. In this paper, which is to the best of our knowledge the first review that completely covers millimeter wave sensing application pipelines, we provide a comprehensive overview and analysis of different basic application pipeline building blocks, including hardware, algorithms, analytical models, and model evaluation techniques. The review also provides a taxonomy that highlights different millimeter wave sensing application domains. By performing a thorough analysis, complying with the systematic literature review methodology and reviewing 165 papers, we not only extend previous investigations focused only on communication aspects of the millimeter wave technology and using millimeter wave technology for active imaging, but also highlight scientific and technological challenges and trends, and provide a future perspective for applications of millimeter wave as a sensing technology.

* 36 pages, submitted to IEEE Sensors Journal

Via

Access Paper or Ask Questions