Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sourav Mukherjee

Sparse Incremental Aggregation in Satellite Federated Learning

Jan 20, 2025

Nasrin Razmi, Sourav Mukherjee, Bho Matthiesen, Armin Dekorsy, Petar Popovski

Abstract:This paper studies Federated Learning (FL) in low Earth orbit (LEO) satellite constellations, where satellites are connected via intra-orbit inter-satellite links (ISLs) to their neighboring satellites. During the FL training process, satellites in each orbit forward gradients from nearby satellites, which are eventually transferred to the parameter server (PS). To enhance the efficiency of the FL training process, satellites apply in-network aggregation, referred to as incremental aggregation. In this work, the gradient sparsification methods from [1] are applied to satellite scenarios to improve bandwidth efficiency during incremental aggregation. The numerical results highlight an increase of over 4 x in bandwidth efficiency as the number of satellites in the orbital plane increases.

* This paper has been accepted for the 14th International ITG Conference on Systems, Communications and Coding (SCC 2025)

Via

Access Paper or Ask Questions

Sparse Incremental Aggregation in Multi-Hop Federated Learning

Jul 25, 2024

Sourav Mukherjee, Nasrin Razmi, Armin Dekorsy, Petar Popovski, Bho Matthiesen

Abstract:This paper investigates federated learning (FL) in a multi-hop communication setup, such as in constellations with inter-satellite links. In this setup, part of the FL clients are responsible for forwarding other client's results to the parameter server. Instead of using conventional routing, the communication efficiency can be improved significantly by using in-network model aggregation at each intermediate hop, known as incremental aggregation (IA). Prior works [1] have indicated diminishing gains for IA under gradient sparsification. Here we study this issue and propose several novel correlated sparsification methods for IA. Numerical results show that, for some of these algorithms, the full potential of IA is still available under sparsification without impairing convergence. We demonstrate a 15x improvement in communication efficiency over conventional routing and a 11x improvement over state-of-the-art (SoA) sparse IA.

* This paper is accepted for the 25th IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC) conference

Via

Access Paper or Ask Questions

Determining Standard Occupational Classification Codes from Job Descriptions in Immigration Petitions

Sep 30, 2021

Sourav Mukherjee, David Widmark, Vince DiMascio, Tim Oates

Figure 1 for Determining Standard Occupational Classification Codes from Job Descriptions in Immigration Petitions

Figure 2 for Determining Standard Occupational Classification Codes from Job Descriptions in Immigration Petitions

Figure 3 for Determining Standard Occupational Classification Codes from Job Descriptions in Immigration Petitions

Figure 4 for Determining Standard Occupational Classification Codes from Job Descriptions in Immigration Petitions

Abstract:Accurate specification of standard occupational classification (SOC) code is critical to the success of many U.S. work visa applications. Determination of correct SOC code relies on careful study of job requirements and comparison to definitions given by the U.S. Bureau of Labor Statistics, which is often a tedious activity. In this paper, we apply methods from natural language processing (NLP) to computationally determine SOC code based on job description. We implement and empirically evaluate a broad variety of predictive models with respect to quality of prediction and training time, and identify models best suited for this task.

* To appear in ICDM 2021 workshop: MLLD-2021

Via

Access Paper or Ask Questions

Immigration Document Classification and Automated Response Generation

Sep 29, 2020

Sourav Mukherjee, Tim Oates, Vince DiMascio, Huguens Jean, Rob Ares, David Widmark, Jaclyn Harder

Figure 1 for Immigration Document Classification and Automated Response Generation

Figure 2 for Immigration Document Classification and Automated Response Generation

Figure 3 for Immigration Document Classification and Automated Response Generation

Figure 4 for Immigration Document Classification and Automated Response Generation

Abstract:In this paper, we consider the problem of organizing supporting documents vital to U.S. work visa petitions, as well as responding to Requests For Evidence (RFE) issued by the U.S.~Citizenship and Immigration Services (USCIS). Typically, both processes require a significant amount of repetitive manual effort. To reduce the burden of mechanical work, we apply machine learning methods to automate these processes, with humans in the loop to review and edit output for submission. In particular, we use an ensemble of image and text classifiers to categorize supporting documents. We also use a text classifier to automatically identify the types of evidence being requested in an RFE, and used the identified types in conjunction with response templates and extracted fields to assemble draft responses. Empirical results suggest that our approach achieves considerable accuracy while significantly reducing processing time.

* To appear in ICDM 2020 workshop: MLLD-2020

Via

Access Paper or Ask Questions

Graph Node Embeddings using Domain-Aware Biased Random Walks

Aug 08, 2019

Sourav Mukherjee, Tim Oates, Ryan Wright

Figure 1 for Graph Node Embeddings using Domain-Aware Biased Random Walks

Figure 2 for Graph Node Embeddings using Domain-Aware Biased Random Walks

Figure 3 for Graph Node Embeddings using Domain-Aware Biased Random Walks

Figure 4 for Graph Node Embeddings using Domain-Aware Biased Random Walks

Abstract:The recent proliferation of publicly available graph-structured data has sparked an interest in machine learning algorithms for graph data. Since most traditional machine learning algorithms assume data to be tabular, embedding algorithms for mapping graph data to real-valued vector spaces has become an active area of research. Existing graph embedding approaches are based purely on structural information and ignore any semantic information from the underlying domain. In this paper, we demonstrate that semantic information can play a useful role in computing graph embeddings. Specifically, we present a framework for devising embedding strategies aware of domain-specific interpretations of graph nodes and edges, and use knowledge of downstream machine learning tasks to identify relevant graph substructures. Using two real-life domains, we show that our framework yields embeddings that are simple to implement and yet achieve equal or greater accuracy in machine learning tasks compared to domain independent approaches.

Via

Access Paper or Ask Questions