Rohit Dwivedula

ConfigBot: Adaptive Resource Allocation for Robot Applications in Dynamic Environments

Jan 17, 2025

Rohit Dwivedula, Sadanand Modak, Aditya Akella, Joydeep Biswas, Daehyeok Kim, Christopher J. Rossbach

Abstract: The growing use of autonomous mobile service robots (AMSRs) in dynamic environments requires flexible management of compute resources to optimize the performance of diverse tasks such as navigation, localization, and perception. Current robot deployments, which often rely on static configurations (of the OS, applications, etc.) and system over-provisioning, fall short because they do not account for variations in the tasks' performance, resulting in poor system-wide behavior such as robot instability and/or inefficient resource use. This paper presents ConfigBot, a system designed to adaptively reconfigure AMSR applications to meet a predefined performance specification by leveraging runtime profiling and automated configuration tuning. Through experiments on a Boston Dynamics Spot robot equipped with NVIDIA AGX Orin, we demonstrate ConfigBot's efficacy in maintaining system stability and optimizing resource allocation across diverse scenarios. Our findings highlight the promise of tailored and dynamic configurations for robot deployments.

* 14 pages, 13 figures, 6 tables

C3: Learning Congestion Controllers with Formal Certificates

Dec 14, 2024

Chenxi Yang, Divyanshu Saxena, Rohit Dwivedula, Kshiteej Mahajan, Swarat Chaudhuri, Aditya Akella

Abstract: Learning-based congestion controllers offer better adaptability than traditional heuristic algorithms. However, the inherent unreliability of learning techniques can cause learning-based controllers to behave poorly, creating a need for formal guarantees. While methods for formally verifying learned congestion controllers exist, these methods offer only binary feedback, which cannot be used to optimize the controller toward better behavior. We improve on this state of the art with C3, a new learning framework for congestion control that integrates the concept of formal certification into the learning loop. C3 uses an abstract interpreter that can produce robustness and performance certificates to guide the training process, rewarding models that are robust and performant even on worst-case inputs. Our evaluation demonstrates that, unlike state-of-the-art learned controllers, C3-trained controllers provide both adaptability and worst-case reliability across a range of network conditions.
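
To make the idea of a worst-case certificate more concrete, the sketch below propagates input intervals through a single affine controller and turns the resulting output bound into a graded margin rather than a pass/fail verdict. It is an illustrative toy, not C3's abstract interpreter: the interval domain, the affine controller, and the names `interval_affine` and `certificate_margin` are all assumptions made for this example.

```python
def interval_affine(weights, bias, lows, highs):
    """Sound output bounds of w.x + b when each x[i] lies in [lows[i], highs[i]].

    Hypothetical helper: uses the interval abstract domain on one affine layer.
    """
    lo = hi = bias
    for w, l, h in zip(weights, lows, highs):
        if w >= 0:
            lo += w * l
            hi += w * h
        else:
            # A negative weight flips which end of the input interval is extreme.
            lo += w * h
            hi += w * l
    return lo, hi


def certificate_margin(bounds, allowed):
    """Signed slack between the worst-case output range and the allowed range.

    Positive means every input in the region provably stays inside `allowed`;
    negative measures how far the worst case can overshoot. A graded value
    like this could be added to a training reward (an assumption here).
    """
    lo, hi = bounds
    allowed_lo, allowed_hi = allowed
    return min(lo - allowed_lo, allowed_hi - hi)


bounds = interval_affine([2, -1], 1, lows=[0, 0], highs=[1, 2])
margin = certificate_margin(bounds, allowed=(-2, 4))
print(bounds, margin)
```

The point of returning a margin instead of a boolean is that it tells a learner how close the controller is to violating the property, which is the kind of non-binary feedback the abstract contrasts with existing verifiers.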

Accelerating Distributed Deep Learning using Lossless Homomorphic Compression

Feb 12, 2024

Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim

Abstract: As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalability of distributed training systems. Existing solutions, while aiming to mitigate this bottleneck through worker-level compression and in-network aggregation, fall short due to their inability to efficiently reconcile the trade-offs between compression effectiveness and computational overhead, hindering overall performance and scalability. In this paper, we introduce a novel compression algorithm that effectively merges worker-level compression with in-network aggregation. Our solution is both homomorphic, allowing for efficient in-network aggregation without CPU/GPU processing, and lossless, ensuring no compromise on training accuracy. Theoretically optimal in compression and computational efficiency, our approach is empirically validated across diverse DNN models such as NCF, LSTM, VGG19, and BERT-base, showing up to a 6.33$\times$ improvement in aggregation throughput and a 3.74$\times$ increase in per-iteration training speed.
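
The two properties named in the abstract can be illustrated with a small toy. "Homomorphic" here means that compressed gradients can be summed directly, without decompressing, and "lossless" means the decompressed sum equals the true sum exactly. The sketch below is not the paper's algorithm: the modulo bucketing, the explicit support set, and the names `compress`, `aggregate`, and `decompress` are assumptions chosen only to show the property on sparse integer vectors.

```python
def compress(vec, num_buckets):
    """Fold a sparse vector into `num_buckets` sums plus its nonzero index set."""
    buckets = [0] * num_buckets
    support = set()
    for i, v in enumerate(vec):
        if v != 0:
            buckets[i % num_buckets] += v  # linear map, so sums commute with it
            support.add(i)
    return buckets, support


def aggregate(a, b):
    """Combine two compressed vectors without decompressing either one."""
    buckets = [x + y for x, y in zip(a[0], b[0])]
    return buckets, a[1] | b[1]


def decompress(comp, length):
    """Recover the vector exactly; refuse if two support indices share a bucket."""
    buckets, support = comp
    slots = [i % len(buckets) for i in support]
    if len(set(slots)) != len(slots):
        raise ValueError("bucket collision: this toy scheme is not lossless here")
    out = [0] * length
    for i in support:
        out[i] = buckets[i % len(buckets)]
    return out


g1 = [0, 3, 0, 0, 0, 0, -2, 0]
g2 = [0, 1, 0, 0, 5, 0, 0, 0]
summed = decompress(aggregate(compress(g1, 4), compress(g2, 4)), len(g1))
print(summed == [x + y for x, y in zip(g1, g2)])
```

Because `aggregate` is only elementwise addition and a set union, it is the kind of operation that could in principle run on an aggregation device without CPU/GPU involvement; how the paper achieves this in practice is not described in the abstract.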

On a Foundation Model for Operating Systems

Dec 13, 2023

Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel(+8 more)

Abstract: This paper lays out the research agenda for a domain-specific foundation model for operating systems (OSes). Our case for a foundation model revolves around two observations: several OS components, such as the CPU, memory, and network subsystems, are interrelated, and OS traces offer the ideal dataset for a foundation model to grasp the intricacies of diverse OS components and their behavior in varying environments and workloads. We discuss a wide range of possibilities that then arise, from employing foundation models as policy agents to using them as generators and predictors that assist traditional OS control algorithms. Our hope is that this paper spurs further research into OS foundation models and into creating the next generation of operating systems for the evolving computing landscape.

* Machine Learning for Systems Workshop at 37th NeurIPS Conference, 2023, New Orleans, LA, USA
