Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Siddhartha Datta

DiffuBox: Refining 3D Object Detection with Point Diffusion

May 25, 2024

Xiangyu Chen, Zhenzhen Liu, Katie Z Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao(+3 more)

Abstract:Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-based box refinement approach. This method employs a domain-agnostic diffusion model, conditioned on the LiDAR points surrounding a coarse bounding box, to simultaneously refine the box's location, size, and orientation. We evaluate this approach under various domain adaptation settings, and our results reveal significant improvements across different datasets, object classes and detectors.

Via

Access Paper or Ask Questions

Online Feature Updates Improve Online (Generalized) Label Shift Adaptation

Feb 05, 2024

Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger

Abstract:This paper addresses the prevalent issue of label shift in an online setting with missing labels, where data distributions change over time and obtaining timely labels is challenging. While existing methods primarily focus on adjusting or updating the final layer of a pre-trained classifier, we explore the untapped potential of enhancing feature representations using unlabeled data at test-time. Our novel method, Online Label Shift adaptation with Online Feature Updates (OLS-OFU), leverages self-supervised learning to refine the feature extraction process, thereby improving the prediction model. Theoretical analyses confirm that OLS-OFU reduces algorithmic regret by capitalizing on self-supervised learning for feature refinement. Empirical studies on various datasets, under both online label shift and generalized label shift conditions, underscore the effectiveness and robustness of OLS-OFU, especially in cases of domain shifts.

Via

Access Paper or Ask Questions

Prompt Expansion for Adaptive Text-to-Image Generation

Dec 27, 2023

Siddhartha Datta, Alexander Ku, Deepak Ramachandran, Peter Anderson

Figure 1 for Prompt Expansion for Adaptive Text-to-Image Generation

Figure 2 for Prompt Expansion for Adaptive Text-to-Image Generation

Figure 3 for Prompt Expansion for Adaptive Text-to-Image Generation

Figure 4 for Prompt Expansion for Adaptive Text-to-Image Generation

Abstract:Text-to-image generation models are powerful but difficult to use. Users craft specific prompts to get better images, though the images can be repetitive. This paper proposes a Prompt Expansion framework that helps users generate high-quality, diverse images with less effort. The Prompt Expansion model takes a text query as input and outputs a set of expanded text prompts that are optimized such that when passed to a text-to-image model, generates a wider variety of appealing images. We conduct a human evaluation study that shows that images generated through Prompt Expansion are more aesthetically pleasing and diverse than those generated by baseline methods. Overall, this paper presents a novel and effective approach to improving the text-to-image generation experience.

Via

Access Paper or Ask Questions

Projected Subnetworks Scale Adaptation

Jan 27, 2023

Siddhartha Datta, Nigel Shadbolt

Figure 1 for Projected Subnetworks Scale Adaptation

Figure 2 for Projected Subnetworks Scale Adaptation

Figure 3 for Projected Subnetworks Scale Adaptation

Figure 4 for Projected Subnetworks Scale Adaptation

Abstract:Large models support great zero-shot and few-shot capabilities. However, updating these models on new tasks can break performance on previous seen tasks and their zero/few-shot unseen tasks. Our work explores how to update zero/few-shot learners such that they can maintain performance on seen/unseen tasks of previous tasks as well as new tasks. By manipulating the parameter updates of a gradient-based meta learner as the projected task-specific subnetworks, we show improvements for large models to retain seen and zero/few shot task performance in online settings.

Via

Access Paper or Ask Questions

Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities

Nov 24, 2022

Siddhartha Datta

Abstract:The advent of personalized reality has arrived. Rapid development in AR/MR/VR enables users to augment or diminish their perception of the physical world. Robust tooling for digital interface modification enables users to change how their software operates. As digital realities become an increasingly-impactful aspect of human lives, we investigate the design of a system that enables users to manipulate the perception of both their physical realities and digital realities. Users can inspect their view history from either reality, and generate interventions that can be interoperably rendered cross-reality in real-time. Personalized interventions can be generated with mask, text, and model hooks. Collaboration between users scales the availability of interventions. We verify our implementation against our design requirements with cognitive walkthroughs, personas, and scalability tests.

* updated. arXiv admin note: text overlap with arXiv:2204.03731

Via

Access Paper or Ask Questions

Multiple Modes for Continual Learning

Sep 29, 2022

Siddhartha Datta, Nigel Shadbolt

Figure 1 for Multiple Modes for Continual Learning

Figure 2 for Multiple Modes for Continual Learning

Figure 3 for Multiple Modes for Continual Learning

Figure 4 for Multiple Modes for Continual Learning

Abstract:Adapting model parameters to incoming streams of data is a crucial factor to deep learning scalability. Interestingly, prior continual learning strategies in online settings inadvertently anchor their updated parameters to a local parameter subspace to remember old tasks, else drift away from the subspace and forget. From this observation, we formulate a trade-off between constructing multiple parameter modes and allocating tasks per mode. Mode-Optimized Task Allocation (MOTA), our contributed adaptation strategy, trains multiple modes in parallel, then optimizes task allocation per mode. We empirically demonstrate improvements over baseline continual learning strategies and across varying distribution shifts, namely sub-population, domain, and task shift.

Via

Access Paper or Ask Questions

Interpolating Compressed Parameter Subspaces

May 19, 2022

Siddhartha Datta, Nigel Shadbolt

Figure 1 for Interpolating Compressed Parameter Subspaces

Figure 2 for Interpolating Compressed Parameter Subspaces

Figure 3 for Interpolating Compressed Parameter Subspaces

Figure 4 for Interpolating Compressed Parameter Subspaces

Abstract:Inspired by recent work on neural subspaces and mode connectivity, we revisit parameter subspace sampling for shifted and/or interpolatable input distributions (instead of a single, unshifted distribution). We enforce a compressed geometric structure upon a set of trained parameters mapped to a set of train-time distributions, denoting the resulting subspaces as Compressed Parameter Subspaces (CPS). We show the success and failure modes of the types of shifted distributions whose optimal parameters reside in the CPS. We find that ensembling point-estimates within a CPS can yield a high average accuracy across a range of test-time distributions, including backdoor, adversarial, permutation, stylization and rotation perturbations. We also find that the CPS can contain low-loss point-estimates for various task shifts (albeit interpolated, perturbed, unseen or non-identical coarse labels). We further demonstrate this property in a continual learning setting with CIFAR100.

Via

Access Paper or Ask Questions

Parameter Adaptation for Joint Distribution Shifts

May 15, 2022

Siddhartha Datta

Figure 1 for Parameter Adaptation for Joint Distribution Shifts

Figure 2 for Parameter Adaptation for Joint Distribution Shifts

Figure 3 for Parameter Adaptation for Joint Distribution Shifts

Figure 4 for Parameter Adaptation for Joint Distribution Shifts

Abstract:While different methods exist to tackle distinct types of distribution shift, such as label shift (in the form of adversarial attacks) or domain shift, tackling the joint shift setting is still an open problem. Through the study of a joint distribution shift manifesting both adversarial and domain-specific perturbations, we not only show that a joint shift worsens model performance compared to their individual shifts, but that the use of a similar domain worsens performance than a dissimilar domain. To curb the performance drop, we study the use of perturbation sets motivated by input and parameter space bounds, and adopt a meta learning strategy (hypernetworks) to model parameters w.r.t. test-time inputs to recover performance.

Via

Access Paper or Ask Questions

GreaseVision: Rewriting the Rules of the Interface

Apr 07, 2022

Siddhartha Datta, Konrad Kollnig, Nigel Shadbolt

Figure 1 for GreaseVision: Rewriting the Rules of the Interface

Figure 2 for GreaseVision: Rewriting the Rules of the Interface

Figure 3 for GreaseVision: Rewriting the Rules of the Interface

Figure 4 for GreaseVision: Rewriting the Rules of the Interface

Abstract:Digital harms can manifest across any interface. Key problems in addressing these harms include the high individuality of harms and the fast-changing nature of digital systems. As a result, we still lack a systematic approach to study harms and produce interventions for end-users. We put forward GreaseVision, a new framework that enables end-users to collaboratively develop interventions against harms in software using a no-code approach and recent advances in few-shot machine learning. The contribution of the framework and tool allow individual end-users to study their usage history and create personalized interventions. Our contribution also enables researchers to study the distribution of harms and interventions at scale.

Via

Access Paper or Ask Questions

Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Mar 07, 2022

Siddhartha Datta, Nigel Shadbolt

Figure 1 for Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Figure 2 for Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Figure 3 for Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Figure 4 for Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Abstract:Recent exploration of the multi-agent backdoor attack demonstrated the backfiring effect, a natural defense against backdoor attacks where backdoored inputs are randomly classified. This yields a side-effect of low accuracy w.r.t. clean labels, which motivates this paper's work on the construction of multi-agent backdoor defenses that maximize accuracy w.r.t. clean labels and minimize that of poison labels. Founded upon agent dynamics and low-loss subspace construction, we contribute three defenses that yield improved multi-agent backdoor robustness.

Via

Access Paper or Ask Questions