Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wenbo Gao

SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Jul 14, 2024

Lin Zhang, Wenbo Gao, Jie Yi, Yunyun Yang

Figure 1 for SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Figure 2 for SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Figure 3 for SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Figure 4 for SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

Abstract:Multi-organ segmentation in medical image analysis is crucial for diagnosis and treatment planning. However, many factors complicate the task, including variability in different target categories and interference from complex backgrounds. In this paper, we utilize the knowledge of Deformable Convolution V3 (DCNv3) and multi-object segmentation to optimize our Spatially Adaptive Convolution Network (SACNet) in three aspects: feature extraction, model architecture, and loss constraint, simultaneously enhancing the perception of different segmentation targets. Firstly, we propose the Adaptive Receptive Field Module (ARFM), which combines DCNv3 with a series of customized block-level and architecture-level designs similar to transformers. This module can capture the unique features of different organs by adaptively adjusting the receptive field according to various targets. Secondly, we utilize ARFM as building blocks to construct the encoder-decoder of SACNet and partially share parameters between the encoder and decoder, making the network wider rather than deeper. This design achieves a shared lightweight decoder and a more parameter-efficient and effective framework. Lastly, we propose a novel continuity dynamic adjustment loss function, based on t-vMF dice loss and cross-entropy loss, to better balance easy and complex classes in segmentation. Experiments on 3D slice datasets from ACDC and Synapse demonstrate that SACNet delivers superior segmentation performance in multi-organ segmentation tasks compared to several existing methods.

Via

Access Paper or Ask Questions

Robotic Table Tennis: A Case Study into a High Speed Learning System

Sep 06, 2023

David B. D'Ambrosio, Jonathan Abelian, Saminda Abeyruwan, Michael Ahn, Alex Bewley, Justin Boyd, Krzysztof Choromanski, Omar Cortes, Erwin Coumans, Tianli Ding(+25 more)

Figure 1 for Robotic Table Tennis: A Case Study into a High Speed Learning System

Figure 2 for Robotic Table Tennis: A Case Study into a High Speed Learning System

Figure 3 for Robotic Table Tennis: A Case Study into a High Speed Learning System

Figure 4 for Robotic Table Tennis: A Case Study into a High Speed Learning System

Abstract:We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real world and also train policies for zero-shot transfer, and automated real world environment resets that enable autonomous training and evaluation on physical robots. We complement a complete system description, including numerous design decisions that are typically not widely disseminated, with a collection of studies that clarify the importance of mitigating various sources of latency, accounting for training and deployment distribution shifts, robustness of the perception system, sensitivity to policy hyper-parameters, and choice of action space. A video demonstrating the components of the system and details of experimental results can be found at https://youtu.be/uFcnWjB42I0.

* Published and presented at Robotics: Science and Systems (RSS2023)

Via

Access Paper or Ask Questions

ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Jan 19, 2021

Xingyou Song, Krzysztof Choromanski, Jack Parker-Holder, Yunhao Tang, Daiyi Peng, Deepali Jain, Wenbo Gao, Aldo Pacchiano, Tamas Sarlos, Yuxiang Yang

Figure 1 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Figure 2 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Figure 3 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Figure 4 for ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning

Abstract:We introduce ES-ENAS, a simple neural architecture search (NAS) algorithm for the purpose of reinforcement learning (RL) policy design, by combining Evolutionary Strategies (ES) and Efficient NAS (ENAS) in a highly scalable and intuitive way. Our main insight is noticing that ES is already a distributed blackbox algorithm, and thus we may simply insert a model controller from ENAS into the central aggregator in ES and obtain weight sharing properties for free. By doing so, we bridge the gap from NAS research in supervised learning settings to the reinforcement learning scenario through this relatively simple marriage between two different lines of research, and are one of the first to apply controller-based NAS techniques to RL. We demonstrate the utility of our method by training combinatorial neural network architectures for RL problems in continuous control, via edge pruning and weight sharing. We also incorporate a wide variety of popular techniques from modern NAS literature, including multiobjective optimization and varying controller methods, to showcase their promise in the RL field and discuss possible extensions. We achieve >90% network compression for multiple tasks, which may be special interest in mobile robotics with limited storage and computational resources.

* 14 pages. This is an updated version of a previous submission which can be found at arXiv:1907.06511. See https://github.com/google-research/google-research/tree/master/es_enas for associated code

Via

Access Paper or Ask Questions

Robotic Table Tennis with Model-Free Reinforcement Learning

Mar 31, 2020

Wenbo Gao, Laura Graesser, Krzysztof Choromanski, Xingyou Song, Nevena Lazic, Pannag Sanketi, Vikas Sindhwani, Navdeep Jaitly

Figure 1 for Robotic Table Tennis with Model-Free Reinforcement Learning

Figure 2 for Robotic Table Tennis with Model-Free Reinforcement Learning

Figure 3 for Robotic Table Tennis with Model-Free Reinforcement Learning

Figure 4 for Robotic Table Tennis with Model-Free Reinforcement Learning

Abstract:We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We demonstrate that evolutionary search (ES) methods acting on CNN-based policy architectures for non-visual inputs and convolving across time learn compact controllers leading to smooth motions. Furthermore, we show that with appropriately tuned curriculum learning on the task and rewards, policies are capable of developing multi-modal styles, specifically forehand and backhand stroke, whilst achieving 80\% return rate on a wide range of ball throws. We observe that multi-modality does not require any architectural priors, such as multi-head architectures or hierarchical policies.

* 8 pages, 4 figures

Via

Access Paper or Ask Questions

Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning

Mar 02, 2020

Xingyou Song, Yuxiang Yang, Krzysztof Choromanski, Ken Caluwaerts, Wenbo Gao, Chelsea Finn, Jie Tan

Figure 1 for Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning

Figure 2 for Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning

Figure 3 for Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning

Figure 4 for Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning

Abstract:Learning adaptable policies is crucial for robots to operate autonomously in our complex and quickly changing world. In this work, we present a new meta-learning method that allows robots to quickly adapt to changes in dynamics. In contrast to gradient-based meta-learning algorithms that rely on second-order gradient estimation, we introduce a more noise-tolerant Batch Hill-Climbing adaptation operator and combine it with meta-learning based on evolutionary strategies. Our method significantly improves adaptation to changes in dynamics in high noise settings, which are common in robotics applications. We validate our approach on a quadruped robot that learns to walk while subject to changes in dynamics. We observe that our method significantly outperforms prior gradient-based approaches, enabling the robot to adapt its policy to changes based on less than 3 minutes of real data.

* For associated video file, see http://youtu.be/_QPMCDdFC3E

Via

Access Paper or Ask Questions

ES-MAML: Simple Hessian-Free Meta Learning

Oct 05, 2019

Xingyou Song, Wenbo Gao, Yuxiang Yang, Krzysztof Choromanski, Aldo Pacchiano, Yunhao Tang

Figure 1 for ES-MAML: Simple Hessian-Free Meta Learning

Figure 2 for ES-MAML: Simple Hessian-Free Meta Learning

Figure 3 for ES-MAML: Simple Hessian-Free Meta Learning

Figure 4 for ES-MAML: Simple Hessian-Free Meta Learning

Abstract:We introduce ES-MAML, a new framework for solving the model agnostic meta learning (MAML) problem based on Evolution Strategies (ES). Existing algorithms for MAML are based on policy gradients, and incur significant difficulties when attempting to estimate second derivatives using backpropagation on stochastic policies. We show how ES can be applied to MAML to obtain an algorithm which avoids the problem of estimating second derivatives, and is also conceptually simple and easy to implement. Moreover, ES-MAML can handle new types of nonsmooth adaptation operators, and other techniques for improving performance and estimation of ES methods become applicable. We show empirically that ES-MAML is competitive with existing methods and often yields better adaptation with fewer queries.

* 10 main pages, 21 total pages, 21 figures

Via

Access Paper or Ask Questions

Reinforcement Learning with Chromatic Networks

Jul 10, 2019

Xingyou Song, Krzysztof Choromanski, Jack Parker-Holder, Yunhao Tang, Wenbo Gao, Aldo Pacchiano, Tamas Sarlos, Deepali Jain, Yuxiang Yang

Figure 1 for Reinforcement Learning with Chromatic Networks

Figure 2 for Reinforcement Learning with Chromatic Networks

Figure 3 for Reinforcement Learning with Chromatic Networks

Figure 4 for Reinforcement Learning with Chromatic Networks

Abstract:We present a new algorithm for finding compact neural networks encoding reinforcement learning (RL) policies. To do it, we leverage in the novel RL setting the theory of pointer networks and ENAS-type algorithms for combinatorial optimization of RL policies as well as recent evolution strategies (ES) optimization methods, and propose to define the combinatorial search space to be the the set of different edge-partitionings (colorings) into same-weight classes. For several RL tasks, we manage to learn colorings translating to effective policies parameterized by as few as 17 weight parameters, providing 6x compression over state-of-the-art compact policies based on Toeplitz matrices. We believe that our work is one of the first attempts to propose a rigorous approach to training structured neural network architectures for RL problems that are of interest especially in mobile robotics with limited storage and computational resources.

* 10 main pages, 22 total pages

Via

Access Paper or Ask Questions

Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

May 24, 2019

Yunfei Teng, Wenbo Gao, Francois Chalus, Anna Choromanska, Donald Goldfarb, Adrian Weller

Figure 1 for Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

Figure 2 for Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

Figure 3 for Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

Figure 4 for Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models

Abstract:We consider distributed optimization under communication constraints for training deep learning models. We propose a new algorithm, whose parameter updates rely on two forces: a regular gradient step, and a corrective direction dictated by the currently best-performing worker (leader). Our method differs from the parameter-averaging scheme EASGD in a number of ways: (i) our objective formulation does not change the location of stationary points compared to the original optimization problem; (ii) we avoid convergence decelerations caused by pulling local workers descending to different local minima to each other (i.e. to the average of their parameters); (iii) our update by design breaks the curse of symmetry (the phenomenon of being trapped in poorly generalizing sub-optimal solutions in symmetric non-convex landscapes); and (iv) our approach is more communication efficient since it broadcasts only parameters of the leader rather than all workers. We provide theoretical analysis of the batch version of the proposed algorithm, which we call Leader Gradient Descent (LGD), and its stochastic variant (LSGD). Finally, we implement an asynchronous version of our algorithm and extend it to the multi-leader setting, where we form groups of workers, each represented by its own local leader (the best performer in a group), and update each worker with a corrective direction comprised of two attractive forces: one to the local, and one to the global leader (the best performer among all workers). The multi-leader setting is well-aligned with current hardware architecture, where local workers forming a group lie within a single computational node and different groups correspond to different nodes. For training convolutional neural networks, we empirically demonstrate that our approach compares favorably to state-of-the-art baselines.

Via

Access Paper or Ask Questions