Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Błażej Osiński

PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Aug 20, 2024

Yan Wu, Esther Wershof, Sebastian M Schmon, Marcel Nassar, Błażej Osiński, Ridvan Eksi, Kun Zhang, Thore Graepel

Figure 1 for PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Figure 2 for PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Figure 3 for PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Figure 4 for PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis

Abstract:We present a comprehensive framework for predicting the effects of perturbations in single cells, designed to standardize benchmarking in this rapidly evolving field. Our framework, PerturBench, includes a user-friendly platform, diverse datasets, metrics for fair model comparison, and detailed performance analysis. Extensive evaluations of published and baseline models reveal limitations like mode or posterior collapse, and underscore the importance of rank metrics that assess the ordering of perturbations alongside traditional measures like RMSE. Our findings show that simple models can outperform more complex approaches. This benchmarking exercise sets new standards for model evaluation, supports robust model development, and advances the potential of these models to use high-throughput and high-content genetic and chemical screens for disease target discovery.

* 9 pages plus 19 pages supplementary material. Code is available at https://github.com/altoslabs/perturbench

Via

Access Paper or Ask Questions

Off-Policy Correction For Multi-Agent Reinforcement Learning

Nov 22, 2021

Michał Zawalski, Błażej Osiński, Henryk Michalewski, Piotr Miłoś

Figure 1 for Off-Policy Correction For Multi-Agent Reinforcement Learning

Figure 2 for Off-Policy Correction For Multi-Agent Reinforcement Learning

Figure 3 for Off-Policy Correction For Multi-Agent Reinforcement Learning

Figure 4 for Off-Policy Correction For Multi-Agent Reinforcement Learning

Abstract:Multi-agent reinforcement learning (MARL) provides a framework for problems involving multiple interacting agents. Despite apparent similarity to the single-agent case, multi-agent problems are often harder to train and analyze theoretically. In this work, we propose MA-Trace, a new on-policy actor-critic algorithm, which extends V-Trace to the MARL setting. The key advantage of our algorithm is its high scalability in a multi-worker setting. To this end, MA-Trace utilizes importance sampling as an off-policy correction method, which allows distributing the computations with no impact on the quality of training. Furthermore, our algorithm is theoretically grounded - we prove a fixed-point theorem that guarantees convergence. We evaluate the algorithm extensively on the StarCraft Multi-Agent Challenge, a standard benchmark for multi-agent algorithms. MA-Trace achieves high performance on all its tasks and exceeds state-of-the-art results on some of them.

Via

Access Paper or Ask Questions

SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Sep 28, 2021

Matt Vitelli, Yan Chang, Yawei Ye, Maciej Wołczyk, Błażej Osiński, Moritz Niendorf, Hugo Grimmett, Qiangui Huang, Ashesh Jain, Peter Ondruska

Figure 1 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 2 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 3 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 4 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Abstract:In this paper we present the first safe system for full control of self-driving vehicles trained from human demonstrations and deployed in challenging, real-world, urban environments. Current industry-standard solutions use rule-based systems for planning. Although they perform reasonably well in common scenarios, the engineering complexity renders this approach incompatible with human-level performance. On the other hand, the performance of machine-learned (ML) planning solutions can be improved by simply adding more exemplar data. However, ML methods cannot offer safety guarantees and sometimes behave unpredictably. To combat this, our approach uses a simple yet effective rule-based fallback layer that performs sanity checks on an ML planner's decisions (e.g. avoiding collision, assuring physical feasibility). This allows us to leverage ML to handle complex situations while still assuring the safety, reducing ML planner-only collisions by 95%. We train our ML planner on 300 hours of expert driving demonstrations using imitation learning and deploy it along with the fallback layer in downtown San Francisco, where it takes complete control of a real vehicle and navigates a wide variety of challenging urban driving scenarios.

Via

Access Paper or Ask Questions

Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Sep 27, 2021

Oliver Scheel, Luca Bergamini, Maciej Wołczyk, Błażej Osiński, Peter Ondruska

Figure 1 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 2 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 3 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 4 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Abstract:In this work we are the first to present an offline policy gradient method for learning imitative policies for complex urban driving from a large corpus of real-world demonstrations. This is achieved by building a differentiable data-driven simulator on top of perception outputs and high-fidelity HD maps of the area. It allows us to synthesize new driving experiences from existing demonstrations using mid-level representations. Using this simulator we then train a policy network in closed-loop employing policy gradients. We train our proposed method on 100 hours of expert demonstrations on urban roads and show that it learns complex driving policies that generalize well and can perform a variety of driving maneuvers. We demonstrate this in simulation as well as deploy our model to self-driving vehicles in the real-world. Our method outperforms previously demonstrated state-of-the-art for urban driving scenarios -- all this without the need for complex state perturbations or collecting additional on-policy data during training. We make code and data publicly available.

* CoRL 2021

Via

Access Paper or Ask Questions

CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving

Dec 16, 2020

Błażej Osiński, Piotr Miłoś, Adam Jakubowski, Paweł Zięcina, Michał Martyniak, Christopher Galias, Antonia Breuer, Silviu Homoceanu, Henryk Michalewski

Figure 1 for CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving

Figure 2 for CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving

Figure 3 for CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving

Figure 4 for CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving

Abstract:This work introduces interactive traffic scenarios in the CARLA simulator, which are based on real-world traffic. We concentrate on tactical tasks lasting several seconds, which are especially challenging for current control methods. The CARLA Real Traffic Scenarios (CRTS) is intended to be a training and testing ground for autonomous driving systems. To this end, we open-source the code under a permissive license and present a set of baseline policies. CRTS combines the realism of traffic scenarios and the flexibility of simulation. We use it to train agents using a reinforcement learning algorithm. We show how to obtain competitive polices and evaluate experimentally how observation types and reward schemes affect the training process and the resulting agent's behavior.

Via

Access Paper or Ask Questions

Simulation-based reinforcement learning for real-world autonomous driving

Dec 26, 2019

Błażej Osiński, Adam Jakubowski, Piotr Miłoś, Paweł Zięcina, Christopher Galias, Silviu Homoceanu, Henryk Michalewski

Figure 1 for Simulation-based reinforcement learning for real-world autonomous driving

Figure 2 for Simulation-based reinforcement learning for real-world autonomous driving

Figure 3 for Simulation-based reinforcement learning for real-world autonomous driving

Figure 4 for Simulation-based reinforcement learning for real-world autonomous driving

Abstract:We use synthetic data and a reinforcement learning algorithm to train a system controlling a full-size real-world vehicle in a number of restricted driving scenarios. The driving policy uses RGB images as input. We analyze how design decisions about perception, control and training impact the real-world performance.

Via

Access Paper or Ask Questions

Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

Apr 02, 2018

Łukasz Kidziński, Sharada Prasanna Mohanty, Carmichael Ong, Zhewei Huang, Shuchang Zhou, Anton Pechenko, Adam Stelmaszczyk, Piotr Jarosik, Mikhail Pavlov, Sergey Kolesnikov(+19 more)

Figure 1 for Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

Figure 2 for Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

Figure 3 for Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

Figure 4 for Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments

Abstract:In the NIPS 2017 Learning to Run challenge, participants were tasked with building a controller for a musculoskeletal model to make it run as fast as possible through an obstacle course. Top participants were invited to describe their algorithms. In this work, we present eight solutions that used deep reinforcement learning approaches, based on algorithms such as Deep Deterministic Policy Gradient, Proximal Policy Optimization, and Trust Region Policy Optimization. Many solutions use similar relaxations and heuristics, such as reward shaping, frame skipping, discretization of the action space, symmetry, and policy blending. However, each of the eight teams implemented different modifications of the known algorithms.

* 27 pages, 17 figures

Via

Access Paper or Ask Questions