Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Martin Servin

A simulation framework for autonomous lunar construction work

May 28, 2025

Mattias Linde, Daniel Lindmark, Sandra Ålstig, Martin Servin

Abstract:We present a simulation framework for lunar construction work involving multiple autonomous machines. The framework supports modelling of construction scenarios and autonomy solutions, execution of the scenarios in simulation, and analysis of work time and energy consumption throughout the construction project. The simulations are based on physics-based models for contacting multibody dynamics and deformable terrain, including vehicle-soil interaction forces and soil flow in real time. A behaviour tree manages the operational logic and error handling, which enables the representation of complex behaviours through a discrete set of simpler tasks in a modular hierarchical structure. High-level decision-making is separated from lower-level control algorithms, with the two connected via ROS2. Excavation movements are controlled through inverse kinematics and tracking controllers. The framework is tested and demonstrated on two different lunar construction scenarios.

* 12 pages, 16 figures

Via

Access Paper or Ask Questions

Synthesizing multi-log grasp poses

Mar 18, 2024

Arvid Fälldin, Erik Wallin, Tommy Löfstedt, Martin Servin

Abstract:Multi-object grasping is a challenging task. It is important for energy and cost-efficient operation of industrial crane manipulators, such as those used to collect tree logs off the forest floor and onto forest machines. In this work, we used synthetic data from physics simulations to explore how data-driven modeling can be used to infer multi-object grasp poses from images. We showed that convolutional neural networks can be trained specifically for synthesizing multi-object grasps. Using RGB-Depth images and instance segmentation masks as input, a U-Net model outputs grasp maps with corresponding grapple orientation and opening width. Given an observation of a pile of logs, the model can be used to synthesize and rate the possible grasp poses and select the most suitable one, with the possibility to respect changing operational constraints such as lift capacity and reach. When tested on previously unseen data, the proposed model found successful grasp poses with an accuracy of 95%.

Via

Access Paper or Ask Questions

Examining the simulation-to-reality gap of a wheel loader digging in deformable terrain

Oct 10, 2023

Koji Aoshima, Martin Servin

Abstract:We investigate how well a physics-based simulator can replicate a real wheel loader performing bucket filling in a pile of soil. The comparison is made using field test time series of the vehicle motion and actuation forces, loaded mass, and total work. The vehicle was modeled as a rigid multibody system with frictional contacts, driveline, and linear actuators. For the soil, we tested discrete element models of different resolutions, with and without multiscale acceleration. The spatio-temporal resolution ranged between 50-400 mm and 2-500 ms, and the computational speed was between 1/10,000 to 5 times faster than real-time. The simulation-to-reality gap was found to be around 10% and exhibited a weak dependence on the level of fidelity, e.g., compatible with real-time simulation. Furthermore, the sensitivity of an optimized force feedback controller under transfer between different simulation domains was investigated. The domain bias was observed to cause a performance reduction of 5% despite the domain gap being about 15%.

* 17 pages, 11 figures

Via

Access Paper or Ask Questions

Predictor models for high-performance wheel loading

Sep 22, 2023

Koji Aoshima, Arvid Fälldin, Eddie Wadbro, Martin Servin

Abstract:Autonomous wheel loading involves selecting actions that maximize the total performance over many repetitions. The actions should be well adapted to the current state of the pile and its future states. Selecting the best actions is difficult since the pile states are consequences of previous actions and thus are highly unknown. To aid the selection of actions, this paper investigates data-driven models to predict the loaded mass, time, work, and resulting pile state of a loading action given the initial pile state. Deep neural networks were trained on data using over 10,000 simulations to an accuracy of 91-97,% with the pile state represented either by a heightmap or by its slope and curvature. The net outcome of sequential loading actions is predicted by repeating the model inference at five milliseconds per loading. As errors accumulate during the inferences, long-horizon predictions need to be combined with a physics-based model.

* 22 pages, 19 figures

Via

Access Paper or Ask Questions

Multi-log grasping using reinforcement learning and virtual visual servoing

Sep 06, 2023

Erik Wallin, Viktor Wiberg, Martin Servin

Figure 1 for Multi-log grasping using reinforcement learning and virtual visual servoing

Figure 2 for Multi-log grasping using reinforcement learning and virtual visual servoing

Figure 3 for Multi-log grasping using reinforcement learning and virtual visual servoing

Figure 4 for Multi-log grasping using reinforcement learning and virtual visual servoing

Abstract:We explore multi-log grasping using reinforcement learning and virtual visual servoing for automated forwarding. Automation of forest processes is a major challenge, and many techniques regarding robot control pose different challenges due to the unstructured and harsh outdoor environment. Grasping multiple logs involves problems of dynamics and path planning, where the interaction between the grapple, logs, terrain, and obstacles requires visual information. To address these challenges, we separate image segmentation from crane control and utilize a virtual camera to provide an image stream from 3D reconstructed data. We use Cartesian control to simplify domain transfer. Since log piles are static, visual servoing using a 3D reconstruction of the pile and its surroundings is equivalent to using real camera data until the point of grasping. This relaxes the limit on computational resources and time for the challenge of image segmentation, and allows for collecting data in situations where the log piles are not occluded. The disadvantage is the lack of information during grasping. We demonstrate that this problem is manageable and present an agent that is 95% successful in picking one or several logs from challenging piles of 2--5 logs.

* 8 pages, 10 figures

Via

Access Paper or Ask Questions

Sim-to-real transfer of active suspension control using deep reinforcement learning

Jun 21, 2023

Viktor Wiberg, Erik Wallin, Arvid Fälldin, Tobias Semberg, Morgan Rossander, Eddie Wadbro, Martin Servin

Figure 1 for Sim-to-real transfer of active suspension control using deep reinforcement learning

Figure 2 for Sim-to-real transfer of active suspension control using deep reinforcement learning

Figure 3 for Sim-to-real transfer of active suspension control using deep reinforcement learning

Figure 4 for Sim-to-real transfer of active suspension control using deep reinforcement learning

Abstract:We explore sim-to-real transfer of deep reinforcement learning controllers for a heavy vehicle with active suspensions designed for traversing rough terrain. While related research primarily focuses on lightweight robots with electric motors and fast actuation, this study uses a forestry vehicle with a complex hydraulic driveline and slow actuation. We simulate the vehicle using multibody dynamics and apply system identification to find an appropriate set of simulation parameters. We then train policies in simulation using various techniques to mitigate the sim-to-real gap, including domain randomization, action delays, and a reward penalty to encourage smooth control. In reality, the policies trained with action delays and a penalty for erratic actions perform at nearly the same level as in simulation. In experiments on level ground, the motion trajectories closely overlap when turning to either side, as well as in a route tracking scenario. When faced with a ramp that requires active use of the suspensions, the simulated and real motions are in close alignment. This shows that the actuator model together with system identification yields a sufficiently accurate model of the actuators. We observe that policies trained without the additional action penalty exhibit fast switching or bang-bang control. These present smooth motions and high performance in simulation but transfer poorly to reality. We find that policies make marginal use of the local height map for perception, showing no indications of look-ahead planning. However, the strong transfer capabilities entail that further development concerning perception and performance can be largely confined to simulation.

* 15 pages, 18 figures

Via

Access Paper or Ask Questions

Learning multiobjective rough terrain traversability

Apr 13, 2022

Erik Wallin, Viktor Wiberg, Folke Vesterlund, Johan Holmgren, Henrik Persson, Martin Servin

Figure 1 for Learning multiobjective rough terrain traversability

Figure 2 for Learning multiobjective rough terrain traversability

Figure 3 for Learning multiobjective rough terrain traversability

Figure 4 for Learning multiobjective rough terrain traversability

Abstract:We present a method that uses high-resolution topography data of rough terrain, and ground vehicle simulation, to predict traversability. Traversability is expressed as three independent measures: the ability to traverse the terrain at a target speed, energy consumption, and acceleration. The measures are continuous and reflect different objectives for planning that go beyond binary classification. A deep neural network is trained to predict the traversability measures from the local heightmap and target speed. To produce training data, we use an articulated vehicle with wheeled bogie suspensions and procedurally generated terrains. We evaluate the model on laser-scanned forest terrains, previously unseen by the model. The model predicts traversability with an accuracy of 90%. Predictions rely on features from the high-dimensional terrain data that surpass local roughness and slope relative to the heading. Correlations show that the three traversability measures are complementary to each other. With an inference speed 3000 times faster than the ground truth simulation and trivially parallelizable, the model is well suited for traversability analysis and optimal path planning over large areas.

* 11 pages, 17 figures, 2 tables

Via

Access Paper or Ask Questions

Control of rough terrain vehicles using deep reinforcement learning

Jul 05, 2021

Viktor Wiberg, Erik Wallin, Martin Servin, Tomas Nordfjell

Figure 1 for Control of rough terrain vehicles using deep reinforcement learning

Figure 2 for Control of rough terrain vehicles using deep reinforcement learning

Figure 3 for Control of rough terrain vehicles using deep reinforcement learning

Figure 4 for Control of rough terrain vehicles using deep reinforcement learning

Abstract:We explore the potential to control terrain vehicles using deep reinforcement in scenarios where human operators and traditional control methods are inadequate. This letter presents a controller that perceives, plans, and successfully controls a 16-tonne forestry vehicle with two frame articulation joints, six wheels, and their actively articulated suspensions to traverse rough terrain. The carefully shaped reward signal promotes safe, environmental, and efficient driving, which leads to the emergence of unprecedented driving skills. We test learned skills in a virtual environment, including terrains reconstructed from high-density laser scans of forest sites. The controller displays the ability to handle obstructing obstacles, slopes up to 27$^\circ$, and a variety of natural terrains, all with limited wheel slip, smooth, and upright traversal with intelligent use of the active suspensions. The results confirm that deep reinforcement learning has the potential to enhance control of vehicles with complex dynamics and high-dimensional observation data compared to human operators or traditional control methods, especially in rough terrain.

* 16 pages, 13 figures

Via

Access Paper or Ask Questions

Continuous control of an underground loader using deep reinforcement learning

Mar 23, 2021

Sofi Backman, Daniel Lindmark, Kenneth Bodin, Martin Servin, Joakim Mörk, Håkan Löfgren

Figure 1 for Continuous control of an underground loader using deep reinforcement learning

Figure 2 for Continuous control of an underground loader using deep reinforcement learning

Figure 3 for Continuous control of an underground loader using deep reinforcement learning

Figure 4 for Continuous control of an underground loader using deep reinforcement learning

Abstract:Reinforcement learning control of an underground loader is investigated in simulated environment, using a multi-agent deep neural network approach. At the start of each loading cycle, one agent selects the dig position from a depth camera image of the pile of fragmented rock. A second agent is responsible for continuous control of the vehicle, with the goal of filling the bucket at the selected loading point, while avoiding collisions, getting stuck, or losing ground traction. It relies on motion and force sensors, as well as on camera and lidar. Using a soft actor-critic algorithm the agents learn policies for efficient bucket filling over many subsequent loading cycles, with clear ability to adapt to the changing environment. The best results, on average 75% of the max capacity, are obtained when including a penalty for energy usage in the reward.

* 9 pages, 7 figures

Via

Access Paper or Ask Questions

Reinforcement Learning Control of a Forestry Crane Manipulator

Mar 03, 2021

Jennifer Andersson, Kenneth Bodin, Daniel Lindmark, Martin Servin, Erik Wallin

Figure 1 for Reinforcement Learning Control of a Forestry Crane Manipulator

Figure 2 for Reinforcement Learning Control of a Forestry Crane Manipulator

Figure 3 for Reinforcement Learning Control of a Forestry Crane Manipulator

Figure 4 for Reinforcement Learning Control of a Forestry Crane Manipulator

Abstract:Forestry machines are heavy vehicles performing complex manipulation tasks in unstructured production forest environments. Together with the complex dynamics of the on-board hydraulically actuated cranes, the rough forest terrains have posed a particular challenge in forestry automation. In this study, the feasibility of applying reinforcement learning control to forestry crane manipulators is investigated in a simulated environment. Our results show that it is possible to learn successful actuator-space control policies for energy efficient log grasping by invoking a simple curriculum in a deep reinforcement learning setup. Given the pose of the selected logs, our best control policy reaches a grasping success rate of 97%. Including an energy-optimization goal in the reward function, the energy consumption is significantly reduced compared to control policies learned without incentive for energy optimization, while the increase in cycle time is marginal. The energy-optimization effects can be observed in the overall smoother motion and acceleration profiles during crane manipulation.

* 8 pages, 6 figures

Via

Access Paper or Ask Questions