SiROP
Login   
Language
  • English
    • English
    • German
Home
Menu
  • Login
  • Register
  • Search Opportunity
  • Search Organization
  • Create project alert
Information
  • About SiROP
  • Team
  • Network
  • Partners
  • Imprint
  • Terms & conditions
Register now After registration you will be able to apply for this opportunity online.

Reinforcement Learning for Excavation Planning

We aim to develop a reinforcement learning-based global excavation planner that can plan for the long term and execute a wide range of excavation geometries. The system will be deployed on our legged excavator.

Keywords: Keywords: Reinforcement learning, task planning

  • Reinforcement learning has demonstrated significant success in decision-making and behavior planning with discrete states and action spaces. In this project, we plan to develop and extend a global excavation planner responsible for selecting the next digging area and the actions required to move soil around the site. This requires long-term planning for the sequence of excavation and an understanding of which areas are accessible and where the excavator could potentially become trapped. We developed using Jax a simulation environment, Terra, where agents can be trained even in millions of parallel environments on multiple GPUs. The first part of the project will focus on modifying the simulation environment to include 3D soil and multiple agents to simulate are real contruction site. The second part of the project will focus on deploying the system and integrating it with the current stack to dig geometries that were not achieved so far. References: [1] PPO for LUX AI [2] Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model, Deepmind [3] IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures, Deepmind

    Reinforcement learning has demonstrated significant success in decision-making and behavior planning with discrete states and action spaces. In this project, we plan to develop and extend a global excavation planner responsible for selecting the next digging area and the actions required to move soil around the site. This requires long-term planning for the sequence of excavation and an understanding of which areas are accessible and where the excavator could potentially become trapped.
    We developed using Jax a simulation environment, Terra, where agents can be trained even in millions of parallel environments on multiple GPUs. The first part of the project will focus on modifying the simulation environment to include 3D soil and multiple agents to simulate are real contruction site. The second part of the project will focus on deploying the system and integrating it with the current stack to dig geometries that were not achieved so far.

    References:
    [1] PPO for LUX AI
    [2] Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model, Deepmind
    [3] IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures, Deepmind

  • - Expand the Terra env to include 3D soil and/or multiple agents (such as trucks, wheel loaders, ...) - Train the agent(s), design curriculum and architecture - Deploy the system on realistic task

    - Expand the Terra env to include 3D soil and/or multiple agents (such as trucks, wheel loaders, ...)
    - Train the agent(s), design curriculum and architecture
    - Deploy the system on realistic task

  • - Experience in PyTorch and training neural networks - Experience with GPU-accelerated environments (preferred)

    - Experience in PyTorch and training neural networks
    - Experience with GPU-accelerated environments (preferred)

  • Not specified

  • Not specified

  • Not specified

Calendar

Earliest start2023-10-01
Latest end2024-04-30

Location

Robotic Systems Lab (ETHZ)

Labels

Semester Project

Master Thesis

Topics

  • Information, Computing and Communication Sciences

Documents

NameCommentSizeActions
Reinforcement_learning_for_excavation_planning.docx13.3MBDownload
SiROP PARTNER INSTITUTIONS