Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning

AI-generated keywords: Data harvesting UAVs MARL Dec-POMDP DRL

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Challenging problem of harvesting data from distributed IoT devices using multiple UAVs
  • Proposed MARL approach for adapting to changes in scenario parameters without recomputations or relearning control policies
  • Formulated path planning problem as Dec-POMDP and solved it through DRL
  • Effective cooperation among agents enabled by proposed network architecture
  • Balance between data collection goals, flight-time efficiency, and navigation constraints in movement decisions
  • Control policy that generalizes over scenario parameter space for analyzing individual parameter influence on collection performance and system-level benefits
  • Code availability for further exploration provided
  • Contributes to addressing challenges of data harvesting from IoT devices using UAVs
  • Offers insights into optimizing path planning in complex environments.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Harald Bayerlein, Mirco Theile, Marco Caccamo, David Gesbert

IEEE Open Journal of the Communications Society, vol. 2, pp. 1171-1187, 2021
Modifications: final formatting; Code available under https://github.com/hbayerlein/uav_data_harvesting, article extends on arXiv:2007.00544

Abstract: Harvesting data from distributed Internet of Things (IoT) devices with multiple autonomous unmanned aerial vehicles (UAVs) is a challenging problem requiring flexible path planning methods. We propose a multi-agent reinforcement learning (MARL) approach that, in contrast to previous work, can adapt to profound changes in the scenario parameters defining the data harvesting mission, such as the number of deployed UAVs, number, position and data amount of IoT devices, or the maximum flying time, without the need to perform expensive recomputations or relearn control policies. We formulate the path planning problem for a cooperative, non-communicating, and homogeneous team of UAVs tasked with maximizing collected data from distributed IoT sensor nodes subject to flying time and collision avoidance constraints. The path planning problem is translated into a decentralized partially observable Markov decision process (Dec-POMDP), which we solve through a deep reinforcement learning (DRL) approach, approximating the optimal UAV control policy without prior knowledge of the challenging wireless channel characteristics in dense urban environments. By exploiting a combination of centered global and local map representations of the environment that are fed into convolutional layers of the agents, we show that our proposed network architecture enables the agents to cooperate effectively by carefully dividing the data collection task among themselves, adapt to large complex environments and state spaces, and make movement decisions that balance data collection goals, flight-time efficiency, and navigation constraints. Finally, learning a control policy that generalizes over the scenario parameter space enables us to analyze the influence of individual parameters on collection performance and provide some intuition about system-level benefits.

Submitted to arXiv on 23 Oct. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2010.12461v3

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper discusses the challenging problem of harvesting data from distributed Internet of Things (IoT) devices using multiple autonomous unmanned aerial vehicles (UAVs). The authors propose a multi-agent reinforcement learning (MARL) approach that can adapt to changes in the scenario parameters without the need for expensive recomputations or relearning control policies. The path planning problem is formulated as a decentralized partially observable Markov decision process (Dec-POMDP), which is solved through deep reinforcement learning (DRL). The authors demonstrate that their proposed network architecture enables effective cooperation among the agents, allowing them to divide the data collection task and make movement decisions that balance data collection goals, flight-time efficiency, and navigation constraints. By learning a control policy that generalizes over the scenario parameter space, the authors are able to analyze the influence of individual parameters on collection performance and provide insights into system-level benefits. Furthermore, this research extends previous work by providing code availability for further exploration. Overall, this paper contributes to addressing the challenges of data harvesting from IoT devices using UAVs and offers valuable insights into optimizing path planning in complex environments.
Created on 27 Sep. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.