An intelligent task allocation and path planning framework for multi-level and multi-type air–ground–human collaboration in public health disinfection

Liuhua Zhang; Zhengquan Li; Nanfeng Zhang; Jingfeng Yang; Yingyi Wu

doi:10.48130/mav-0026-0002

With the increasing demands of urban public health management and emergency epidemic control, traditional single-agent disinfection operations cannot achieve efficient, comprehensive, and sustainable coverage; they suffer from limited coverage, low operational efficiency, high energy consumption, and poor scheduling robustness. To address these challenges, this paper proposes an intelligent task allocation and path planning framework for multi-level, multi-type agent collaboration, integrating UAVs, large spraying vehicles, small electric sprayers, and ground personnel. Specifically, a multi-objective task allocation model is developed that targets operational efficiency, energy consumption, and system robustness while considering task balance, coverage continuity, and fault tolerance. A collaborative path planning strategy dynamically generates optimal coverage paths according to agent characteristics, with conflict detection and adjustment mechanisms ensuring safety and continuity. Furthermore, dynamic scheduling and anomaly prediction modules monitor execution status and enable task reassignment, enhancing system adaptability and reliability. Evaluations on real-world datasets and large-scale simulations demonstrate that the proposed framework outperforms traditional methods in multi-agent collaboration efficiency, coverage, task completion time, and energy consumption, effectively addressing complex operational scenarios. This study provides an efficient, intelligent, and scalable solution for urban public health disinfection, and offers theoretical and methodological insights for emergency management, urban inspection, and multi-agent task planning.


Introduction

Urban public health crises and epidemic emergencies demand rapid and large-scale disinfection operations to ensure environmental safety. Conventional single-agent approaches, such as unmanned aerial vehicles (UAVs) or road-based spraying vehicles, are constrained by limited coverage, high energy consumption, and insufficient robustness. In dense and dynamic urban environments, complex road networks, population mobility, and real-time environmental variations further exacerbate operational challenges.

Recent studies have explored UAV-based disinfection, ground vehicle optimization, and multi-agent coordination. For UAVs, Lu et al.^[1] developed multi-UAV cooperative planning for dynamic obstacle avoidance, while Vazquez-Carmona et al.^[2] optimized spray uniformity and coverage. Dorling et al.^[3] and Vanegas & Gonzalez^[4] integrated energy and terrain constraints, and Agatz et al.^[5] proposed heuristic optimization balancing coverage and energy. Li et al.^[6], Wang et al.^[7], and Jiang et al.^[8] extended multi-objective UAV planning under urban wind fields. Despite this progress, these methods remain limited to static or small-scale tasks and lack adaptability for prolonged, heterogeneous operations^[3−9].

Ground vehicle-based research builds on the classical Vehicle Routing Problem (VRP). Toth & Vigo^[10] and Bräysy & Gendreau^[11] established theoretical frameworks for route optimization with time-window constraints, while Hulagu & Celikoglu^[12] integrated energy and environmental objectives. Liang et al.^[13] and Ighravwe et al.^[14] examined dynamic and sustainable routing for urban sanitation. Yet road-constrained vehicles cannot fully cover off-road or pedestrian areas.

Dual-agent UAV–vehicle collaboration models have been proposed^[15−20], but most studies remain limited to static coordination or two-agent coupling. Systematic frameworks integrating aerial, vehicular, and human operators are scarce, and robustness under complex disturbances has not been adequately validated^[9,16−21].

Dynamic task scheduling and reinforcement-learning-based optimization have recently emerged^[22−25], enabling adaptive coalition formation and load balancing. However, they rarely address heterogeneous, multi-objective trade-offs required for real-world urban disinfection.

This study proposes an intelligent multi-level and multi-type collaborative framework combining UAVs, large spraying vehicles, small electric sprayers, and ground personnel. The system jointly optimizes task allocation, path planning, and dynamic scheduling through multi-objective optimization and reinforcement learning. The key innovations are: (1) A unified optimization model balancing operational efficiency, energy consumption, and robustness under multi-agent constraints. (2) A cooperative path-planning mechanism integrating improved NSGA-II and reinforcement learning for real-time adaptability. (3) A dynamic scheduling and fault-tolerant architecture ensuring continuous operation under environmental or equipment disturbances.
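NSGA-II style methods rank candidate solutions by Pareto dominance. The sketch below shows only that dominance filter, not the improved NSGA-II described in this study. The plan names and objective values (completion time, energy, failure risk, all minimized) are hypothetical and chosen purely for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical objective vector: (completion time in min, energy in MJ, failure risk).
# All three objectives are minimized in this sketch.
Plan = Tuple[float, float, float]


def dominates(a: Plan, b: Plan) -> bool:
    """True if a is no worse than b on every objective and strictly better on one."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(plans: Dict[str, Plan]) -> List[str]:
    """Return the names of plans that no other plan dominates."""
    front = []
    for name, objectives in plans.items():
        dominated = any(
            dominates(other, objectives)
            for other_name, other in plans.items()
            if other_name != name
        )
        if not dominated:
            front.append(name)
    return sorted(front)


# Illustrative values only; they are not taken from the experiments.
plans = {
    "A": (120.0, 140.0, 0.10),
    "B": (130.0, 130.0, 0.12),
    "C": (135.0, 145.0, 0.15),  # worse than A on all three objectives
    "D": (125.0, 135.0, 0.09),
}
front = pareto_front(plans)
print(front)
```

Plans left in the front represent different trade-offs among the objectives; a full NSGA-II would additionally use crowding distance, crossover, and mutation to evolve the population.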

This work contributes a scalable and adaptive decision framework for public-health disinfection and similar urban emergency tasks.

The main contributions of this work can be summarized as follows:

(1) We propose a unified multi-level and multi-type air–ground–human collaborative framework for public health disinfection, which explicitly models heterogeneous agent capabilities, operational constraints, and task characteristics within a single coordinated system.

(2) A hybrid decision-making strategy is developed by coupling offline multi-objective task allocation with online adaptive scheduling, enabling robust coordination under dynamic task demands and environmental disturbances.

(3) Extensive comparative and ablation experiments are conducted under consistent experimental settings to validate the effectiveness, robustness, and practical deployability of the proposed framework in complex urban scenarios.

Research related to this work mainly involves multi-agent task allocation, path planning and coverage strategies, and heterogeneous collaborative frameworks integrating aerial, ground, and human agents. This section reviews representative studies in these areas, and highlights the limitations that motivate the proposed framework.

(1) Task allocation in multi-agent systems

Task allocation is a fundamental problem in multi-agent systems and has been widely studied in robotics, logistics, and autonomous systems. Classical approaches are typically formulated as combinatorial or multi-objective optimization problems, aiming to minimize total cost, completion time, or resource consumption. Representative methods include auction-based mechanisms, mixed-integer programming, and evolutionary algorithms, which perform effectively under static or mildly dynamic environments.
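As a rough illustration of the auction-based mechanisms mentioned above, the following sketch runs a sequential single-item auction: each task goes to the lowest bidder that still has spare capacity. The agent names, task names, bid values, and the `capacity` parameter are all assumptions made for this example and do not come from the studies reviewed here.

```python
from typing import Dict, List


def sequential_auction(
    costs: Dict[str, Dict[str, float]], capacity: int
) -> Dict[str, List[str]]:
    """Assign each task to the cheapest bidder that still has capacity.

    costs[agent][task] is the agent's bid for the task (lower is better).
    An agent that cannot serve a task simply has no bid for it.
    """
    assignment: Dict[str, List[str]] = {agent: [] for agent in costs}
    tasks = sorted({task for bids in costs.values() for task in bids})
    for task in tasks:
        bidders = [
            agent
            for agent in costs
            if task in costs[agent] and len(assignment[agent]) < capacity
        ]
        if not bidders:
            continue  # no eligible agent; the task stays unassigned
        # Ties are broken by agent name so the result is deterministic.
        winner = min(bidders, key=lambda agent: (costs[agent][task], agent))
        assignment[winner].append(task)
    return assignment


# Hypothetical bids, e.g. estimated travel-plus-spraying cost per task.
costs = {
    "uav": {"plaza": 2.0, "street": 5.0, "alley": 4.0},
    "truck": {"plaza": 6.0, "street": 1.0},  # the truck cannot enter the alley
    "crew": {"plaza": 7.0, "street": 8.0, "alley": 3.0},
}
result = sequential_auction(costs, capacity=2)
print(result)
```

A greedy auction like this is fast but not globally optimal, which is one reason optimization-based and learning-based alternatives are studied.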

With the increasing complexity of real-world applications, learning-based approaches, such as reinforcement learning and its variants, have been introduced to enable adaptive task allocation under uncertainty. These methods improve flexibility and scalability, but often require extensive training data, and may suffer from stability issues when facing highly heterogeneous agents or rapidly changing task demands.

Despite these advances, most existing task allocation studies assume homogeneous agents, or only consider limited heterogeneity. The diversity in sensing capability, mobility constraints, and operational roles among UAVs, ground vehicles, and human operators is rarely modeled within a unified allocation framework, which limits the applicability of these methods to complex public health disinfection scenarios.

(2) Path planning and coverage strategies

Path planning and coverage control have been extensively investigated for both aerial and ground robotic platforms. UAV-based coverage planning typically focuses on maximizing area coverage efficiency, while avoiding obstacles and respecting flight constraints. Grid-based decomposition, sampling-based planners, and coverage path planning methods are commonly adopted in aerial disinfection and surveillance tasks.
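Grid-based coverage planning can be illustrated with a back-and-forth (boustrophedon) sweep over a rectangular grid. This is a minimal sketch that assumes an obstacle-free area already decomposed into uniform cells; the function name and grid size are hypothetical, and real planners must also handle obstacles and flight constraints.

```python
from typing import List, Tuple

Cell = Tuple[int, int]  # (row, column) index of a grid cell


def boustrophedon_path(rows: int, cols: int) -> List[Cell]:
    """Visit every cell of a rows x cols grid in a back-and-forth sweep."""
    path: List[Cell] = []
    for r in range(rows):
        # Even rows are swept left to right, odd rows right to left,
        # so consecutive cells are always adjacent.
        col_order = range(cols) if r % 2 == 0 else range(cols - 1, -1, -1)
        path.extend((r, c) for c in col_order)
    return path


path = boustrophedon_path(3, 4)
print(path)
```

Because each row reverses direction, the agent never has to travel back across an already covered row before starting the next one.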

In contrast, ground vehicle path planning is often constrained by road networks, accessibility, and turning limitations. Graph-based shortest-path algorithms and route optimization methods are widely used to ensure feasibility and efficiency in urban environments. Human-assisted operations introduce additional flexibility, but also uncertainty due to variable execution speeds and subjective decision-making.
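The graph-based shortest-path idea for road-constrained vehicles can be sketched with Dijkstra's algorithm on a small directed road graph. The node names and edge weights below are invented for illustration; directed edges stand in for one-way streets.

```python
import heapq
from typing import Dict, List, Tuple

Graph = Dict[str, Dict[str, float]]  # graph[u][v] = length of road segment u -> v


def dijkstra(graph: Graph, source: str, target: str) -> Tuple[float, List[str]]:
    """Return (total length, node sequence) of a shortest route, or (inf, [])."""
    dist: Dict[str, float] = {source: 0.0}
    prev: Dict[str, str] = {}
    heap: List[Tuple[float, str]] = [(0.0, source)]
    visited = set()
    while heap:
        d, node = heapq.heappop(heap)
        if node in visited:
            continue  # stale heap entry
        visited.add(node)
        if node == target:
            break
        for neighbor, weight in graph.get(node, {}).items():
            candidate = d + weight
            if candidate < dist.get(neighbor, float("inf")):
                dist[neighbor] = candidate
                prev[neighbor] = node
                heapq.heappush(heap, (candidate, neighbor))
    if target not in dist:
        return float("inf"), []
    # Walk predecessors back from the target to rebuild the route.
    route = [target]
    while route[-1] != source:
        route.append(prev[route[-1]])
    return dist[target], route[::-1]


# Hypothetical road network with segment lengths in arbitrary units.
roads: Graph = {
    "depot": {"a": 4.0, "b": 1.0},
    "b": {"a": 2.0, "c": 5.0},
    "a": {"c": 1.0},
    "c": {},
}
length, route = dijkstra(roads, "depot", "c")
print(length, route)
```

Turning limits or accessibility rules can be encoded by removing or re-weighting edges before the search.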

While these methods are effective for individual agent types, most existing studies treat aerial, ground, and human agents independently. Directly combining heterogeneous path planning strategies without a unified coordination mechanism often leads to inefficiencies, conflicts, or suboptimal global performance, particularly in time-critical and large-scale disinfection tasks.

(3) Heterogeneous air–ground–human collaborative frameworks

Recently, increasing attention has been given to heterogeneous collaborative systems that integrate multiple agent types. Several studies explore cooperation between UAVs and ground robots to leverage complementary advantages, such as aerial perception and ground-level execution. Other works incorporate human participation to enhance adaptability in complex or uncertain environments.

However, most existing heterogeneous collaboration frameworks focus on specific agent combinations or isolated subsystems, such as perception sharing or communication protocols. Comprehensive frameworks that simultaneously address task allocation, path planning, and adaptive scheduling across aerial, ground, and human agents remain limited. Moreover, robustness against dynamic task changes, agent failures, and environmental disturbances is often insufficiently considered.

In contrast to existing approaches, this work emphasizes a system-level collaborative framework that unifies heterogeneous agents through hybrid offline multi-objective optimization and online adaptive scheduling. By explicitly modeling agent diversity and integrating predictive coordination mechanisms, the proposed framework aims to bridge the gap between theoretical multi-agent methods and practical deployment requirements in public health disinfection operations.

Unlike studies that focus on improving a single task allocation or path planning algorithm, this work aims to provide a system-level collaborative framework that integrates heterogeneous agents, multiple optimization paradigms, and adaptive scheduling mechanisms to support real-world public health disinfection operations.

System modeling and collaborative framework

Experimental evaluation and performance analysis

Discussion and practical implications

This study addresses challenges in urban public health disinfection and multi-agent collaborative operations, including low operational efficiency, uneven coverage, high energy consumption, and limited scheduling robustness. The proposed intelligent task allocation and path planning framework enables multi-level, multi-type agent collaboration. By constructing a multi-objective task allocation model targeting operational efficiency, energy consumption (EC), task balance (TB), coverage continuity (CC), and system robustness, the framework achieves global optimization and dynamic scheduling of multi-agent tasks.

The dynamic path planning algorithm is tailored to different agent types (UAVs, large spraying vehicles, small electric sprayers, and ground personnel) and incorporates conflict detection, obstacle avoidance, and task priority adjustment mechanisms, ensuring safe, continuous, and optimal coverage. A real-time monitoring and anomaly detection mechanism allows dynamic responses to emergent events and environmental changes. Experimental results demonstrate that the proposed approach outperforms traditional centralized and static scheduling methods in operational efficiency, coverage, energy optimization, task completion time, and scheduling robustness. Multi-objective optimization balances efficiency and resource consumption, while TB and CC metrics ensure uniform and continuous operations. Anomaly detection further enhances adaptability to unexpected tasks. Validation across different scenarios and task scales shows high scalability and generality, offering a practical technical solution for urban disinfection, multi-agent collaborative operations, and intelligent inspection tasks.
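To make the idea of conflict detection and adjustment concrete, the sketch below checks whether two agents occupy the same grid cell at the same time step and resolves the conflict by delaying one of them. This is a simplified illustration under assumed conditions (discrete time, a shared grid, same-cell conflicts only), not the mechanism implemented in this study; the agent names and paths are hypothetical.

```python
from typing import Dict, List, Tuple

Cell = Tuple[int, int]
Conflict = Tuple[int, Cell, str, str]  # (time step, cell, agent, agent)


def find_conflicts(paths: Dict[str, List[Cell]]) -> List[Conflict]:
    """List every time step at which two agents are in the same cell."""
    conflicts: List[Conflict] = []
    agents = sorted(paths)
    for i, first in enumerate(agents):
        for second in agents[i + 1:]:
            # zip stops at the shorter path, so only overlapping steps are compared.
            for t, (cell_a, cell_b) in enumerate(zip(paths[first], paths[second])):
                if cell_a == cell_b:
                    conflicts.append((t, cell_a, first, second))
    return conflicts


def delay(path: List[Cell], steps: int) -> List[Cell]:
    """Make an agent wait at its start cell for the given number of steps."""
    return [path[0]] * steps + path


# Hypothetical timed paths for two ground agents on a shared grid.
paths = {
    "sprayer": [(0, 0), (0, 1), (0, 2)],
    "crew": [(1, 1), (0, 1), (0, 0)],
}
conflicts = find_conflicts(paths)

# One simple adjustment: the lower-priority agent waits one step.
adjusted = dict(paths, crew=delay(paths["crew"], 1))
remaining = find_conflicts(adjusted)
print(conflicts, remaining)
```

Waiting is only one possible adjustment; rerouting or changing task priority are alternatives, and a fuller check would also cover agents swapping cells in a single step.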

Despite these promising results, several directions for improvement remain. In complex urban environments, agents face challenges such as multi-level buildings, dynamic weather conditions, traffic constraints, and moving obstacles. Future work will focus on enhancing adaptability under such conditions through multi-level task allocation, dynamic path planning optimization, and adaptive adjustment of multi-objective weights. Integrating reinforcement learning, deep multi-agent systems, and graph neural networks could further improve collaborative decision-making, operational efficiency, resource utilization, and task completion quality.

Additionally, leveraging multi-source sensor data, historical task data, and environmental information can support more precise anomaly prediction and dynamic task prioritization models, enabling rapid responses to emergent tasks and environmental changes. Energy consumption models and charging scheduling strategies for different agent types will be developed to optimize energy use and ensure sustainable operations.

[1]	Lu M, Fan X, Chen H, Lu P. 2025. FAPP: fast and adaptive perception and planning for UAVs in dynamic cluttered environments. IEEE Transactions on Robotics 41:871−886 doi: 10.1109/tro.2024.3522187/mm1
[2]	Vazquez-Carmona EV, Vasquez-Gomez JI, Herrera-Lozada JC, Antonio-Cruz M. 2022. Coverage path planning for spraying drones. Computers & Industrial Engineering 168:108125 doi: 10.1016/j.cie.2022.108125
[3]	Dorling K, Heinrichs J, Messier GG, Magierowski S. 2017. Vehicle routing problems for drone delivery. IEEE Transactions on Systems, Man, and Cybernetics: Systems 47(1):70−85 doi: 10.1109/TSMC.2016.2582745
[4]	Vanegas F, Gonzalez F. 2016. Enabling UAV navigation with sensor and environmental uncertainty in cluttered and GPS-denied environments. Sensors 16(5):666 doi: 10.3390/s16050666
[5]	Agatz N, Bouman P, Schmidt M. 2018. Optimization approaches for the traveling salesman problem with drone. Transportation Science 52(4):739−1034 doi: 10.1287/trsc.2017.0791
[6]	Li W, Xiong Y, Xiong Q. 2025. Reinforcement learning-guided particle swarm optimization for multi-objective unmanned aerial vehicle path planning. Symmetry 17(8):1292 doi: 10.3390/sym17081292
[7]	Wang H, Tan L, Shi J, Lv X, Lian X. 2021. An improved NSGA-II algorithm for UAV path planning problems. Journal of Internet Technology 22(3):583−592 doi: 10.3966/160792642021052203008
[8]	Jiang C, Yang L, Gao Y, Zhao J, Hou W, et al. 2025. An intelligent 5G unmanned aerial vehicle path optimization algorithm for offshore wind farm inspection. Drones 9(1):47 doi: 10.3390/drones9010047
[9]	Zhang Y, Fan X, Cheng Z, Xue C. 2025. Multi-USV task assignment based on NSGA II-MC. IEEE Access 13:62577−62590 doi: 10.1109/ACCESS.2025.3557582
[10]	Toth P, Vigo D. 2014. Vehicle Routing: Problems, Methods, and Applications, Second Edition. USA: Society for Industrial and Applied Mathematics Press. doi: 10.1137/1.9781611973594.fm
[11]	Bräysy O, Gendreau M. 2005. Vehicle routing problem with time windows, part I: route construction and local search algorithms. Transportation Science 39(1):104−118 doi: 10.1287/trsc.1030.0056
[12]	Hulagu S, Celikoglu HB. 2022. Electric vehicle location routing problem with vehicle motion dynamics-based energy consumption and recovery. IEEE Transactions on Intelligent Transportation Systems 23:10275−10286 doi: 10.1109/TITS.2021.3089675
[13]	Liang Q, Xiao H, Wu H, Long J, Qin H, et al. 2024. Integrated environment-sensing path planning method for electric unmanned sanitation vehicle. IEEE Sensors Journal 24:29243−29257 doi: 10.1109/jsen.2024.3430083
[14]	Ighravwe DE, Oke SA, Aikhuele D, Ojo A. 2020. An optimisation approach to road sanitation workforce planning using differential evolution. Journal of Urban Management 9(4):398−407 doi: 10.1016/j.jum.2020.06.004
[15]	Murray CC, Chu AG. 2015. The flying sidekick traveling salesman problem: optimization of drone-assisted parcel delivery. Transportation Research Part C: Emerging Technologies 54:86−109 doi: 10.1016/j.trc.2015.03.005
[16]	Huang H, Wen X, Niu M, Miah MS, Gao T, et al. 2024. Multi-UAVs assisted path planning method for terrain-oriented air–ground collaborative vehicular network architecture. IEEE Transactions on Intelligent Vehicles 9(12):7840−7851 doi: 10.1109/TIV.2024.3402434
[17]	Otto A, Agatz N, Campbell J, Golden B, Pesch E. 2018. Optimization approaches for civil applications of unmanned aerial vehicles (UAVs) or aerial drones: a survey. Networks 72(4):411−458 doi: 10.1002/net.21818
[18]	Wang D, Hu P, Du J, Zhou P, Deng T, et al. 2019. Routing and scheduling for hybrid truck-drone collaborative parcel delivery with independent and truck-carried drones. IEEE Internet of Things Journal 6(6):10483−10495 doi: 10.1109/JIOT.2019.2939397
[19]	Ma Z, Xiong J, Gong H, Wang X. 2025. Adaptive depth graph neural network-based dynamic task allocation for UAV-UGVs under complex environments. IEEE Transactions on Intelligent Vehicles 10(5):3573−3586 doi: 10.1109/TIV.2024.3457493
[20]	Kim H, Aung PS, Munir MS, Saad W, Hong CS. 2025. Cooperative urban air mobility trajectory design for power and AoI optimization: a multi-agent reinforcement learning approach. IEEE Transactions on Vehicular Technology 74(9):14799−14804 doi: 10.1109/TVT.2025.3561155
[21]	Hudson N, Talbot F, Cox M, Williams J, Hines T, et al. 2022. Heterogeneous ground and air platforms, homogeneous sensing: team CSIRO data61's approach to the DARPA subterranean challenge. Field Robotics 2:595−636 doi: 10.55417/fr.2022021
[22]	Dai W, Rai U, Chiun J, Cao Y, Sartoretti G. 2025. Heterogeneous multi-robot task allocation and scheduling via reinforcement learning. IEEE Robotics and Automation Letters 10(3):2654−2661 doi: 10.1109/LRA.2025.3534682
[23]	Liu D, Dou L, Zhang R, Zhang X, Zong Q. 2023. Multi-agent reinforcement learning-based coordinated dynamic task allocation for heterogenous UAVs. IEEE Transactions on Vehicular Technology 72(4):4372−4383 doi: 10.1109/tvt.2022.3228198
[24]	Hua M, Yao XY, Liu WJ, Geng MJ, Lv M. 2025. Optimizing fixed-time generalized nash equilibrium seeking in multi-autonomous aerial vehicle games. IEEE Transactions on Aerospace and Electronic Systems 61(3):6339−6353 doi: 10.1109/taes.2025.3526559
[25]	Long Y, Xu G, Zhao J, Xie B, Fang M. 2023. Dynamic truck–UAV collaboration and integrated route planning for resilient urban emergency response. IEEE Transactions on Engineering Management 71:9826−9838 doi: 10.1109/tem.2023.3299693

Assignment strategy	CR (%)	WE (%)	EC (MJ)	RD (%)	RB	TB (%)	CC (%)	FTR (%)	CF (%)
Single agent UAV	72.45	65.12	120.34	15.22	0.81	67.5	70.12	62.34	8.45
Single agent large spraying vehicle	68.34	62.45	115.78	18.15	0.78	64.81	66.5	59.12	10.22
Single agent small electric sprayers	61.78	60.12	112.45	17.34	0.75	62.50	63.45	57.12	11.33
Single agent ground personnel	55.21	54.23	110.45	20.45	0.72	60.15	60.11	55.34	12.33
Multi-agent collaboration (static)	88.34	80.12	140.56	10.12	0.88	75.23	82.45	74.12	6.78
Multi-agent collaboration (dynamic)	92.56	85.45	138.23	8.45	0.91	78.34	86.12	81.23	5.12
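
The gap between the static and dynamic multi-agent rows above can be expressed as a relative change. The short sketch below does this arithmetic for the CR and EC columns only, using the values copied from those two rows; the helper name `relative_change` is an assumption introduced for this example.

```python
def relative_change(new: float, old: float) -> float:
    """Percentage change from old to new, relative to old."""
    return (new - old) / old * 100.0


# Values copied from the static and dynamic multi-agent rows of the table above.
static = {"CR": 88.34, "EC": 140.56}
dynamic = {"CR": 92.56, "EC": 138.23}

cr_points = dynamic["CR"] - static["CR"]  # difference in percentage points
cr_change = relative_change(dynamic["CR"], static["CR"])
ec_change = relative_change(dynamic["EC"], static["EC"])

print(f"CR: {cr_points:+.2f} points ({cr_change:+.2f}% relative)")
print(f"EC: {ec_change:+.2f}% relative")
```

Reporting both the percentage-point difference and the relative change avoids ambiguity when the metric itself is already a percentage.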

Assignment strategy/algorithm	CR (%)	EC (MJ)	WE (%)	RD (%)	RB	TB (%)	CC (%)	FTR (%)	CF (%)	Path length (m)
UAV - A*	74.12	125.34	66.78	14.56	0.82	68.45	71.12	63.34	8.78	1,450
UAV - RRT*	75.34	122.56	68.12	13.89	0.84	69.78	72.45	64.12	8.12	1,423
UAV - GA	72.45	120.78	65.12	15.22	0.81	67.5	70.12	62.34	8.45	1,478
UAV - DRL	78.23	118.34	70.45	12.78	0.85	71.23	73.45	65.12	7.89	1,402
Large spraying vehicle - Dijkstra	69.12	115.78	63.45	17.34	0.78	65.12	66.78	59.45	10.22	1,600
Large spraying vehicle - RL	70.45	113.45	64.78	16.89	0.8	66.45	67.23	60.12	9.78	1,575
Small electric sprayers - GA	61.78	112.45	60.12	17.34	0.75	62.5	63.45	57.12	11.33	1,520
Ground personnel - Graph Theory	55.21	110.45	54.23	20.45	0.72	60.15	60.1	55.34	12.33	1,680
Multi-agent static	88.34	140.56	80.12	10.12	0.88	75.23	82.45	74.12	6.78	1,300
Multi-agent dynamic	92.56	138.23	85.45	8.45	0.91	78.34	86.12	81.23	5.12	1,285

Strategy	ART (s)	TCR (%)	DR (%)	RR (%)	RB	AH	EC (MJ)	ET (min)	CF (%)
Static	10.45	80.12	10.56	74.23	0.78	68.34	138.45	145.2	7.78
Predictive (LSTM)	9.12	86.45	9.78	76.56	0.82	71.12	134.12	138.3	6.45
Predictive (transformer)	8.89	87.78	9.12	78.45	0.84	73.56	133.45	137.8	6.12
RL-based	7.78	88.34	8.56	80.12	0.88	76.23	132.78	136.5	5.89
Multi-agent dynamic	6.45	90.56	7.12	83.23	0.91	78.34	128.23	128.5	5.62

Method	CR (%)	EC (MJ)	WT (min)	WE (%)	RD (%)	RB	TB (%)	CC (%)	FTR (%)	CF (%)
Static NSGA-II	84.23	142.45	145.28	72.56	15.12	0.78	68.12	70.45	74.23	8.78
MOEA/D	85.12	140.78	142.34	73.45	14.78	0.84	69.34	71.12	76.12	8.12
Multi-agent DDPG	87.45	138.34	138.55	75.23	13.45	0.85	71.45	73.12	77.45	7.89
Improved NSGA-II + RL	91.56	132.23	132.51	77.45	13.12	0.88	72.34	74.02	78.23	7.62
