Urban low-altitude drone route planning: theory and algorithm in high-density CBD scenarios
Systematically analyzes the core problems and solution ideas of urban low-altitude UAV route planning, covering A*, RRT*, APF, FM², MILP, ORCA and MARL methods, with complete mathematical derivation and equations.
Urban low-altitude UAV route planning: theory and algorithm in high-density CBD scenarios
When hundreds of drones shuttle between skyscrapers at the same time, route planning is no longer a simple problem of “flying from point A to point B” - it is a high-dimensional constrained optimization problem that seeks a balance between three-dimensional space, time, energy, and safety.
Introduction: Why is CBD so difficult?
Urban low-altitude airspace is usually defined as the flight range 0–300 m (AGL) above the ground. This height level happens to be the main battlefield for UAV logistics, inspection, emergency response and other applications. The CBD (Central Business District) is the most complex sub-scenario for three reasons:
1. Dense buildings form an “urban canyon”
High-rise buildings make the available flight corridors extremely narrow, and the line of sight is blocked, which reduces the accuracy of GPS. The edges of the buildings also generate strong turbulence. At low altitudes below 50 meters, these turbulences can completely cause a small multi-rotor to lose control.
2. High-density UAVs cause intensive conflicts
In a suburban scene, there may be only a few drones flying at the same time; while under a mature urban air traffic management (UTM) system, the number of drones over the CBD may reach more than 40 drones per minute. This means that Conflict Detection & Resolution (CD&R) becomes the core bottleneck of the system rather than a peripheral function.
3. Dynamic obstacle and multi-constraint coupling
In addition to buildings, drones also have to deal with temporary no-fly zones, manned aircraft routes, real-time wind field changes, and safety risks from crowd density on the ground - all of which work together to make it difficult for any single path planning algorithm to deal with alone.
1. Problem modeling: turning flight problems into mathematical problems
1.1 3D Occupancy Grid
Discretize urban space into a voxel grid, and each voxel records its occupancy status:
Voxel resolution is typically 1–5 m and the CBD core region can be refined to 0.5 m. Building height data comes from a GIS (Geographic Information System) database and is dynamically updated combined with real-time sensors.
1.2 Mathematical definition of 4D trajectory
The flight trajectory of a single UAV is a space curve parameterized with respect to time:$$
\boldsymbol{\xi}(t) = \bigl(x(t),; y(t),; z(t)\bigr), \quad t \in [t_0,, t_f]
|\boldsymbol{\xi}_i(t) - \boldsymbol{\xi}_j(t)|2 \geq d{sep}, \quad \forall, i \neq j,; \forall, t \in [t_0, t_f]
d(u,v) = \ell_{uv}\cdot\bigl(1 + \beta\cdot\mathcal{R}_{uv}\bigr)
$$Among them, is the corridor segment length, is the ground risk score of the corridor (combining factors such as population density, building type, accident consequences, etc.), and is the risk weight coefficient. This makes A* tend to choose paths that fly over low-risk areas (e.g. rivers, parks), even if they are slightly detoured.
Limitations of A*: The quality of the airspace map determines the quality of the understanding. In high-density CBD, the number of nodes in the graph can reach hundreds of thousands, and the construction of the graph itself is a challenge.
RRT* (Rapidly-exploring Random Tree Star) explores feasible paths by randomly sampling in continuous space, which is particularly suitable for high-dimensional and complex obstacle scenes.
Nearest neighbor query - Find the node closest to the random sampling point in the tree :
Step expansion - Extend step size from to direction:
The core improvement of RRT - Rewire:* Find all neighboring nodes in the sphere with as the center and radius :
Where is the number of nodes of the current tree, is the space dimension (3D scene ), and is a constant related to the free space volume. This radius shrinks as the sampling points increase, ensuring asymptotic optimality.
Cost update:
If the cost of can be reduced through , reconnection is performed:$$
\text{If } c(x_{near}) > c(x_{new}) + d(x_{new},, x_{near}),\text{ then change the parent node of } x_{near} \text{ to } x_{new}
VO_{ij}^\tau = \left{\mathbf{v}_i ;\middle|; \exists, t\in[0,\tau],; \mathbf{p}_i+\mathbf{v}_i t ;\in; \mathbf{p}_j+\mathbf{v}j t \oplus \mathcal{D}(d{sep})\right}
where is the size of the minimum velocity change, and points to the normal direction of the boundary.
The set of feasible velocities for agent (intersect all neighbor constraints and then intersect with dynamic constraints):
Among them, encodes dynamic constraints such as maximum velocity and acceleration. ORCA has achieved a 100% success rate in density scenarios of more than 40 frames/minute, with a computational complexity of , making it suitable for real-time deployment.
4. Graph theory modeling: urban airspace network
4.1 Construction of route network diagram
Urban airspace is modeled as a weighted directed graph:
Node: above road intersections, tops of buildings, key transfer points
Edge: Legal flight corridor between two nodes (needs to pass collision detection verification)
Edge weight: multi-objective scalar weighting
Corridor capacity constraints (the number of UAVs passing at the same time does not exceed the upper limit):
The occupancy status of the entire airspace can be described by a four-dimensional tensor ( is the voxel grid, is the number of time slots):$$
\mathbf{A} \in {0,1}^{N_x\times N_y\times N_z\times N_t}, \quad A_{x,y,z,t} = 1 \iff \exists\text{ UAV occupied voxel}(x,y,z)\text{ in time slot }t
You can't use 'macro parameter character #' in math mode ### 4.2 Rotor UAV energy consumption model Energy consumption is an important optimization goal for route planning and requires accurate modeling. **Hover Power** (derived from leaf element momentum theory):
You can't use 'macro parameter character #' in math mode For a typical small multicopter ($m\approx 1\,\text{kg}$), $v^*$ is typically between 8–12 m/s. ---## 5. Wind field and urban canyon effect ### 5.1 Urban wind field modeling The wind speed distribution in urban canyons is much more complex than in the countryside, and the Weibull distribution is widely used in statistical modeling:
TI = \frac{\sigma_u}{\bar{u}}, \qquad \sigma_u = \sqrt{\overline{u’^2}}
You can't use 'macro parameter character #' in math mode Corridors with $TI > 0.3$ are usually marked as high risk, and the planner will actively avoid or increase the edge weight of this segment. ### 5.2 Dynamic safety radiusThe turbulence around buildings increases sharply as the height margin decreases. Therefore, the safe clearance distance should not be a fixed constant, but should be dynamically adjusted with the flight altitude:
\rho\bigl(\mathbf{p}(t)\bigr) \geq d_{safe}\bigl(z(t)\bigr), \quad \forall, t \in [t_0, t_f]
You can't use 'macro parameter character #' in math mode --- ## 6. Multi-machine collaborative optimization: MILP global modeling For the joint path and time slot allocation problem of $N$ drones, a **Mixed Integer Linear Programming (MILP)** model can be established to obtain the global optimal solution at small to medium scale ($N\leq 50$). **Objective function** (minimize the total completion time and energy consumption of all drones):
You can't use 'macro parameter character #' in math mode MINLP is an NP-hard problem that is approximately solved by commonly used heuristic algorithms in practice (random fractal search SFS, cheetah optimization MCO, etc.). --- ## 7. Reinforcement learning solution: MARL and attention mechanism When the scale of UAVs exceeds a hundred, the computational complexity of MILP is unacceptable. **Multi-agent reinforcement learning (MARL)** provides an alternative for offline training and extremely fast inference. ### 7.1 Reward function design The reward received by each drone $i$ at time step $t$:
r_i^t = r_{arrive}\cdot\mathbf{1}[goal] - c_{step} - c_{conflict}\cdot\mathbf{1}[conflict] - c_{detour}\cdot|\mathbf{p}i^t - \mathbf{p}{direct}|
$$The meaning of each item: is the positive reward for reaching the target; is the time penalty for each flight step; is the penalty when a conflict occurs; is the detour penalty for deviating from the straight line.
7.2 Double-DQN update (discrete action space)
The online network selects actions, and the target network evaluates values, decoupling selection and evaluation to reduce overestimation bias.
The decision-making of each drone in the CBD requires sensing the status of its surrounding neighbors. The attention mechanism allows agent to dynamically weight the influence of neighbors :
The attention weight reflects the relevance of the neighbor to the decision-making of the agent . Neighbors with close distances and large speed conflicts naturally receive higher weights.
You can't use 'macro parameter character #' in math mode The Clip operation limits the update step size to the range of $[1-\varepsilon,\, 1+\varepsilon]$ (usually $\varepsilon=0.2$) to prevent training from crashing due to excessive policy updates. **Centralized Training, Decentralized Execution (CTDE) Paradigm:** - **Training phase**: Evaluation network $V(s^{global};\phi)$ uses global state and can perceive all agent information - **Execution phase**: Policy network $\pi_\theta(a_i\mid o_i)$ only uses local observations of agent $i$, without communication --- ## 8. Trajectory Smoothing: Bézier Curve and Minimum Snap The output of path planning is often a series of discrete waypoints, and directly tracking these waypoints will produce unfeasible sharp turns. It is necessary to generate dynamically feasible continuous trajectories through **trajectory smoothing**. ### 8.1 Bézier Curve A Bézier curve of order $n$ is defined by $n+1$ control points $\{\mathbf{P}_i\}$:
\boldsymbol{\xi}(u) = \sum_{i=0}^{n}\binom{n}{i}(1-u)^{n-i}u^i,\mathbf{P}_i, \quad u \in [0,1]
You can't use 'macro parameter character #' in math mode ### 8.2 Minimum Snap: The standard solution for quadcopters For a quadcopter UAV, minimizing Snap (the second derivative of acceleration) is equivalent to minimizing the rate of change of required thrust, resulting in optimal flight dynamics:
You can't use 'macro parameter character #' in math mode The matrix $\mathbf{Q}$ encodes the Snap integral (can be calculated analytically), and the equality constraint $\mathbf{A}_{eq}\mathbf{c}=\mathbf{b}_{eq}$ forces the trajectory to pass through all path points and ensures the continuity of position, velocity, and acceleration between segments. --- ## 9. Horizontal comparison of methods| Method | Completeness | Optimality | Time complexity | Real-time | Multi-machine scalability | |------|--------|--------|------------|--------|------------| | **A\*** | Complete | Optimal (discrete graph) | $O(b^d)$ | Medium | Poor | | **RRT\*** | Probabilistically complete | Asymptotically optimal | $O(n\log n)$ | Better | Medium | | **APF** | Incomplete | No guarantee | $O(1)$/step | Excellent | Good | | **FM²** | Complete | Optimal (continuous) | $O(N\log N)$ | Medium | Medium | | **MILP** | Complete | Global Optimal | NP-hard | Poor | Medium ($N\leq50$) | | **ORCA** | Probabilistically complete | Local optimum | $O(N^2)$ | Excellent | Excellent | | **MARL+Attn** | Complete probability | Approximate | Heavy training, fast inference | Good | Excellent | **Selection suggestions:** - **Small scale, high security requirements** ($N\leq 20$) → MILP global optimal - **Medium scale, real-time sensitive** ($20 < N \leq 100$) → A\* / RRT\* + ORCA conflict resolution - **Large scale, high density** ($N > 100$) → MARL + attention mechanism (inference delay $< 10\,\text{ms}$) --- ## 10. Summary and Outlook Urban low-altitude, especially high-density UAV route planning in CBD scenarios, is a multidisciplinary system engineering problem. This article sorts out the complete method chain from **single-machine path planning** (A\*, RRT\*, APF, FM²) to **multi-machine conflict resolution** (CD&R, ORCA, MILP) to **learning methods** (MARL, PPO, attention), and gives the precise mathematical expression of each core link. **Three main unsolved challenges:** 1. **Real-time online Replanning**: When a sudden no-fly zone or drone failure occurs, the system needs to complete the replanning of all affected trajectories within 200 ms. Currently MILP falls far short of this requirement and MARL is the most promising candidate.2. **Digital twins and perception fusion**: Accurate real-time three-dimensional urban maps (including dynamic building construction, temporary enclosures, and meteorological information) are the basis for the quality of route planning. Digital twin technology is expected to achieve centimeter-level and sub-second-level airspace status synchronization. 3. **Technical implementation of the regulatory framework**: The Civil Aviation Administration of China (CAAC) low-altitude management regulations, European EASA U-Space, and American FAA UTM CONOPS all have clear requirements for conflict resolution time, flight plan submission format, emergency landing procedures, etc., and algorithm design needs to be deeply coupled with regulatory boundaries. > From a mathematical perspective, urban low-altitude air route planning is a non-convex, non-linear, mixed integer, multi-agent, real-time constrained optimization problem. No single framework can "solve it with one click" - in engineering practice, it is often a multi-level hybrid architecture: map planning is used at the strategic layer, ORCA is used at the tactical layer, and APF is used at the emergency layer, which together form a robust air traffic management system. --- **Main references:**1. Karaman, S., & Frazzoli, E. (2011). *Sampling-based algorithms for optimal motion planning.* International Journal of Robotics Research, 30(7), 846–894. 2. Van den Berg, J., Guy, S. J., Lin, M., & Manocha, D. (2011). *Reciprocal n-body collision avoidance.* Robotics Research, 3–19. 3. Zeng, Y., Xu, J., & Zhang, R. (2019). *Energy minimization for wireless communication with rotary-wing UAV.* IEEE Transactions on Wireless Communications, 18(4), 2329–2345. 4. Mueller, M. W., Hehn, M., & D'Andrea, R. (2015). *A computationally efficient motion primitive for quadrocopter trajectory generation.* IEEE Transactions on Robotics, 31(6), 1294–1310. 5. Brittain, M., & Wei, P. (2019). *Autonomous air traffic controller: A deep multi-agent reinforcement learning approach.* arXiv:1905.01303. 6. Bertram, J., & Wei, P. (2020). *Distributed computational guidance for high-density urban air mobility.* AIAA Aviation Forum. 7. Valavanis, K. P., & Vachtsevanos, G. J. (Eds.). (2015). *Handbook of Unmanned Aerial Vehicles.* Springer. 8. Augugliaro, F., Schoellig, A. P., & D'Andrea, R. (2012). *Generation of collision-free trajectories for a quadrocopter fleet.* IEEE/RSJ IROS, 3977–3982.