Urban low-altitude UAV route planning: theory and algorithm in high-density CBD scenarios

When hundreds of drones shuttle between skyscrapers at the same time, route planning is no longer a simple problem of “flying from point A to point B” - it is a high-dimensional constrained optimization problem that seeks a balance between three-dimensional space, time, energy, and safety.

Introduction: Why is CBD so difficult?

Urban low-altitude airspace is usually defined as the flight range 0–300 m (AGL) above the ground. This height level happens to be the main battlefield for UAV logistics, inspection, emergency response and other applications. The CBD (Central Business District) is the most complex sub-scenario for three reasons:

1. Dense buildings form an “urban canyon”

High-rise buildings make the available flight corridors extremely narrow, and the line of sight is blocked, which reduces the accuracy of GPS. The edges of the buildings also generate strong turbulence. At low altitudes below 50 meters, these turbulences can completely cause a small multi-rotor to lose control.

2. High-density UAVs cause intensive conflicts

In a suburban scene, there may be only a few drones flying at the same time; while under a mature urban air traffic management (UTM) system, the number of drones over the CBD may reach more than 40 drones per minute. This means that Conflict Detection & Resolution (CD&R) becomes the core bottleneck of the system rather than a peripheral function.

3. Dynamic obstacle and multi-constraint coupling

In addition to buildings, drones also have to deal with temporary no-fly zones, manned aircraft routes, real-time wind field changes, and safety risks from crowd density on the ground - all of which work together to make it difficult for any single path planning algorithm to deal with alone.

1. Problem modeling: turning flight problems into mathematical problems

1.1 3D Occupancy Grid

Discretize urban space into a voxel grid, and each voxel records its occupancy status:

Voxel resolution is typically 1–5 m and the CBD core region can be refined to 0.5 m. Building height data comes from a GIS (Geographic Information System) database and is dynamically updated combined with real-time sensors.

1.2 Mathematical definition of 4D trajectory

The flight trajectory of a single UAV is a space curve parameterized with respect to time:$$ \boldsymbol{\xi}(t) = \bigl(x(t),; y(t),; z(t)\bigr), \quad t \in [t_0,, t_f]

|\boldsymbol{\xi}_i(t) - \boldsymbol{\xi}_j(t)|2 \geq d{sep}, \quad \forall, i \neq j,; \forall, t \in [t_0, t_f]

You can't use 'macro parameter character #' in math mode Where $d_{sep}$ is the minimum safe separation, with a typical value of 5–30 m (depending on flight speed and GPS accuracy). ### 1.3 General form of multi-objective optimization problem Route planning is essentially a constrained multi-objective optimization problem:

\min_{\boldsymbol{\xi}}; J(\boldsymbol{\xi}) = w_1 J_{len} + w_2 J_{time} + w_3 J_{energy} + w_4 J_{risk}

You can't use 'macro parameter character #' in math mode The meaning of each sub-item is as follows: | Breakdown | Meaning | Typical measures | |------|------|----------| | $J_{len}$ | Path length | $\int_{t_0}^{t_f}\|\dot{\boldsymbol{\xi}}\|\,\mathrm{d}t$ | | $J_{time}$ | Flight time | $t_f - t_0$ | | $J_{energy}$ | Energy consumption | $\int P(v)\,\mathrm{d}t$ | | $J_{risk}$ | Ground risk | Points for flying over populated areas | Constraints (all are indispensable):- **Accessibility**: $O(\boldsymbol{\xi}(t)) = 0,\;\forall t$ - **Kinematics**: $\dot{\mathbf{x}} = f(\mathbf{x}, \mathbf{u})$ (UAV kinematics/dynamics model) - **Safe separation**: $\|\boldsymbol{\xi}_i(t)-\boldsymbol{\xi}_j(t)\| \geq d_{sep},\;\forall i\neq j$ - **Boundary conditions**: $\boldsymbol{\xi}(t_0)=\mathbf{p}_{start},\;\boldsymbol{\xi}(t_f)=\mathbf{p}_{goal}$ - **Speed limit**: $v_{min} \leq \|\dot{\boldsymbol{\xi}}(t)\| \leq v_{max}$ --- ## 2. Single machine path planning algorithm Before dealing with multi-machine collaboration, first understand the core algorithm in a single-machine scenario. ### 2.1 A* algorithm: the cornerstone of graph search A* searches for the shortest path on a discretized spatial graph (Waypoint Graph or Visibility Graph). The evaluation value of each node $n$ is:

f(n) = g(n) + h(n)

g(n) = g(\text{parent}) + d(\text{parent},, n)

h(n) = |\mathbf{p}n - \mathbf{p}{goal}|_2 = \sqrt{(x_n-x_g)^2+(y_n-y_g)^2+(z_n-z_g)^2}

d(u,v) = \ell_{uv}\cdot\bigl(1 + \beta\cdot\mathcal{R}_{uv}\bigr) $$Among them, is the corridor segment length, is the ground risk score of the corridor (combining factors such as population density, building type, accident consequences, etc.), and is the risk weight coefficient. This makes A* tend to choose paths that fly over low-risk areas (e.g. rivers, parks), even if they are slightly detoured.

Limitations of A*: The quality of the airspace map determines the quality of the understanding. In high-density CBD, the number of nodes in the graph can reach hundreds of thousands, and the construction of the graph itself is a challenge.

2.2 RRT* algorithm: probabilistically complete asymptotic optimal planning

RRT* (Rapidly-exploring Random Tree Star) explores feasible paths by randomly sampling in continuous space, which is particularly suitable for high-dimensional and complex obstacle scenes.

Nearest neighbor query - Find the node closest to the random sampling point in the tree :

Step expansion - Extend step size from to direction:

The core improvement of RRT - Rewire:* Find all neighboring nodes in the sphere with as the center and radius :

Where is the number of nodes of the current tree, is the space dimension (3D scene ), and is a constant related to the free space volume. This radius shrinks as the sampling points increase, ensuring asymptotic optimality.

Cost update:

If the cost of can be reduced through , reconnection is performed:$$ \text{If } c(x_{near}) > c(x_{new}) + d(x_{new},, x_{near}),\text{ then change the parent node of } x_{near} \text{ to } x_{new}

\lim_{n\to\infty} c(\xi_n^) = c^ \quad \text{(almost certainly)}

You can't use 'macro parameter character #' in math mode ### 2.3 Artificial Potential Field Method (APF): The King of Real-time APF constructs the target as a gravitational field and the obstacles as a repulsive field, and the UAV moves under the action of the resultant force. **Gravitational potential** (quadratic potential well, pulling towards the target):

U_{att}(\mathbf{p}) = \frac{1}{2}k_{att}|\mathbf{p} - \mathbf{p}_{goal}|^2

U_{rep}(\mathbf{p}) = \begin{cases} \dfrac{1}{2}k_{rep}!\left(\dfrac{1}{\rho(\mathbf{p})}-\dfrac{1}{\rho_0}\right)^{!2} & \rho(\mathbf{p}) \leq \rho_0 \[8pt] 0 & \rho(\mathbf{p}) > \rho_0 \end{cases}

\mathbf{F}(\mathbf{p}) = -\nabla U_{att}(\mathbf{p}) - \nabla U_{rep}(\mathbf{p})

\nabla U_{att} = k_{att},(\mathbf{p}-\mathbf{p}_{goal})

You can't use 'macro parameter character #' in math mode\nabla U_{rep} = k_{rep}\!\left(\frac{1}{\rho}-\frac{1}{\rho_0}\right)\!\frac{1}{\rho^2}\,\nabla\rho \qquad (\rho\leq\rho_0) $$ The online update of APF is usually lightweight and suitable for real-time obstacle avoidance; however, if the nearest obstacle distance is calculated directly $\rho(\mathbf{p})=\min_{obs}\|\mathbf{p}-\mathbf{p}_{obs}\|$ according to the aforementioned definition, the naive implementation usually requires at least traversing the obstacle set at each step, which is about $O(n_{obs})$. Single-step queries can only be approximately $O(1)$) if distance fields, ESDFs, or raster queries have been precomputed. But it still has an Achilles heel in the CBD Canyon: **Local Minimum** - When gravity and repulsion are exactly balanced, the UAV will get stuck at a non-target point and be unable to move forward. Improvements include random perturbations, harmonic potential fields, or the PF-RRT algorithm combined with RRT. ### 2.4 Fast Traveling Square Method (FM²): The Elegance of Wavefront Propagation FM² (Fast Marching Square) generates smooth trajectories by solving the Eikonal equation, which is particularly suitable for 4D conflict avoidance. **Eikonal equation** - a partial differential equation describing the wavefront arrival time $T(\mathbf{x})$: $$ |\nabla T(\mathbf{x})|^2 \cdot v^2(\mathbf{x}) = 1 $$ where $v(\mathbf{x})$ is the velocity of propagation in space. Construct a **clearance distance-based velocity map** so that the wavefront naturally decelerates near obstacles: $$ v(\mathbf{x}) = c\cdot\rho(\mathbf{x}) = c\cdot\min_{obs}\|\mathbf{x}-\mathbf{x}_{obs}\| $$ After solving $T(\mathbf{x})$, the path is extracted by gradient descent on the $T$ field: $$ \dot{\boldsymbol{\xi}}(s) = -\frac{\nabla T\bigl(\boldsymbol{\xi}(s)\bigr)}{\left|\nabla T\bigl(\boldsymbol{\xi}(s)\bigr)\right|} $$**Extended to 4D conflict avoidance:** Introducing a time-varying velocity map, letting $v\to 0$ in the space-time region already occupied by other UAVs: $$ v(\mathbf{x},t) = v_0(\mathbf{x})\cdot\phi_{conflict}(\mathbf{x},t) $$ When $\phi_{conflict}\to 0$, the wavefront naturally bypasses the space-time conflict volume, achieving a collision-free 4D path. The paths generated by FM² are naturally smooth ($C^\infty$ continuous) and require no additional smoothing post-processing. --- ## 3. The core problem of high-density scenes: conflict detection and resolution (CD&R) The fundamental challenge in high-density UAV scenarios is not to find a path, but to ensure that all paths are safe simultaneously. ### 3.1 Conflict Detection Define the relative position vector between UAV $i$ and $j$: $$ \Delta\mathbf{p}_{ij}(t) = \mathbf{p}_i(t) - \mathbf{p}_j(t) $$ **Conflict determination conditions** (horizontal **and** vertical separation are violated simultaneously): $$ \text{Conflict}_{ij} \iff \|\Delta\mathbf{p}_{ij}(t)\|_{xy} < d_h \;\wedge\; |\Delta z_{ij}(t)| < d_v $$ Refer to NASA UTM CONOPS typical parameters: horizontal separation standard $d_h=30\,\text{m}$, vertical separation standard $d_v=10\,\text{m}$. In practice, the system needs to predict conflicts before flight rather than waiting for conflicts to occur before responding. Assume that the UAV flies at a constant speed within the look-ahead window $[0, T_h]$: $$ \mathbf{p}_i(t) = \mathbf{p}_i^0 + \mathbf{v}_i t, \quad \mathbf{p}_j(t) = \mathbf{p}_j^0 + \mathbf{v}_j t $$ **Closest Point of Approach (CPA, Closest Point of Approach) time:**$$ t_{CPA} = -\frac{\Delta\mathbf{p}_{ij}^0 \cdot \Delta\mathbf{v}_{ij}}{\|\Delta\mathbf{v}_{ij}\|^2}, \qquad \Delta\mathbf{v}_{ij} = \mathbf{v}_i - \mathbf{v}_j $$ Minimum spacing at CPA: $$ d_{min} = \|\Delta\mathbf{p}_{ij}(t_{CPA})\| $$ When $d_{min} < d_{sep}$ and $t_{CPA}\in[0, T_h]$, it is determined that there is a **prediction conflict** and the release mechanism needs to be triggered immediately. ### 3.2 Conflict Resolution Disarm strategies fall into three categories and can be used individually or in combination: **Strategy 1: Speed Adjustment** Apply a velocity scaling factor $\alpha$ to UAV $i$, decelerating or accelerating within the allowed range of dynamics: $$ \mathbf{v}_i^{new} = \alpha\,\mathbf{v}_i, \quad \alpha\in\!\left[\frac{v_{min}}{v_i},\;\frac{v_{max}}{v_i}\right] $$ The optimal $\alpha$ minimizes the deviation from the original plan while satisfying the separation constraints: $$ \alpha^* = \arg\min_\alpha\;|\alpha-1| \quad \text{s.t. } d_{min}^{new}(\alpha)\geq d_{sep} $$ **Strategy 2: Heading Change** Rotate the flight direction of UAV $i$ by $\delta\psi$ in the horizontal plane: $$ \mathbf{v}_i^{new} = v_i\begin{pmatrix}\cos(\psi_i+\delta\psi)\\\sin(\psi_i+\delta\psi)\\0\end{pmatrix}

\delta\psi^* = \arg\min_{|\delta\psi|};\delta\psi \quad \text{s.t. } d_{min}(\delta\psi)\geq d_{sep}

z_{layer}(k) = z_{base} + k\cdot\Delta z_{layer}, \quad k\in{0,1,\ldots,N_{layer}-1}

You can't use 'macro parameter character #' in math mode Typical configuration: eastbound $\to z_1$, westbound $\to z_2$, northbound $\to z_3$, southbound $\to z_4$, layer spacing $\Delta z_{layer}=10\,\text{m}$. This reduces the dimensionality of the three-dimensional collision problem to a two-dimensional problem, greatly reducing the system complexity. ### 3.3 Decentralized Coordination: Speed Barriers and ORCA Centralized UTM can obtain the global optimal solution, but the communication overhead increases with $O(N^2)$ as the number of UAVs $N$, facing a bottleneck in extremely high-density scenarios. Among decentralized solutions, **Velocity Obstacle (VO)** and its improvement **ORCA** are the most mature frameworks. **Speed Barrier** Definition - UAV $i$ The set of velocities that are prohibited due to the presence of UAV $j$ (all velocities that would cause a collision within the time window $\tau$):

VO_{ij}^\tau = \left{\mathbf{v}_i ;\middle|; \exists, t\in[0,\tau],; \mathbf{p}_i+\mathbf{v}_i t ;\in; \mathbf{p}_j+\mathbf{v}j t \oplus \mathcal{D}(d{sep})\right}

where is the size of the minimum velocity change, and points to the normal direction of the boundary.

The set of feasible velocities for agent (intersect all neighbor constraints and then intersect with dynamic constraints):

Among them, encodes dynamic constraints such as maximum velocity and acceleration. ORCA has achieved a 100% success rate in density scenarios of more than 40 frames/minute, with a computational complexity of , making it suitable for real-time deployment.

4. Graph theory modeling: urban airspace network

4.1 Construction of route network diagram

Urban airspace is modeled as a weighted directed graph:

Node : above road intersections, tops of buildings, key transfer points
Edge : Legal flight corridor between two nodes (needs to pass collision detection verification)
Edge weight : multi-objective scalar weighting

Corridor capacity constraints (the number of UAVs passing at the same time does not exceed the upper limit):

The occupancy status of the entire airspace can be described by a four-dimensional tensor ( is the voxel grid, is the number of time slots):$$ \mathbf{A} \in {0,1}^{N_x\times N_y\times N_z\times N_t}, \quad A_{x,y,z,t} = 1 \iff \exists\text{ UAV occupied voxel}(x,y,z)\text{ in time slot }t

You can't use 'macro parameter character #' in math mode ### 4.2 Rotor UAV energy consumption model Energy consumption is an important optimization goal for route planning and requires accurate modeling. **Hover Power** (derived from leaf element momentum theory):

P_{hover} = \sqrt{\frac{(mg)^3}{2,\rho_{air}, A_r}}

P(v) = \underbrace{P_0!\left(1+\frac{3v^2}{U_{tip}^2}\right)}{\text{Blade resistance}} + \underbrace{P_i!\left(\sqrt{1+\frac{v^4}{4v_0^4}}-\frac{v^2}{2v_0^2}\right)^{!\frac{1}{2}}}{\text{Induction power}} + \underbrace{\frac{1}{2},d_0,\rho_{air},s,A,v^3}_{\text{Body resistance}}

\mathcal{E}{ij} = \frac{\ell{ij}}{v}\cdot P(v)

v^* = \arg\min_v \frac{P(v)}{v}

You can't use 'macro parameter character #' in math mode For a typical small multicopter ($m\approx 1\,\text{kg}$), $v^*$ is typically between 8–12 m/s. ---## 5. Wind field and urban canyon effect ### 5.1 Urban wind field modeling The wind speed distribution in urban canyons is much more complex than in the countryside, and the Weibull distribution is widely used in statistical modeling:

f(v_w;, k,, \lambda) = \frac{k}{\lambda}!\left(\frac{v_w}{\lambda}\right)^{k-1}!\exp!\left[-!\left(\frac{v_w}{\lambda}\right)^k\right]

\bar{u}(z) = \frac{u^*}{\kappa}\ln!\left(\frac{z - d_0}{z_0}\right), \quad \kappa = 0.41 \text{(von Kármán constant)}

t_{ij} = \frac{d_{ij}}{v_{air} + v_w\cos\theta_w}

\mathcal{E}{ij}^{wind} = \int_0^{t{ij}} P!\left(|\mathbf{v}_{UAV}(t) - \mathbf{v}_w(t)|\right)\mathrm{d}t

TI = \frac{\sigma_u}{\bar{u}}, \qquad \sigma_u = \sqrt{\overline{u’^2}}

You can't use 'macro parameter character #' in math mode Corridors with $TI > 0.3$ are usually marked as high risk, and the planner will actively avoid or increase the edge weight of this segment. ### 5.2 Dynamic safety radiusThe turbulence around buildings increases sharply as the height margin decreases. Therefore, the safe clearance distance should not be a fixed constant, but should be dynamically adjusted with the flight altitude:

d_{safe}(h) = d_{base} + \frac{k\cdot H_{bld}}{h - H_{bld} + \epsilon}

\rho\bigl(\mathbf{p}(t)\bigr) \geq d_{safe}\bigl(z(t)\bigr), \quad \forall, t \in [t_0, t_f]

You can't use 'macro parameter character #' in math mode --- ## 6. Multi-machine collaborative optimization: MILP global modeling For the joint path and time slot allocation problem of $N$ drones, a **Mixed Integer Linear Programming (MILP)** model can be established to obtain the global optimal solution at small to medium scale ($N\leq 50$). **Objective function** (minimize the total completion time and energy consumption of all drones):

\min;\sum_{k=1}^{N}!\left(w_1, T_k + w_2,\mathcal{E}_k\right)

\sum_{j:(i,j)\in E}x_{ij}^k - \sum_{j:(j,i)\in E}x_{ji}^k = b_i^k, \quad \forall, i\in V,;\forall, k

\sum_{k=1}^{N} x_{ij}^k \leq C_{ij}, \quad \forall,(i,j)\in E

t_j^k \geq t_i^k + \frac{d_{ij}}{v_{max}}\cdot x_{ij}^k, \quad \forall,(i,j)\in E,;\forall, k

t_i^k - t_i^l \geq \Delta t_{sep} - M(1 - z_{kl}^i)

t_i^l - t_i^k \geq \Delta t_{sep} - M, z_{kl}^i

\min_{x,, t,, v};\sum_k\sum_{(i,j)} x_{ij}^k\cdot\frac{d_{ij}}{v_{ij}^k}\cdot P(v_{ij}^k), \quad v_{min}\leq v_{ij}^k\leq v_{max}

You can't use 'macro parameter character #' in math mode MINLP is an NP-hard problem that is approximately solved by commonly used heuristic algorithms in practice (random fractal search SFS, cheetah optimization MCO, etc.). --- ## 7. Reinforcement learning solution: MARL and attention mechanism When the scale of UAVs exceeds a hundred, the computational complexity of MILP is unacceptable. **Multi-agent reinforcement learning (MARL)** provides an alternative for offline training and extremely fast inference. ### 7.1 Reward function design The reward received by each drone $i$ at time step $t$:

r_i^t = r_{arrive}\cdot\mathbf{1}[goal] - c_{step} - c_{conflict}\cdot\mathbf{1}[conflict] - c_{detour}\cdot|\mathbf{p}i^t - \mathbf{p}{direct}| $$The meaning of each item: is the positive reward for reaching the target; is the time penalty for each flight step; is the penalty when a conflict occurs; is the detour penalty for deviating from the straight line.

7.2 Double-DQN update (discrete action space)

The online network selects actions, and the target network evaluates values, decoupling selection and evaluation to reduce overestimation bias.

7.3 Attention Mechanism: Modeling Neighbor Influence

The decision-making of each drone in the CBD requires sensing the status of its surrounding neighbors. The attention mechanism allows agent to dynamically weight the influence of neighbors :

The attention weight reflects the relevance of the neighbor to the decision-making of the agent . Neighbors with close distances and large speed conflicts naturally receive higher weights.

7.4 PPO policy gradient (continuous/mixed action space)$$

\mathcal{L}^{CLIP}(\theta) = \mathbb{E}_t!\left[\min!\left(r_t(\theta)\hat{A}_t,;\mathrm{clip}!\left(r_t(\theta),,1-\varepsilon,,1+\varepsilon\right)\hat{A}_t\right)\right]

r_t(\theta) = \frac{\pi_\theta(a_t\mid s_t)}{\pi_{\theta_{old}}(a_t\mid s_t)}

You can't use 'macro parameter character #' in math mode The Clip operation limits the update step size to the range of $[1-\varepsilon,\, 1+\varepsilon]$ (usually $\varepsilon=0.2$) to prevent training from crashing due to excessive policy updates. **Centralized Training, Decentralized Execution (CTDE) Paradigm:** - **Training phase**: Evaluation network $V(s^{global};\phi)$ uses global state and can perceive all agent information - **Execution phase**: Policy network $\pi_\theta(a_i\mid o_i)$ only uses local observations of agent $i$, without communication --- ## 8. Trajectory Smoothing: Bézier Curve and Minimum Snap The output of path planning is often a series of discrete waypoints, and directly tracking these waypoints will produce unfeasible sharp turns. It is necessary to generate dynamically feasible continuous trajectories through **trajectory smoothing**. ### 8.1 Bézier Curve A Bézier curve of order $n$ is defined by $n+1$ control points $\{\mathbf{P}_i\}$:

\boldsymbol{\xi}(u) = \sum_{i=0}^{n}\binom{n}{i}(1-u)^{n-i}u^i,\mathbf{P}_i, \quad u \in [0,1]

\dot{\boldsymbol{\xi}}(u) = n\sum_{i=0}^{n-1}\binom{n-1}{i}(1-u)^{n-1-i}u^i,(\mathbf{P}_{i+1}-\mathbf{P}_i)

\kappa = \frac{|\dot{\boldsymbol{\xi}}\times\ddot{\boldsymbol{\xi}}|}{|\dot{\boldsymbol{\xi}}|^3} \leq \frac{a_{max}}{v^2}

You can't use 'macro parameter character #' in math mode ### 8.2 Minimum Snap: The standard solution for quadcopters For a quadcopter UAV, minimizing Snap (the second derivative of acceleration) is equivalent to minimizing the rate of change of required thrust, resulting in optimal flight dynamics:

\min;\int_{t_0}^{t_f}!\left|\frac{d^4\boldsymbol{\xi}}{dt^4}\right|^2!\mathrm{d}t

\min_{\mathbf{c}};\mathbf{c}^\top\mathbf{Q}\mathbf{c} \quad \text{s.t. }\mathbf{A}{eq}\mathbf{c} = \mathbf{b}{eq}

You can't use 'macro parameter character #' in math mode The matrix $\mathbf{Q}$ encodes the Snap integral (can be calculated analytically), and the equality constraint $\mathbf{A}_{eq}\mathbf{c}=\mathbf{b}_{eq}$ forces the trajectory to pass through all path points and ensures the continuity of position, velocity, and acceleration between segments. --- ## 9. Horizontal comparison of methods| Method | Completeness | Optimality | Time complexity | Real-time | Multi-machine scalability | |------|--------|--------|------------|--------|------------| | **A\*** | Complete | Optimal (discrete graph) | $O(b^d)$ | Medium | Poor | | **RRT\*** | Probabilistically complete | Asymptotically optimal | $O(n\log n)$ | Better | Medium | | **APF** | Incomplete | No guarantee | $O(1)$/step | Excellent | Good | | **FM²** | Complete | Optimal (continuous) | $O(N\log N)$ | Medium | Medium | | **MILP** | Complete | Global Optimal | NP-hard | Poor | Medium ($N\leq50$) | | **ORCA** | Probabilistically complete | Local optimum | $O(N^2)$ | Excellent | Excellent | | **MARL+Attn** | Complete probability | Approximate | Heavy training, fast inference | Good | Excellent | **Selection suggestions:** - **Small scale, high security requirements** ($N\leq 20$) → MILP global optimal - **Medium scale, real-time sensitive** ($20 < N \leq 100$) → A\* / RRT\* + ORCA conflict resolution - **Large scale, high density** ($N > 100$) → MARL + attention mechanism (inference delay $< 10\,\text{ms}$) --- ## 10. Summary and Outlook Urban low-altitude, especially high-density UAV route planning in CBD scenarios, is a multidisciplinary system engineering problem. This article sorts out the complete method chain from **single-machine path planning** (A\*, RRT\*, APF, FM²) to **multi-machine conflict resolution** (CD&R, ORCA, MILP) to **learning methods** (MARL, PPO, attention), and gives the precise mathematical expression of each core link. **Three main unsolved challenges:** 1. **Real-time online Replanning**: When a sudden no-fly zone or drone failure occurs, the system needs to complete the replanning of all affected trajectories within 200 ms. Currently MILP falls far short of this requirement and MARL is the most promising candidate.2. **Digital twins and perception fusion**: Accurate real-time three-dimensional urban maps (including dynamic building construction, temporary enclosures, and meteorological information) are the basis for the quality of route planning. Digital twin technology is expected to achieve centimeter-level and sub-second-level airspace status synchronization. 3. **Technical implementation of the regulatory framework**: The Civil Aviation Administration of China (CAAC) low-altitude management regulations, European EASA U-Space, and American FAA UTM CONOPS all have clear requirements for conflict resolution time, flight plan submission format, emergency landing procedures, etc., and algorithm design needs to be deeply coupled with regulatory boundaries. > From a mathematical perspective, urban low-altitude air route planning is a non-convex, non-linear, mixed integer, multi-agent, real-time constrained optimization problem. No single framework can "solve it with one click" - in engineering practice, it is often a multi-level hybrid architecture: map planning is used at the strategic layer, ORCA is used at the tactical layer, and APF is used at the emergency layer, which together form a robust air traffic management system. --- **Main references:**1. Karaman, S., & Frazzoli, E. (2011). *Sampling-based algorithms for optimal motion planning.* International Journal of Robotics Research, 30(7), 846–894. 2. Van den Berg, J., Guy, S. J., Lin, M., & Manocha, D. (2011). *Reciprocal n-body collision avoidance.* Robotics Research, 3–19. 3. Zeng, Y., Xu, J., & Zhang, R. (2019). *Energy minimization for wireless communication with rotary-wing UAV.* IEEE Transactions on Wireless Communications, 18(4), 2329–2345. 4. Mueller, M. W., Hehn, M., & D'Andrea, R. (2015). *A computationally efficient motion primitive for quadrocopter trajectory generation.* IEEE Transactions on Robotics, 31(6), 1294–1310. 5. Brittain, M., & Wei, P. (2019). *Autonomous air traffic controller: A deep multi-agent reinforcement learning approach.* arXiv:1905.01303. 6. Bertram, J., & Wei, P. (2020). *Distributed computational guidance for high-density urban air mobility.* AIAA Aviation Forum. 7. Valavanis, K. P., & Vachtsevanos, G. J. (Eds.). (2015). *Handbook of Unmanned Aerial Vehicles.* Springer. 8. Augugliaro, F., Schoellig, A. P., & D'Andrea, R. (2012). *Generation of collision-free trajectories for a quadrocopter fleet.* IEEE/RSJ IROS, 3977–3982.