Low-altitude planning paper matrix v2: three papers in progress, follow-up embodied low-altitude and large model routes

With three ongoing papers on conflict-free path planning, three-layer scheduling of hundreds of UAVs, and information theory-driven 3DGS active sensing planning as the core, we will re-plan the follow-up paper route on embodied low-altitude, low-altitude cloud brain, vertebral large model fine-tuning, inference acceleration, and software and hardware collaboration.

Low-altitude planning paper matrix v2: three papers in progress, follow-up embodied low-altitude and large model routes

This article reintegrates the low-altitude UAV papers that have been written so far into a paper portfolio.
The goal is not to spread out and write a lot of ideas, but to clarify: which articles have already taken shape, which ones can continue to be made into top journal/regular conference papers, and what literature support, experimental assets and submission positioning are needed for each paper.


0. 2026-05-28 Correction Conclusion

The current focus needs to be changed: instead of “planning 7-10 papers at the same time”, it is first to acknowledge that three papers are already being worked on, and subsequent papers must naturally grow from the assets of these three papers.

The three articles I’m currently working on are:

StatusArticleRoleThe main line that cannot be deviated from
Already working onPaper A: Conflict-free path planning / PPO-MAPPO / Low-altitude conflict resolutionTactical security layerHigh-density low-altitude corridors, non-cooperative UAVs, communication/positioning degradation, safety-efficiency trade-offs
Already in progressPaper B: Three-layer hierarchical scheduling of 100 UAVsSystem operation layer100-level fleet, queue stability, vertiport/charging/corridor bottleneck, multi-modal scheduling
Already working onPaper C: Information theory-driven UAV 3DGS active sensingEnvironmental cognitive layer3DGS / Fisher information / NBV / safe reconstruction / planning-aware mapping

Don’t start a new paper for subsequent papers that have nothing to do with the direction. The correct route is:1. **First make A/B/C into three main papers that can be submitted. ** 2. Follow-up Paper D/F/G/H/I are only used as extensions of A/B/C: scene coverage supports A/C, low-altitude cloud brain connects A/B/C in series, embodied low-altitude connects C’s perception and A’s control into a closed loop, and model fine-tuning and inference accelerate the implementation of the service cloud brain. 3. General AGI directions cannot be written as empty claims. A more stable expression is “towards general embodied low-altitude intelligence”: starting from domain Agent, tool invocation, simulation feedback, VLA/VLN, world model, and device-side reasoning, and gradually approaching general embodied intelligence. 4. It is not recommended to train vertical foundation model from scratch in the first stage. First use an ordinary large model + Agent + Skills/MCP + RAG + verifier + simulator post-processing to form a reproducible experimental closed loop; wait until there are enough tool call trajectories, failure samples, and simulation feedback before doing LoRA/SFT/DPO/GRPO fine-tuning.

This means that subsequent papers should be divided into two levels:

LevelThesis goalWhether to start in the near future
A/B/C Main LayerAlready in progress, the experimental closed loop must be completed firstImmediately
D scene overlayProvides benchmark, failure taxonomy and safety-critical data to A/CRecent
G Cloud Brain Agent LayerTurn A/B/C/D/E into tools to build a verifiable low-altitude traffic cloud brainMid-term
H Embodied low-altitude layerMake UAV VLN/VLA/world model and connect to universal embodied AIMid-term and later
I model training layerTraining LowAltitudeGPT / tool-use / LowAltitudeIR / simulation feedbackWait for the data to stabilize
J inference acceleration layervLLM/TensorRT-LLM/quantization/device-cloud collaboration/hardware deploymentWait until the agent workload is stable

1. There are currently articles and main line positioning

There are currently three core articles that have formed the basis of content:| Number | Existing content | Current positioning | Recommended main investment | Core judgment | |---|---|---|---|---| | Paper A | Conflict-free path planning / PPO / MAPPO / Multi-UAV conflict resolution | Robust conflict resolution in low-altitude air route network | IEEE T-ITS / IEEE T-RO / ICRA-IROS | You cannot just write PPO, you must write “Safety-efficiency trade-off under non-cooperative UAVs, communication degradation, and high-density corridors” | | Paper B | Hundreds of UAV three-layer dispatching | Urban low-altitude logistics/emergency system operation dispatching | TR-C first, T-ITS backup | This is a transportation system paper, focusing on capacity, delay, queue stability, vertiport/charging/corridor bottleneck | | Paper C | UAV 3DGS active sensing planning driven by information theory | Active sensing + low-altitude digital twin + planning closed loop | T-RO / T-ITS / ICRA-IROS | If submitted to a transportation journal, it must be proven that active sensing improves inspection, emergency, obstacle avoidance or operational control indicators |

These three articles have been able to form a very stable low-altitude planning triangle:

Paper A:战术安全
  多 UAV conflict resolution / no-conflict planning / PPO-MAPPO / CBF / RMADER

Paper B:系统运营
  hundred-UAV scheduling / queue stability / Lyapunov / multimodal logistics

Paper C:环境认知
  3DGS active perception / Fisher information / NBV / safe reconstruction

It is best for subsequent new papers to expand around this triangle and not to start in a completely unrelated direction.


2. Overall submission judgment

The low-altitude planning direction can be divided into three categories of papers, and different categories have different review standards:| Type | Representative papers | Review attention | Recommended venues | |---|---|---|---| | Transportation system papers | Paper B, emergency resource allocation, low-altitude road network planning | Real traffic problems, system indicators, data/simulation credibility, policy or operational implications | TR-C, T-ITS | | Robot planning paper | Paper A, Paper C, digital twin planning | Algorithm novelty, real-time, security, hardware/simulation verification | T-RO, RA-L+ICRA/IROS, T-ITS | | AI method papers | VERA-UAV, CloudBrain-Agent, scene acceleration generation | benchmark difficulty, theory/verification mechanism, model generalization, reproducibility | AAAI, IJCAI, NeurIPS/ICLR workshop, T-ITS extension |

The official positioning of TR-C emphasizes transportation systems and emerging technologies, and the intellectual core is on the transportation side [1]; T-ITS covers sensing, communications, controls, planning, design, implementation and other modern transportation system technologies [2]. Therefore:


3. Paper matrix: 3 papers already in progress + follow-up extension route

The reading of this section needs to be changed: Paper A/B/C are the three main works that are already being done, not “new directions in the future”. Paper D/E/G/H/I/J are writable extensions, but the startup sequence must obey the experimental asset maturity of A/B/C.

3.1 Paper A: Robust conflict-free planning of low-altitude air route networkSuggested topic: Robust Conflict-Free UAV Corridor Planning under Non-Cooperative Traffic and Communication Degradation

Corresponding to existing articles: Conflict-free path planning, PPO/MAPPO, UAV conflict resolution, UAV conflict env construction.

Core question: In an urban low-altitude airway network, how can multiple UAVs maintain separation safety while controlling delays, extra distances, and throughput losses under the conditions of local observation, communication delays, positioning errors, and non-cooperative aircraft insertion.

Method Route:

**Key References:**The multi-agent stable training of MAPPO/PPO can be supported by Yu et al. [3]; MAT and FACMAC provide stronger MARL baseline [4,5]; HAPPO/HATRPO provides trust-region multi-agent policy optimization reference [6]. On the robot side, EGO-Swarm, MADER, RMADER, RACER, PANTHER and GCOPTER respectively support decentralized swarm planning, trajectory sharing under delay, collaborative exploration, perception-aware planning and multicopter trajectory optimization [7-12].

Innovation suggestions:

  1. Upgrade “PPO conflict-free path planning” from a simple RL task to low-altitude traffic corridor safety control.
  2. Introduce communication degradation and non-cooperative UAV to form the actual operating boundary that T-ITS is more concerned about.
  3. Use learning policy + formal/safety shield to avoid the lack of security of pure RL.
  4. Trafficization of indicators: LoWC, NMAC, conflict count, average delay, extra distance, throughput, runtime.

3.2 Paper B: Three-layer hierarchical scheduling of hundreds of UAVs

Suggested topic: H-LyraUAV: Queue-Stable Hierarchical Scheduling for Hundred-Scale Low-Altitude UAV Logistics

Corresponding to existing articles: Paper B three-tier scheduling planning.

Core question: How can a hundred-level UAV fleet operate stably, efficiently, and safely under dynamic requirements, limited vertiport/charging/corridor capacity, and multi-modal transportation constraints.

Method Route:- macro layer: demand queue, fleet repositioning, mode choice;

Key References:

TR-C low-altitude UAV delivery traffic management has directly discussed resource allocation and conflict resolution in low-altitude urban space [13]; passenger-centric UAM, fairness and operational efficiency research support service quality framing [14]; charging-station delivery network, capacity-constrained UAM scheduling, safe learning scheduling support infrastructure capacity and safe online scheduling [15-17]; truck-drone / UAV-UGV multi-modal delivery support multimodal extension [18,19].

**Innovation suggestions:**1. Hundred-shelf-level online three-layer scheduling closed loop instead of offline routing/network design. 2. Queue stability becomes the main line of theory, and the learning module only makes predictions or value estimates. 3. Evaluate delay, throughput, backlog, charging utilization, vertiport bottleneck, and corridor congestion at the same time. 4. The conclusion of the traffic system can answer: when does it need to limit traffic, where is the bottleneck, and when is UAV-only inferior to multimodal fallback.

3.3 Paper C: FIM-3DGS UAV active sensing planning

Suggested topic: FIM-3DGS: Fisher-Information-Driven Active Perception Planning for Safe UAV Reconstruction

Corresponds to existing articles: Paper C, Next-Best-View and NeRF/3DGS, Information Theory Active Sensing.

Core question: Under limited flight time, energy and safety constraints, how can UAV actively select viewpoints to make the 3DGS map converge faster and serve low-altitude planning tasks.

Method Route:

**Key References:**The original 3DGS text provides real-time explicit radiance field representation [20]; ActiveNeRF is an early representative of neural rendering active perception [21]; FisherRF directly supports Fisher information active view selection, and has 3DGS backend 70 fps results [22]; GS-Planner, HGS-Planner, POp-GS and NVF support the 3DGS/NBV competition line of 2024-2025 [23-26].

Innovation suggestions:

  1. Upgrade from “3DGS NBV” to “active perception serving UAV security planning”.
  2. Use Fisher information to connect CRB / reconstruction uncertainty / planning safety.
  3. Expand from visual indicators to traffic/robot task indicators: path feasibility rate, obstacle recall rate, emergency inspection coverage rate.
  4. Do cross-scenario generalization on MatrixCity/AirSim/self-built urban low-altitude cells.

3.4 Paper D: Low-altitude safety critical scene coverage and accelerated testing

Suggested topic: Coverage-Guided Accelerated Testing for Safety-Critical Low-Altitude UAV Navigation

Corresponding to existing articles: Paper F scene coverage, dangerous scene generation, 76 million exploration logs.

Core question: How to define the test scene space of low-altitude UAV obstacle avoidance/planning algorithm, how to measure coverage, and how to efficiently discover dangerous but effective failure scenarios.

Method Route:- scenario grammar: local 50m x 50m x 50m cell, obstacle combination, dynamic obstacles, wind disturbance, target point, start and end points;

Key References:

Shuo Feng’s NADE and testing scenario library generation are core references for accelerated testing and security-critical scenario libraries [27-29]; SafeBench provides a benchmark platform and security assessment protocol reference [30].

Innovation suggestions:

  1. Migrate from autonomous driving scenario engineering to low-altitude UAV 3D scene space.
  2. Model the three objectives of coverage, criticality, and feasibility simultaneously.
  3. Prove coverage space and failure taxonomy using 76 million exploration logs.
  4. Let the results answer: Which combinations of obstacles are the most dangerous, which planners generalize the worst, and whether increased coverage really reduces unknown risks.

3.5 Paper E: Verification of error-correcting UAV language planning

Suggested topic: VERA-UAV: Verification-and-Repair Language Planning for Low-Altitude UAV Tasks

Corresponding to existing articles: Paper E.

Core problem: LLM can convert natural language tasks into UAV executable task specifications, but it is prone to producing plans that are unexecutable, semantic mismatch, or violate safety constraints. Requires typed IR, LTL/STL, validators, and counterexample feedback loops.Method Route:

Key References:

Lang2LTL, NL2LTL, LTLCodeGen, and ConformalNL2LTL respectively support NL-to-LTL grounding, system demonstration, code-generation-style temporal logic generation and conformal correctness guarantee [31-34].

Innovation suggestions:

  1. It is not just NL2LTL, but the UAV trajectory can perform closed loop.
  2. Typed TaskIR reduces language ambiguity and improves interpretability.
  3. Counterexample feedback and STL robustness feedback give repair a specific direction.
  4. The AAAI/IJCAI version focuses on AI planning/verification; T-ITS is extended to connect to low-altitude traffic operation scenarios.

3.6 Paper G: Low-altitude traffic cloud brain LLM Agent

Suggested topic: CloudBrain-Agent: Tool-Augmented LLM Agents for Low-Altitude Traffic Operation

Corresponding to existing articles: Paper G/G1.

Core question: The low-altitude traffic cloud brain cannot be just a chat model, but a verifiable agent that can call the scheduler, path planner, verifier, simulator and risk assessor.

Method Route:- LLM is responsible for task understanding, tool selection, status summary and interpretation;

Key References:

UrbanGPT, UniST, and TrafficGPT show that transportation/urban spatiotemporal tasks have begun to move closer to foundation models and agent frameworks [35-37]; although DriveLM is autonomous driving, its Graph VQA task form can learn from the multi-step reasoning of low-altitude traffic cloud brain [38].

Innovation suggestions:

  1. The low-altitude traffic cloud brain is not a “vertical chat model”, but a tool-augmented verifiable agent.
  2. Use unified IR to connect scheduling, planning, sensing, verification, and scenario testing.
  3. Do the agent benchmark first, and then decide whether to fine-tune the vertical model to reduce the risk of the first article.
  4. Evaluation indicators include tool-call accuracy, task success, safety violation, repair success, latency, and human auditability.

3.7 Paper H: Urban low-altitude ODD and semantic functional area planning

Suggested topic: ODD2Route: Semantic Operational-Design-Domain Modeling for Low-Altitude UAV Route Planning

**This is a new article that can be written in a new direction. **

Core question: How does the overall urban scene map to local low-altitude route planning? How to determine the risk, capacity and service strategy of low-altitude air routes based on different functional areas, building density, road structure, crowd activities, no-fly zones and emergency facility distribution?Method Route:

Literature support:

This article can be supported by Paper B’s TR-C/UAM literature [13-19], Paper D’s scenario coverage literature [27-30], and Paper C’s 3D/digital twin literature [20-26]. The difficulty lies not in the complexity of the algorithm, but in the trustworthy definition of city-level ODD to local scenario/route risk.

Innovation suggestions:

  1. Establish a computable mapping between the “overall city scene” and “local obstacle combination”.
  2. Use ODD coverage to interpret scene coverage instead of randomly generating scenes.
  3. Provide a bridge between urban low-altitude planning, route design and test scenario library for TR-C/T-ITS.

3.8 Paper I: Embodied low-altitude intelligence and Aerial VLA/VLN

Suggested topic: Embodied Low-Altitude Intelligence: Vision-Language-Action Planning for UAVs in Urban Airspace

**This is the mid- to long-term direction that is most worth retaining after the new survey. **The current main line of embodied intelligence has moved from “LLM speaking” to “VLM/VLA directly connecting perception, language and action”. RT-2 clearly proposed the vision-language-action model, putting vision, language and robot action into the same model paradigm [44]; OpenVLA and Octo showed that the open source VLA / generalist robot policy can be pre-trained with large-scale robot trajectories and then fine-tuned with a small amount of target domain data [42,43]. Directly related work has also begun to appear in the UAV direction: SINGER uses Gaussian Splatting to generate language embedding flight simulation data, train onboard drone VLN policy, and do hardware experiments [39]; FlightGPT uses SFT + GRPO to do UAV VLN, and verifies generalization and interpretable reasoning on CityNav [40]; UAV-VLN connects natural language, visual perception and feasible trajectory planning [41].

Our writable gap:

Most of the existing aerial VLN/VLA focus on “giving a language target and letting the drone fly near the target”. This is not the capability required by the low-altitude traffic cloud brain. Low-altitude scenarios require the model to understand at the same time:

Suggested method:

multimodal observation
  = UAV RGB/depth/semantic map/3DGS local map
  + low-altitude traffic state
  + natural-language mission
  + city ODD metadata

LLM/VLM/VLA policy
  -> LowAltitudeIR
  -> skill selection
  -> waypoint / velocity / route command
  -> verifier + safety shield
  -> simulator or hardware feedback

Recommended version to do first:

Don’t initially train end-to-end AerialVLA. First make a hybrid embodied agent:

Available targets:- ICRA/IROS/T-RO: Emphasis on embodied navigation, hardware closed loop, and sim-to-real.

3.9 Paper J: LowAltitudeGPT training and fine-tuning route

Suggested topic: LowAltitudeGPT: Tool-Use and Simulation-Feedback Tuning for Low-Altitude Traffic Intelligence

Core judgment:

It’s not time to train the “large low-altitude traffic model” from scratch now. There are three problems with this:

  1. The amount of data is insufficient to support foundation model-level contributions;
  2. The review will ask whether the model contribution exceeds that of ordinary large models + RAG + tool-use;
  3. The training cost is high, but it is not necessarily more valuable than the agent/verifier/simulator closed loop.

A more feasible route is ordinary large model + Agent + Skills/MCP + RAG + verifier + simulator to run through it first, and then precipitate the running logs into trainable data. MCP is essentially a standard interface that exposes tools and context to LLM, and is suitable for unified access to schedulers, planners, verifiers, simulators, databases and document libraries [47].

A review of large low-altitude economic models also breaks down low-altitude systems into facility networks, information networks, route networks, and service networks, and emphasizes that large models need to be combined with edge computing, 6G/ISAC, and trusted distributed intelligence [50]. This shows that our paper cannot just be written as “training a chat model”, but must be written as a closed loop of models, tools, networks, operation control and system evaluation.

Model selection suggestions:| Stage | Recommended model | Reason | |---|---|---| | Solution exploration / data generation / teacher | High-capability API model | Quickly generate tasks, tool traces, counterexample explanations and evaluation samples first, without using the API as a final reproducible dependency | | Local reproducible experiments | Qwen3-8B / Qwen3-14B / Qwen3-32B | Qwen3 officially supports local operation, deployment, quantification and training processes, with good Chinese language, tool calling and engineering ecology [45] | | Reasoning/Mathematics/Constraint Interpretation | DeepSeek-R1-Distill-Qwen-14B / 32B | The DeepSeek-R1 series emphasizes RL-motivated reasoning capabilities. The distill version can be deployed locally and is fine-tuned based on the Qwen/Llama open source model [46] | | Multi-modal low-altitude perception | Qwen-VL / Qwen3-VL / other open source VLM | Semantic understanding for pictures, video frames, maps, track charts, and 3DGS render | | Edge-end small model | Qwen3-4B / 8B quantitative version, SLM | Used for end-side status summary, anomaly detection, low-latency fallback |

Training data design:

Data typeSourceTraining target
NL mission -> LowAltitudeIRManual template + API teacher + real task rewritingTask parsing and structured representation
tool-use tracePaper A/B/C/D/E tool call logLearn when to call scheduling, planning, verification, simulation
verifier counterexampleSpot/RTAMT/CBF/simulator feedbackLearn to fix unexecutable or dangerous plans
simulation rolloutSUMO/AirSim/self-developed low-altitude simulationLearn to explain system bottlenecks from results
failure casecollision, LoWC, timeout, queue explosion, insufficient energy consumptionLearn risk diagnosis and emergency de-escalation
human audit dataManual selection of more reasonable solutionsDPO/preference optimization

**Training Phase:**1. RAG + prompt baseline: No fine-tuning, only use the literature library, regulations, system description and tool schema. 2. LoRA/QLoRA SFT: training NL-to-IR, tool-call, result interpretation, and counterexample repair. 3. DPO/IPO: Use manual preferences or verifier scoring preferences to optimize “safe, executable, concise, and explainable”. 4. GRPO/RL-style tuning: Use simulation to reward training task success rate, low violation, low latency and format compliance. FlightGPT’s SFT + GRPO route can be used as a UAV VLN reference [40]. 5. distillation: Distill API teacher/32B model capabilities to 8B/4B for local and edge deployment.

Evaluation indicators:

-task success;

3.10 Paper K: Low-altitude cloud brain inference acceleration and software and hardware collaboration

Suggested topic: Edge-Cloud Co-Optimized Inference for Low-Altitude Traffic Cloud-Brain Agents

**Why can this article be written:**If we want to do both software and hardware in the future, inference acceleration cannot just be engineering optimization. It needs to be written as Real-time intelligent service problem under low-altitude traffic system constraints: There are large models and global states on the cloud side, low latency and privacy/communication constraints on the edge side, and power consumption, computing power, heat dissipation and real-time control constraints on the drone side. General-Purpose Aerial Intelligent Agents have given a direct signal in the direction of hardware-software co-design: the 14B model onboard runs about 5-6 tokens/sec, has a peak power consumption of about 220W, and adopts a bidirectional cognitive architecture of slow LLM planning + fast reaction control [51].

System Architecture:

cloud brain
  - full LLM / VLM
  - global scheduler
  - long-horizon planner
  - batch simulation evaluator

edge station / vertiport
  - quantized 8B/14B model
  - local RAG cache
  - route/conflict verifier
  - streaming state summarizer

onboard UAV
  - tiny policy / controller
  - VIO / obstacle avoidance
  - emergency fallback
  - compressed semantic state uplink

Accelerated technology route:

**Thesis points available:**1. System paper: latency/cost/energy profiling of low-altitude cloud brain agent workload. 2. Algorithm-System Paper: Dynamic selection of API/cloud 32B/edge 14B/onboard 4B based on task risk. 3. Operator/Inference paper: KV cache and batching optimization for low-altitude traffic with multiple agents, multiple tools, long context, and streaming status updates. 4. Hardware collaboration paper: Jetson Orin / RTX workstation / cloud GPU three-tier deployment, evaluating tokens/sec, end-to-end latency, energy per decision, and safety fallback rate.

Recommended venue:


4. Recommendation priority| Priority | Articles | Recent Actions | Reasons |

|---|---|---|---| | P0-Active | Paper B | Freeze problem formulation, queue model, experimental benchmark | Most similar to the TR-C system paper, and most suitable for low-altitude economy/emergency | | P0-Active | Paper A | Rewriting PPO/MAPPO into Robust Low-altitude Conflict Resolution Paper | Already have algorithm basis, but need traffic indicators and strong baseline | | P0-Active | Paper C | Converged to Fisher + 3DGS + safe planning, no longer expand too much | The algorithm is innovative and can be used in robots/AI/ITS | | P1-Support | Paper D | Reuse 76 million exploration logs and do coverage-guided testing | Provide safety-critical scenarios, failure taxonomy and benchmark for A/C | | P1-Bridge | Paper G | Make the tool interface and CloudBrain-Agent benchmark first | String A/B/C/D/E into a low-altitude cloud brain instead of an empty chat model | | P2-Embodied | Paper I | Making aerial VLN/VLA small-scale pilot: simulation data, expert trajectories, end-to-end/hybrid strategy comparison | This is the main line leading to embodied AGI, but it requires A/C perception and security tools to be stabilized first | | P2-Model | Paper J | Precipitate LowAltitudeIR, tool trace, verifier feedback, and then do LoRA/SFT/GRPO | First have data closed loop, then fine-tune the vertical model | | P3-System | Paper K | Wait for the CloudBrain-Agent workload to be fixed before doing vLLM/TensorRT/quantization/end-cloud collaboration | The software and hardware direction can be written, but it requires a real workload to be like a paper | | P3-Planning | Paper H | As a subsequent extension of TR-C/T-ITS | Requires mature urban data pipeline and ODD definition |

**Execution order suggestions:**1. Do not change the current main battlefield: A/B/C continue to advance. 2. Supplement Paper D first, because it directly enhances the experimental credibility of A/C and can also generate subsequent model training data. 3. Make Paper G again and package A/B/C/D/E into a tool-based cloud brain. 4. Paper I/J/K Don’t rush to start a big project; do a small pilot and data schema first. Before starting the actual question, you must answer: Where does the data come from, what are the evaluation indicators, and whether it can be stronger than ordinary large models + tool calls.


4.1 Literature support matrix

In order to avoid document stacking, the current 51 references are used in a closed manner according to the direction of the paper:| Directions | Documentation Groups | Usage | |---|---|---| | Submission and transportation system positioning | [1,2] | Determine the framing differences of TR-C / T-ITS | | Paper A: Multi-agent conflict resolution | [3-12] | PPO/MAPPO, MAT/FACMAC/HAPPO and EGO-Swarm/MADER/RMADER/RACER/PANTHER/GCOPTER baseline | | Paper B: Hundreds of UAV Scheduling | [13-19] | Low-altitude delivery resource allocation, UAM scheduling, safe learning, truck-drone/UAV-UGV multi-modal delivery | | Paper C: 3DGS active sensing | [20-26] | 3DGS, ActiveNeRF, FisherRF, GS-Planner, HGS-Planner, POp-GS, NVF | | Paper D: Safety-critical scenario coverage | [27-30] | Shuo Feng accelerated testing, scenario library, SafeBench | | Paper E: Language Planning and Verification | [31-34] | Lang2LTL, NL2LTL, LTLCodeGen, ConformalNL2LTL | | Paper G: Low-altitude cloud brain Agent | [35-38,47,50,51] | UrbanGPT/UniST/TrafficGPT/DriveLM, MCP, low-altitude economic large model review, aerial intelligent agent | | Paper I: Embodied Low Altitude / Aerial VLA | [39-44] | SINGER, FlightGPT, UAV-VLN, OpenVLA, Octo, RT-2 | | Paper J: Model training and fine-tuning | [40,45,46,47,50] | SFT/GRPO reference, Qwen3, DeepSeek-R1, MCP/tool-use, low-altitude large model system positioning | | Paper K: Inference acceleration and software and hardware collaboration | [45,48,49,51] | Qwen3 deployment ecology, vLLM/PagedAttention, TensorRT-LLM, onboard 14B aerial agent hardware constraints|---

5. Zotero organizes status

Target Zotero collection name:

低空规划论文参考

Currently, two levels of organization have been completed:

ProjectStatus
Zotero collectionalready exists, collection key is FVHS3SKY, local treeViewID is C17
Zotero local selection linkzotero://select/library/collections/FVHS3SKY
Imported documents51 top-level items
item type distributionjournalArticle 17 items, conferencePaper 11 items, document/preprint/webpage 23 items
Local backup BibTeXzotero/low-altitude-planning-references-20260527.bib; Increment: zotero/low-altitude-planning-references-update-20260528.bib

The import method uses Zotero’s local connector server instead of writing zotero.sqlite directly. The specific process is:

  1. Use pandoc to check that BibTeX can be parsed as CSL JSON.
  2. Import zotero/low-altitude-planning-references-20260527.bib through Zotero local /connector/import.
  3. Update the target collection of the imported session to C17 / Low Altitude Planning Paper Reference through /connector/updateSession.
  4. Use Zotero local API and read-only SQLite to double verify that there are 51 top-level documents in the collection.If you continue to add documents in the future, it is recommended to update the local BibTeX first, and then import Zotero through the same connector import/updateSession process. Do not modify SQLite directly.

6. Follow-up execution plan

6.1 Week 1: Freeze three papers in progress

6.2 Weeks 2-3: Supplementing the literature matrix and subsequent route novelty checking

6.3 Weeks 4-8: Advance the three experimental lines of Paper B/A/C first- Paper B: synthetic UAM queuing benchmark + FCFS/greedy/MILP/backpressure/MARL baseline.

6.4 Weeks 9-12: Constructing the minimum closed loop of low-altitude cloud brain

6.5 Weeks 13-20: Deciding on submission and model routes- If Paper B’s queue stability and hundred-shelf level results are the most stable: vote for TR-C first.


7. References

[1] Elsevier. Transportation Research Part C: Emerging Technologies: Aims and Scope. URL: https://www.sciencedirect.com/journal/transportation-research-part-c-emerging-technologies

[2] IEEE Intelligent Transportation Systems Society. IEEE Transactions on Intelligent Transportation Systems: Scope. URL: https://ieee-itss.org/pub/t-its/[3] Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre M. Bayen, and Yi Wu. “The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games.” Advances in Neural Information Processing Systems, 2022. URL: https://arxiv.org/abs/2103.01955

[4] Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, and Yaodong Yang. “Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.” NeurIPS, 2022. URL: https://proceedings.neurips.cc/paper_files/paper/2022/hash/69413f87e5a34897cd010ca698097d0a-Abstract-Conference.html

[5] Bei Peng, Tabish Rashid, Christian Schroeder de Witt, Pierre-Alexandre Kamienny, Philip Torr, Wendelin Boehmer, and Shimon Whiteson. “FACMAC: Factored Multi-Agent Centralized Policy Gradients.” NeurIPS, 2021. URL: https://proceedings.neurips.cc/paper/2021/hash/65b9eea6e1cc6bb9f0cd2a47751a186f-Abstract.html[6] Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fudan Sun, Jun Wang, and Yaodong Yang. “Trust Region Policy Optimization in Multi-Agent Reinforcement Learning.” arXiv:2109.11251, 2021. URL: https://arxiv.org/abs/2109.11251

[7] Boyu Zhou, Xin Zhou, Jun Zhang, Fei Gao, and Shaojie Shen. “EGO-Swarm: A Fully Autonomous and Decentralized Quadrotor Swarm System in Cluttered Environments.” ICRA, 2021. DOI: 10.1109/ICRA48506.2021.9561902. URL: https://arxiv.org/abs/2011.04183

[8] Jesus Tordesillas, Brett T. Lopez, and Jonathan P. How. “MADER: Trajectory Planner in Multiagent and Dynamic Environments.” IEEE Transactions on Robotics, 38(1):463-476, 2022. URL: https://arxiv.org/abs/2010.11061[9] Kota Kondo, Reinaldo Figueroa, Juan Rached, Jesus Tordesillas, Parker C. Lusk, and Jonathan P. How. “Robust MADER: Decentralized Multiagent Trajectory Planner Robust to Communication Delay in Dynamic Environments.” arXiv:2303.06222, 2023. URL: https://arxiv.org/abs/2303.06222

[10] Boyu Zhou, Hao Xu, and Shaojie Shen. “RACER: Rapid Collaborative Exploration With a Decentralized Multi-UAV System.” IEEE Transactions on Robotics, 2023. DOI: 10.1109/TRO.2023.3236945. URL: https://arxiv.org/abs/2209.08533

[11] Jesus Tordesillas and Jonathan P. How. “PANTHER: Perception-Aware Trajectory Planner in Dynamic Environments.” IEEE Access, 10:22662-22677, 2022. DOI: 10.1109/ACCESS.2022.3154037. URL: https://arxiv.org/abs/2103.06372[12] Zhepei Wang, Xin Zhou, Chao Xu, and Fei Gao. “Geometrically Constrained Trajectory Optimization for Multicopters.” IEEE Transactions on Robotics, 38(5):3259-3278, 2022. DOI: 10.1109/TRO.2022.3160022. URL: https://arxiv.org/abs/2103.00190

[13] Ang Li, Mark Hansen, and Bo Zou. “Traffic Management and Resource Allocation for UAV-Based Parcel Delivery in Low-Altitude Urban Space.” Transportation Research Part C: Emerging Technologies, 143:103808, 2022. DOI: 10.1016/j.trc.2022.103808. URL: https://doi.org/10.1016/j.trc.2022.103808

[14] Mehdi Bennaceur, Rémi Delmas, and Youssef Hamadi. “Passenger-Centric Urban Air Mobility: Fairness Trade-Offs and Operational Efficiency.” Transportation Research Part C: Emerging Technologies, 136:103519, 2022. DOI: 10.1016/j.trc.2021.103519. URL: https://doi.org/10.1016/j.trc.2021.103519[15] Roberto Pinto and Alexandra Lagorio. “Point-to-Point Drone-Based Delivery Network Design with Intermediate Charging Stations.” Transportation Research Part C: Emerging Technologies, 135:103506, 2022. DOI: 10.1016/j.trc.2021.103506. URL: https://doi.org/10.1016/j.trc.2021.103506

[16] Qinshuang Wei, Gustav Nilsson, and Samuel Coogan. “Capacity-Constrained Urban Air Mobility Scheduling.” arXiv:2107.02900, 2021. URL: https://arxiv.org/abs/2107.02900

[17] Surya Murthy, Natasha A. Neogi, and Suda Bharadwaj. “Scheduling for Urban Air Mobility Using Safe Learning.” arXiv:2209.15457, NASA NTRS, 2022. URL: https://arxiv.org/abs/2209.15457[18] Jiahao Xing, Tong Guo, and Lu Tong. “Reliable Truck-Drone Routing with Dynamic Synchronization: A High-Dimensional Network Programming Approach.” Transportation Research Part C: Emerging Technologies, 165:104698, 2024. DOI: 10.1016/j.trc.2024.104698. URL: https://doi.org/10.1016/j.trc.2024.104698

[19] Bolong Zhou, Wei Zeng, and Hai Yang. “Multi-Trip UAV-UGV Delivery Network Design with Release Times.” Transportation Research Part C: Emerging Technologies, 181:105389, 2025. DOI: 10.1016/j.trc.2025.105389. URL: https://doi.org/10.1016/j.trc.2025.105389

[20] Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, and George Drettakis. “3D Gaussian Splatting for Real-Time Radiance Field Rendering.” ACM Transactions on Graphics / SIGGRAPH, 42(4), 2023. DOI: 10.1145/3592433. URL: https://arxiv.org/abs/2308.04079[21] Xuran Pan, Zihang Lai, Shiji Song, and Gao Huang. “ActiveNeRF: Learning Where to See with Uncertainty Estimation.” ECCV, 2022. URL: https://arxiv.org/abs/2209.08546

[22] Wen Jiang, Boshu Lei, and Kostas Daniilidis. “FisherRF: Active View Selection and Mapping with Radiance Fields Using Fisher Information.” ECCV, 2024. DOI: 10.1007/978-3-031-72624-8_24. URL: https://eccv.ecva.net/virtual/2024/oral/1226

[23] Rui Jin, Yuman Gao, Yingjian Wang, Haojian Lu, and Fei Gao. “GS-Planner: A Gaussian-Splatting-Based Planning Framework for Active High-Fidelity Reconstruction.” arXiv:2405.10142, 2024. URL: https://arxiv.org/abs/2405.10142

[24] Zijun Xu, Rui Jin, Ke Wu, Yi Zhao, Zhiwei Zhang, Jieru Zhao, Fei Gao, Zhongxue Gan, and Wenchao Ding. “HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting.” arXiv:2409.17624, 2024. URL: https://arxiv.org/abs/2409.17624[25] Joey Wilson, Marcelino Almeida, Sachit Mahajan, Martin Labrie, Maani Ghaffari, Omid Ghasemalizadeh, Min Sun, Cheng-Hao Kuo, and Arnab Sen. “POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality.” CVPR, 2025. URL: https://cvpr.thecvf.com/virtual/2025/poster/34708

[26] Shangjie Xue, Jesse Dill, Pranay Mathur, Frank Dellaert, Panagiotis Tsiotras, and Danfei Xu. “Neural Visibility Field for Uncertainty-Driven Active Mapping.” CVPR, 2024. URL: https://arxiv.org/abs/2406.06948

[27] Shuo Feng, Xintao Yan, Haowei Sun, Yiheng Feng, and Henry X. Liu. “Intelligent Driving Intelligence Test for Autonomous Vehicles with Naturalistic and Adversarial Environment.” Nature Communications, 12:748, 2021. DOI: 10.1038/s41467-021-21007-8. URL: https://www.nature.com/articles/s41467-021-21007-8[28] Shuo Feng, Yiheng Feng, Chao Yu, Yi Zhang, and Henry X. Liu. “Testing Scenario Library Generation for Connected and Automated Vehicles, Part I: Methodology.” IEEE Transactions on Intelligent Transportation Systems, 2021. DOI: 10.1109/TITS.2020.2972211. URL: https://doi.org/10.1109/TITS.2020.2972211

[29] Shuo Feng, Yiheng Feng, Haowei Sun, Yi Zhang, and Henry X. Liu. “Testing Scenario Library Generation for Connected and Automated Vehicles, Part II: Case Studies.” IEEE Transactions on Intelligent Transportation Systems, 2021. DOI: 10.1109/TITS.2020.2988309. URL: https://doi.org/10.1109/TITS.2020.2988309[30] Chejian Xu, Wenhao Ding, Baiming Li, Xinqing Chen, Ding Zhao, Bo Li, Jiajun Liu, and Hang Zhao. “SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles.” NeurIPS Datasets and Benchmarks, 2022. URL: https://proceedings.neurips.cc/paper_files/paper/2022/hash/a48ad12d588c597f4725a8b84af647b5-Abstract-Datasets_and_Benchmarks.html

[31] Jason Liu, Ankit Shah, Eric Rosen, and Stefanie Tellex. “Lang2LTL: Translating Natural Language Commands to Temporal Robot Task Specification.” PMLR/CoRL, 229, 2023. URL: https://proceedings.mlr.press/v229/liu23d.html

[32] Francesco Fuggitti and Tathagata Chakraborti. “NL2LTL: A Python Package for Converting Natural Language Instructions to Linear Temporal Logic Formulas.” AAAI Demonstration, 37(13):16428-16430, 2023. DOI: 10.1609/aaai.v37i13.27068. URL: https://ojs.aaai.org/index.php/AAAI/article/view/27068[33] Behrad Rabiei and Mahesh A. Kumar. “LTLCodeGen: Code Generation of Syntactically Correct Temporal Logic for Robot Task Planning.” arXiv:2503.07902, 2025. URL: https://arxiv.org/abs/2503.07902

[34] Jun Wang, David Smith Sundarsingh, Jyotirmoy V. Deshmukh, and Yiannis Kantaros. “ConformalNL2LTL: Translating Natural Language Instructions into Temporal Logic Formulas with Conformal Correctness Guarantees.” arXiv:2504.21022, 2025. URL: https://arxiv.org/abs/2504.21022

[35] Zhonghang Li, Lianghao Xia, Jiabin Tang, Yong Xu, Lei Shi, Long Xia, Dawei Yin, and Chao Huang. “UrbanGPT: Spatio-Temporal Large Language Models.” arXiv:2403.00813, 2024. URL: https://arxiv.org/abs/2403.00813

[36] Yuan Yuan, Jingtao Ding, Jie Feng, Depeng Jin, and Yong Li. “UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction.” KDD, 2024. DOI: 10.1145/3637528.3671662. URL: https://dblp.org/rec/conf/kdd/0032D0J024[37] Jinhui Ouyang, Yijie Zhu, Xiang Yuan, and Di Wu. “TrafficGPT: Towards Multi-Scale Traffic Analysis and Generation with Spatial-Temporal Agent Framework.” arXiv:2405.05985, 2024. URL: https://arxiv.org/abs/2405.05985

[38] Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Jens Beisswenger, Ping Luo, Andreas Geiger, and Hongyang Li. “DriveLM: Driving with Graph Visual Question Answering.” ECCV, 2024. URL: https://github.com/OpenDriveLab/DriveLM

[39] Maximilian Adang, JunEn Low, Ola Shorinwa, and Mac Schwager. “SINGER: An Onboard Generalist Vision-Language Navigation Policy for Drones.” arXiv:2509.18610, 2025. URL: https://arxiv.org/abs/2509.18610[40] Hengxing Cai, Jinhan Dong, Jingjun Tan, Jingcheng Deng, Sihang Li, Zhifeng Gao, Haidong Wang, Zicheng Su, Agachai Sumalee, and Renxin Zhong. “FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models.” EMNLP, 2025. DOI: 10.18653/v1/2025.emnlp-main.338. URL: https://aclanthology.org/2025.emnlp-main.338/

[41] Pranav Saxena, Nishant Raghuvanshi, and Neena Goveas. “UAV-VLN: End-to-End Vision Language guided Navigation for UAVs.” arXiv:2504.21432, 2025. URL: https://arxiv.org/abs/2504.21432[42] Moo Jin Kim, Karl Pertsch, Siddharth Karamcheti, Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, and Chelsea Finn. “OpenVLA: An Open-Source Vision-Language-Action Model.” arXiv:2406.09246, 2024. URL: https://arxiv.org/abs/2406.09246

[43] Octo Model Team, Dibya Ghosh, Homer Walke, Karl Pertsch, Kevin Black, Oier Mees, Sudeep Dasari, Joey Hejna, Tobias Kreiman, Charles Xu, Jianlan Luo, You Liang Tan, Lawrence Yunliang Chen, Pannag Sanketi, Quan Vuong, Ted Xiao, Dorsa Sadigh, Chelsea Finn, and Sergey Levine. “Octo: An Open-Source Generalist Robot Policy.” arXiv:2405.12213, 2024. URL: https://arxiv.org/abs/2405.12213[44] Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, et al. “RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control.” arXiv:2307.15818, 2023. URL: https://arxiv.org/abs/2307.15818

[45] Qwen Team. “Qwen3 Technical Report.” arXiv:2505.09388, 2025; QwenLM/Qwen3 official repository. URL: https://arxiv.org/abs/2505.09388; https://github.com/QwenLM/Qwen3

[46] DeepSeek-AI. “DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning.” arXiv:2501.12948, 2025. URL: https://arxiv.org/abs/2501.12948; model card: https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B

[47] OpenAI. “Model Context Protocol (MCP): OpenAI Agents SDK.” Official documentation, 2026. URL: https://openai.github.io/openai-agents-js/guides/mcp/[48] Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, and Ion Stoica. “Efficient Memory Management for Large Language Model Serving with PagedAttention.” arXiv:2309.06180, 2023. URL: https://arxiv.org/abs/2309.06180

[49] NVIDIA. “NVIDIA TensorRT-LLM.” Official documentation, 2026. URL: https://docs.nvidia.com/tensorrt-llm/index.html

[50] Jinpeng Hu, Wei Wang, Yuxiao Liu, and Jing Zhang. “Large Model in Low-Altitude Economy: Applications and Challenges.” Big Data and Cognitive Computing, 10(1):33, 2026. DOI: 10.3390/bdcc10010033. URL: https://www.mdpi.com/2504-2289/10/1/33

[51] Ji Zhao and Xiao Lin. “General-Purpose Aerial Intelligent Agents Empowered by Large Language Models.” arXiv:2503.08302, 2025. URL: https://arxiv.org/abs/2503.08302