Active perception from an information theory perspective: Fisher Information and Cramér-Rao lower bound
UAV Perception Planning Series · Part X Focus: Basics of information theory, active sensing framework, calculation of Fisher Information and application in SLAM
1. What is active perception?
Traditional perception is passive: the robot receives sensor data and updates a model of the environment.
Active perception goes one step further: the robot actively chooses “where to look” to maximize the value of the task.
被动感知:
传感器 → 数据 → 地图更新(机器人不动)
主动感知:
当前地图 → 信息价值评估 → 最优下一视角选择 → 移动 → 传感器 → 地图更新
↑
核心问题:如何量化"信息价值"?
For UAVs, active sensing is particularly critical:
- Energy Constraint: Flying consumes energy and cannot fly randomly.
- Wide field of view: When moving in the air, the field of view changes drastically, and it is crucial to choose the optimal path.
- Three-dimensional space: Buildings, mountains, and trees all need to be observed from multiple angles for complete modeling.
2. Mathematical foundation of information theory
2.1 Fisher Information
Given a probability model
Intuitive understanding:
- If
changes very steeply near , it means that the data is very sensitive to → Fisher Information large - If
changes flat around , the data is not sensitive to → Fisher Information small
Scalar vs matrix form:- Scalar:
- Matrix: Fisher Information Matrix (FIM)
(multidimensional parameters)
FIM is the Riemannian metric tensor in parameter space, which determines how accurately you can estimate parameters.
2.2 Cramér-Rao Lower Bound (CRLB)
The Cramér-Rao lower bound is a core application of Fisher Information: gives an optimal lower bound on the variance of an unbiased estimator.
Physical meaning: No matter what estimation method you use (as long as it is unbiased), the estimation accuracy cannot exceed
Meaning in SLAM:
- The lower bound of the covariance of the robot pose
is determined by FIM - The smaller the inverse of FIM → the more accurate the estimate
2.3 Mutual Information
Mutual information measures the statistical dependence between two random variables:
Meaning in active perception:
= future sensor observations = the uncertain state of the current map
Maximizing
3. Active sensing framework
3.1 Core issue: Next-Best-View (NBV)
The core problem of active sensing is NBV planning: given the currently observed area, where should we move next to most effectively reduce uncertainty?
Mathematical form of the NBV problem:
That is: Choose action
3.2 Three major components of active sensing system
Information theory active perception framework proposes three components of an active perception system:
┌─────────────────────────────────────────────────────────┐
│ Active Perception System │
│ │
│ Component 1: 状态估计 & 地图表示 │
│ (State Estimation & Map Representation) │
│ → 当前已观测区域的完整表示(几何 + 语义) │
│ │
│ Component 2: 未来观测合成 │
│ (Generative Model of Future Observations) │
│ → 给定候选动作,生成未来会看到的图像/传感器数据 │
│ │
│ Component 3: 信息驱动的规划 │
│ (Information-Driven Planning) │
│ → 在候选轨迹上计算互信息,选择最优 │
└─────────────────────────────────────────────────────────┘
**Why do you need Component 2 (generated model)? **
- You can’t really fly out and try every location (too expensive)
- You need a model to “imagine” what you would see by flying to each candidate location
- NeRF/3DGS are perfect generative models (already written about it in your blog!)
4. Application of Fisher Information in SLAM
4.1 FIM in SLAM
In visual SLAM, the robot needs to simultaneously estimate:
- Pose
(where is the camera) - Map Point
(where is the 3D point in space)
Observation model:
is the projection function (3D → 2D image coordinates) is the measurement noiseObserved Fisher Information:
Key Insights:
- Observing the same 3D point, different perspectives produce different Fisher Information
- The deeper the observation depth (the further away), the smaller the amount of information
- The larger the observation baseline (the greater the change in viewing angle), the greater the amount of information
**This is why UAVs need to actively choose their perspective! **
4.2 Interpretation of classic papers
FIT-SLAM (arXiv, January 2024)
Paper: FIT-SLAM — Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments Author: Suchetan Saravanan, Corentin Chauffaut, Caroline Chanel, Damien Vivet Source: arXiv:2401.09322, January 2024
Core contribution:
- Explicitly introduce Fisher Information into the objective function of Active SLAM
- Also consider Traversability - not just “see clearly”, but also “fly”
- Targeted at 3D environment (non-planar), suitable for UAV exploration in complex urban canyons
Note: This paper was published on arXiv (it was submitted to IEEE ICARA 2024). No clear publication record has been found at the top conference. The arXiv version should be noted when citing.
---#### Active View Planning for Visual SLAM: Continuous Information Modeling (arXiv, 2022/2023)
Paper: Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling Author: Zhihao Wang, Haoyao Chen, Shiwu Zhang, Yunjiang Lou Source: arXiv:2211.xxxxx, 2022
Core contribution:
- Proposed continuous information modeling to replace discrete information grids
- Optimize the next view on a continuous space rather than a discrete set of candidate points
- Model spatial uncertainty using Gaussian Process (GP)
Key Insights:
Traditional methods discretize space into candidate points → information gain is only evaluated on this limited set of points
Continuous method: Use GP to represent “the amount of information at any position”, and then directly optimize in the continuous space
Advantages over UAV:
- The motion space of UAV is continuous and should not be forced to discretize
- Ability to optimize complete 6-DoF trajectories rather than just discrete waypoint selections
5. Information gain calculation for active sensing
5.1 Information gain based on Fisher Information
Information Gain = FIM change before and after the action:
\hat{I}(X; Y) = \frac{1}{N} \sum_{i=1}^N \log \frac{p(x_i|y_i)}{p(x_i)}