interacting with a physical screen, users receive clear
physical feedback, while when performing gesture
interactions in the air, this direct feedback mechanism
is lacking, restricting positioning accuracy. In
complex dynamic environments, such as forest fire
scenes, environmental factors (such as smoke, light
changes) will interfere with the sensor's recognition
of user gestures, further increasing the positioning
error. To solve this problem, it is necessary to
combine visual Simultaneous Localization and
Mapping (SLAM) technology. Visual SLAM
technology can construct a real-time environmental
map by identifying and tracking feature points in the
environment, thereby accurately determining the
position of user gestures in space, reducing
positioning errors, and significantly improving
interaction accuracy (Liu et al., 2021). This allows
inspectors to more accurately operate the holographic
image through gestures, achieving precise positioning
and processing of problem areas.
3.3.2 Spatio - Temporal Alignment Problem
In the time dimension, the data collection frequencies
of different types of sensors vary significantly. For
example, a temperature sensor may collect data once
per second, while a camera can capture 30 frames per
second. This difference in frequency will lead to time
asynchrony of data. If the data is directly fused, the
analysis results will be deviated and unable to
accurately reflect the true state of the environment. To
solve the time - synchronization problem, methods
such as dynamic interpolation or resampling are
usually adopted. Dynamic interpolation inserts
reasonable data values within the time interval
according to the changing trend of known data points;
resampling resamples the data to make the data of
different sensors reach the same frequency in time,
ensuring the accuracy of data fusion.
In the space dimension, accuracy errors between
the GPS coordinates of sensors and the geographic
information (GIS) of images also cause problems. For
example, in the fire- positioning scenario, there may
be a certain deviation in the GPS positioning of
sensors, and errors may also occur during the
processing and matching of geographic information
in images. The superposition of these two factors will
lead to a deviation in fire positioning, affecting
subsequent rescue decisions. To solve the space-
alignment problem, it is necessary to accurately
calibrate the GPS data of sensors and the GIS
information of images. Through technical means such
as coordinate transformation and matching
algorithms, accuracy errors can be reduced, ensuring
the spatial consistency of data from different sources
(Yang et al., 2022).
3.3.3 Advantages of Holographic Interaction
The cloud - based AI model has a powerful gesture -
parsing ability and can accurately identify various
gesture actions of inspectors, such as pinch - to -
zoom, swipe - to - rotate, etc. When an inspector
makes a pinch gesture, the cloud - based AI model
will dynamically adjust the zoom ratio of the
holographic image according to the preset algorithm,
allowing the inspector to view the environmental data
of the area of interest in more detail. The swipe - to -
rotate gesture is used to change the viewing angle of
the holographic image, enabling the inspector to
observe the environment from different angles for
comprehensive monitoring. Through these gesture
operations, inspectors can intuitively see information
such as geographical location and terrain elevation
differences, enhancing their perception of the
environmental situation. In the forest fire monitoring
scenario, when an inspector "grabs" the holographic
fire source marker through a gesture, the cloud - based
AI model will quickly trigger the drone inspection
and fire - fighting command after recognizing the
operation, achieving efficient fire emergency
handling and greatly improving the monitoring and
response efficiency.
3.3.4 Advantages of Edge - Cloud
Collaborative Architecture
In the edge - cloud collaborative architecture, edge
nodes play a crucial role. Edge nodes such as
NVIDIA Jetson have strong local data - processing
capabilities and can process a large amount of raw
data locally at the data source. By performing
operations such as data filtering, noise reduction, and
anomaly detection, edge nodes can process 80% of
the original data locally and only upload the key data
to the cloud. This process greatly reduces the amount
of data transmission, thereby significantly reducing
the demand for transmission bandwidth and enabling
the end - to - end latency to be controlled within 20ms,
ensuring the real - time performance of the
monitoring system.
In addition, edge nodes are highly independent
and reliable. In the event of a network outage, edge
nodes can still execute local decisions based on
locally stored data and preset decision - making logic.
For example, when abnormal environmental
parameters (such as excessive smoke concentration,
sudden temperature increase) are detected, even if