Stereo Camera-Based Aerial Target Geolocation in a Ground-Based Platform Environment
Main Author: | Amir Hilmi Bin Ahmad Azizi |
---|---|
Format: | Thesis |
Language: | en_US |
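Subjects: | Monocular camera, geolocation; Computer Graphics; Geographic information systems; Artificial Intelligence; Global Positioning System |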
id |
my-usim-ddms-12675 |
---|---|
record_format |
uketd_dc |
spelling |
my-usim-ddms-12675 2024-05-29T04:37:51Z Stereo Camera-Based Aerial Target Geolocation in a Ground-Based Platform Environment Amir Hilmi Bin Ahmad Azizi Universiti Sains Islam Malaysia 2023-10 Thesis en_US https://oarep.usim.edu.my/handle/123456789/12675 https://oarep.usim.edu.my/bitstreams/6665b57a-42dd-484c-b84f-f8fdf3a62a7f/download 8a4605be74aa9ea9d79846c1fba20a33 Monocular camera, geolocation Computer Graphics Geographic information systems. Artificial Intelligence Global Positioning System. |
institution |
Universiti Sains Islam Malaysia |
collection |
USIM Institutional Repository |
language |
en_US |
topic |
Monocular camera, geolocation; Computer Graphics; Geographic information systems; Artificial Intelligence; Global Positioning System |
description |
Across surveillance, rescue, and scientific research applications, there is a
specific need to detect and localize targets on the ground from highly efficient
aerial platforms. Aerial target geolocation has developed rapidly in recent years,
driven by the explosive growth of unmanned aerial vehicles (UAVs) and artificial
intelligence (AI) computing. Existing solutions that use a monocular camera as the
vision-based geolocation sensor are prone to geolocation error and carry the
inconvenient requirement of calibrating the camera's intrinsic parameters before
operation. Studies on the potential of stereo vision technology for geolocating
targets from aerial platforms remain scarce, especially when integrated with
automated detection and edge processing. This study aimed to develop a stereo
vision-based aerial target geolocation system that is effective and operationally
flexible. The research comprises three modules: (1) stereo vision-based
geolocation modeling, (2) deep learning-based object detection modeling, and
(3) evaluation of edge computer performance. The novel stereo vision-based aerial
target geolocation algorithm is formulated using a constructive model development
technique, whereby the stereo vision point cloud information of the detected target
supplies the distance and angle parameters for the radar target tracking model and
the projected coordinate system. The relevant Darknet-based object detection models
were evaluated, each trained on the VisDrone2018 aerial training dataset. The
detection and geolocation modules were executed on the Nvidia Jetson TX2 edge
computing platform to determine the system's feasibility. The proposed geolocation,
detection, and processing modules were evaluated and validated using ground-based
platform field experiments and MATLAB. The field experiment results validated the
proposed stereo vision-based aerial target geolocation, demonstrating a mean
geolocation error of 0.53 meters at the 5-meter testing height. In addition, the
selected detection model, YOLOv4, achieved a detection accuracy of 30.69% mean
Average Precision (mAP) when tested with the official test tool, and its detection
speed satisfied the safety requirement of 2 frames per second (FPS) for aerial
applications. Further, the edge computing platform exhibited a low power consumption
of 5 watts and remained within the manufacturer's system operating standards. These
findings demonstrate the feasibility of the proposed stereo vision-based aerial
target geolocation system as a payload for aerial platforms and as a reference
framework for Search and Rescue (SAR) agencies in detecting distressed humans. |
format |
Thesis |
author |
Amir Hilmi Bin Ahmad Azizi |
title |
Stereo Camera-Based Aerial Target Geolocation in a Ground-Based Platform Environment |
granting_institution |
Universiti Sains Islam Malaysia |
_version_ |
1812444692472659968 |
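
The description above states that the target's stereo point cloud supplies distance and angle parameters that are then mapped into a projected coordinate system. The record does not give the thesis's equations, so the following is only a minimal sketch of that general idea, not the author's algorithm. It assumes the target's point-cloud position has already been reduced to horizontal offsets in a levelled platform frame (`forward_m`, `right_m`), that the platform's heading is measured clockwise from grid north, and that the platform's own easting and northing are known. The function and parameter names are hypothetical.

```python
import math


def geolocate(platform_easting, platform_northing, heading_deg, forward_m, right_m):
    """Convert a target's horizontal offset into projected coordinates.

    Assumptions (not taken from the thesis): offsets are in metres in a
    levelled platform frame, heading is clockwise from grid north, and
    coordinates are in a metric projected system such as UTM.
    """
    # Distance parameter: horizontal range from platform to target.
    ground_range = math.hypot(forward_m, right_m)
    # Angle parameter: bearing of the target relative to the platform's nose.
    relative_bearing = math.atan2(right_m, forward_m)
    # Absolute azimuth of the target, clockwise from grid north.
    azimuth = math.radians(heading_deg) + relative_bearing
    # Polar (range, azimuth) to rectangular (easting, northing) conversion.
    easting = platform_easting + ground_range * math.sin(azimuth)
    northing = platform_northing + ground_range * math.cos(azimuth)
    return easting, northing
```

Under these assumptions, a platform at (1000, 2000) facing east (heading 90 degrees) that sees a target 3 m ahead and 4 m to its right would place the target 3 m further east and 4 m further south. A real system would also need to account for camera mounting angles and platform attitude, which this sketch leaves out.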