Boost UAV-based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Liu, Fan
Yao, Liang
Zhang, Chuanyi
Wu, Ting
Zhang, Xinlei
Jiang, Xiruo
Zhou, Jun
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2025
Size
File type(s)
Location
License
Abstract

Detecting objects from Unmanned Aerial Vehicles (UAV) is often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically utilize multi-stage inferences. Despite their remarkable detecting accuracies, real-time efficiency is sacrificed, making them less practical to handle real applications. To this end, we propose to improve the single-stage inference accuracy through learning scale-invariant features. Specifically, a Scale-Invariant Feature Disentangling module is designed to disentangle scale-related and scale-invariant features. Then an Adversarial Feature Learning scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection. Furthermore, we construct a multi-modal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach can effectively improve model accuracy and achieve state-of-the-art (SoTA) performance on three datasets. Our code and dataset are publicly available at https://github.com/1e12Leon/SIFDAL.

Journal Title

IEEE Transactions on Geoscience and Remote Sensing

Conference Title
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

This work is covered by copyright. You must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a specified licence, refer to the licence for details of permitted re-use. If you believe that this work infringes copyright please make a copyright takedown request using the form at https://www.griffith.edu.au/copyright-matters.

Item Access Status
Note

This publication has been entered in Griffith Research Online as an advance online version.

Access the data
Related item(s)
Subject

Earth sciences

Engineering

Persistent link to this record
Citation

Liu, F; Yao, L; Zhang, C; Wu, T; Zhang, X; Jiang, X; Zhou, J, Boost UAV-based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning, IEEE Transactions on Geoscience and Remote Sensing, 2025

Collections