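Reinforcement learning-driven object detection method for complex remote sensing scenes - Intelligent Processing and Analysis of Aerospace Remote Sensing Images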

doi:10.7527/S1000-6893.2025.32861



刘文林 (Wen-lin Liu), 胡锡坤 (Xi-kun Hu), 钟平 (Ping Zhong)

College of Electronic Science and Technology, National University of Defense Technology (国防科技大学电子科学学院)

Reinforcement learning-driven object detection method for complex remote sensing scenes

Received: 2025-10-09  Revised: 2025-11-27  Published: 2025-11-28
Corresponding author: Xi-kun Hu (胡锡坤)
Supported by: National Natural Science Foundation of China

Abstract: Object detection in satellite remote sensing images is a key technique for Earth observation and intelligent interpretation. Most existing work, however, assumes ideal imaging conditions, and detection under complex degradations such as adverse weather, atmospheric turbulence, and noise interference remains inadequate. To address this, the paper proposes a reinforcement learning-driven object detection method for degraded remote sensing images that dynamically orchestrates image preprocessing operators to achieve robust detection in complex scenes. The core idea is to take detection performance as the optimization objective and to use the decision-making strength of reinforcement learning to adaptively and iteratively select and combine preprocessing operations such as denoising, deblurring, and contrast enhancement, improving both image quality and detection accuracy. Experiments use the YOLO11-OBB detector on degraded scenes constructed from the DIOR and DOTA satellite remote sensing datasets. On DIOR, the method improves mAP50 by 11.1 and 2.5 points over the Raw-Syn (trained on original data, validated on degraded data) and Syn-Syn (trained and validated on degraded data) baselines, respectively, reaching 80.8%; on DOTA, it improves mAP50 by 7.2 and 2.8 points over the same baselines, reaching 76.6%. The processed images also show clearly higher quality (PSNR > 25 dB), supporting the effectiveness and applicability of the method in complex environments.

Key words: Satellite remote sensing, Object detection, Complex scenarios, Image pre-processing, Reinforcement learning, Adaptive methods, Robustness
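The abstract describes the method only at a high level, so the following is a minimal sketch of the underlying idea (iteratively choosing preprocessing operators so as to maximize a score), not the authors' implementation. Several pieces are assumptions made purely for illustration: the three NumPy operators, all function names, the greedy one-step-lookahead rule that stands in for a learned reinforcement learning policy, and the use of PSNR against a known clean image as the score where the paper uses detection performance.

```python
import numpy as np


def denoise(img):
    """3x3 mean filter with edge padding (hypothetical denoising operator)."""
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    return sum(padded[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0


def sharpen(img):
    """Unsharp masking, a crude stand-in for a deblurring operator."""
    return np.clip(img + (img - denoise(img)), 0.0, 1.0)


def stretch(img):
    """Linear contrast stretch to the range [0, 1]."""
    lo, hi = img.min(), img.max()
    return (img - lo) / (hi - lo) if hi > lo else img.copy()


# Action set: each action applies one preprocessing operator.
OPERATORS = {"denoise": denoise, "sharpen": sharpen, "stretch": stretch}


def psnr(ref, img, peak=1.0):
    """Peak signal-to-noise ratio in dB for images scaled to [0, peak]."""
    mse = np.mean((ref - img) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)


def greedy_policy(img, score_fn):
    """Assumed stand-in for a learned policy: pick the operator with the
    largest immediate score gain, or None (stop) if nothing improves."""
    base = score_fn(img)
    gains = {name: score_fn(op(img)) - base for name, op in OPERATORS.items()}
    best = max(gains, key=gains.get)
    return best if gains[best] > 0 else None


def rollout(img, score_fn, max_steps=5):
    """Apply operators one at a time until the policy stops or steps run out."""
    plan = []
    for _ in range(max_steps):
        action = greedy_policy(img, score_fn)
        if action is None:
            break
        img = OPERATORS[action](img)
        plan.append(action)
    return img, plan


# Synthetic example: a bright square degraded by low contrast and noise.
rng = np.random.default_rng(0)
clean = np.zeros((32, 32))
clean[8:24, 8:24] = 1.0
degraded = np.clip(0.5 * clean + 0.25 + rng.normal(0.0, 0.05, clean.shape), 0.0, 1.0)

restored, plan = rollout(degraded, lambda x: psnr(clean, x))
print("operator sequence:", plan)
print("PSNR before / after (dB):", psnr(clean, degraded), psnr(clean, restored))
```

In a setting closer to the one the abstract describes, `score_fn` would be a detection metric computed with the detector and the greedy rule would be replaced by a trained policy. Neither is shown here because the abstract does not specify them.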

CLC number: TP391.41

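LIU W L, HU X K, ZHONG P. Reinforcement learning-driven object detection method for complex remote sensing scenes - Intelligent Processing and Analysis of Aerospace Remote Sensing Images[J]. Acta Aeronautica et Astronautica Sinica (航空学报), doi: 10.7527/S1000-6893.2025.32861 (in Chinese).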



