航天遥感图像智能处理与分析专刊

强化学习驱动的退化遥感图像目标检测方法

  • 刘文林 ,
  • 胡锡坤 ,
  • 钟平
展开
  • 国防科技大学 电子科学学院,长沙 410073
E-mail: xikun@nudt.edu.cn

收稿日期: 2025-10-09

  修回日期: 2025-10-24

  录用日期: 2025-11-25

  网络出版日期: 2025-11-28

基金资助

国家自然科学基金(62301574);湖南省科技创新计划(2024RC3119)

Reinforcement learning-driven object detection method for degraded remote sensing images

  • Wenlin LIU ,
  • Xikun HU ,
  • Ping ZHONG
Expand
  • College of Electronic Science and Technology,National University of Defense Technology,Changsha 410073,China
E-mail: xikun@nudt.edu.cn

Received date: 2025-10-09

  Revised date: 2025-10-24

  Accepted date: 2025-11-25

  Online published: 2025-11-28

Supported by

National Natural Science Foundation of China(62301574);Science and Technology Innovation Program of Hunan Province(2024RC3119)

摘要

卫星遥感图像目标检测技术是当前对地观测与智能解译的重要手段。然而,现有研究多集中于理想成像条件下,对于复杂天气、大气扰动及噪声干扰等复杂因素影响下的目标检测能力仍显不足。针对这一问题,一种强化学习驱动的退化遥感图像目标检测方法被提出,其通过动态编排图像预处理算子以实现复杂场景下的鲁棒检测。该方法的核心思想是以目标检测性能为优化目标,利用强化学习的决策优势,自适应地迭代选择并组合图像去噪、去模糊、对比度增强等预处理操作,从而提升遥感影像的质量与检测精度。在使用YOLO11m-OBB检测器基于DIOR和DOTA卫星遥感数据集构建的退化场景上进行的实验表明,所提方法均取得优异表现:在DIOR数据集上,相较于Raw-Syn(原始数据训练、退化场景数据验证)和Syn-Syn(退化场景数据训练与验证)方案,mAP50分别提升11.1%和2.5%,最终达80.8%;在DOTA数据集上,mAP50较Raw-Syn和Syn-Syn分别提升7.2%和2.8%,最终达76.6%。同时,处理后遥感影像的图像质量明显提升(PSNR>25 dB),充分验证了所提方法在复杂环境下的有效性与适用性。

本文引用格式

刘文林 , 胡锡坤 , 钟平 . 强化学习驱动的退化遥感图像目标检测方法[J]. 航空学报, 2026 , 47(10) : 532861 -532861 . DOI: 10.7527/S1000-6893.2025.32861

Abstract

Satellite remote sensing image object detection constitutes a pivotal technique for Earth observation and intelligent interpretation. However, most existing research has concentrated on ideal imaging conditions, and the resulting detection performance remains notably insufficient under complex degradations, such as adverse weather, atmospheric turbulence, and noise interference. To address this limitation, a reinforcement learning-based adaptive object detection methodology is proposed for degraded remote sensing images. This methodology achieves robust detection in complex scenarios by dynamically orchestrating image preprocessing operators. The core principle is to optimize object detection performance by leveraging reinforcement learning’s decision-making capability to adaptively and iteratively select and compose preprocessing operations, including denoising, deblurring, and contrast enhancement, thereby improving both remote sensing imagery quality and detection precision. Experiments on degraded scenarios constructed from the DIOR and DOTA satellite remote sensing datasets with the YOLO11m-OBB detector demonstrate that the proposed method achieves superior performance in all cases. On DIOR, the proposed method achieves mAP50 improvements of 11.1% and 2.5% over Raw-Syn (trained on pristine data, validated on degraded data) and Syn-Syn (trained and validated on degraded data) baselines, respectively, achieving a final mAP50 of 80.8%. On DOTA, mAP50 is improved by 7.2% and 2.8% over the same baselines, reaching 76.6%. Furthermore, the quality of processed remote sensing imagery is significantly enhanced (PSNR > 25 dB), substantiating the efficacy and applicability of the proposed approach in challenging environments.

参考文献

[1] 臧晶, 李成华, 田野. 卫星遥感农业监测系统中实例检索算法研究[J]. 宇航学报201940(11): 1358-1366.
  ZANG J, LI C H, TIAN Y. Research on case retrieval algorithm in satellite remote sensing monitoring system for agriculture[J]. Journal of Astronautics201940(11): 1358-1366 (in Chinese).
[2] 李志忠, 卫征, 付垒, 等. 我国遥感卫星技术与应用重要进展[J]. 卫星应用2025(4): 16-19.
  LI Z Z, WEI Z, FU L, et al. Important progress in China’s remote sensing satellite technology and application[J]. Satellite Application2025(4): 16-19 (in Chinese).
[3] 王俊杰, 李清泉, 邬国锋. 红树林定量遥感研究进展[J]. 遥感学报202529(6): 1769-1787.
  WANG J J, LI Q Q, WU G F. Progress in quantitative remote sensing research of mangroves[J]. National Remote Sensing Bulletin202529(6): 1769-1787 (in Chinese).
[4] 莫妮卡. 卫星遥感图像舰船目标检测系统[D]. 杭州:浙江大学,2022: 1-2.
  MO N K. Ship detection system based on satellite remote sensing images[D]. Hangzhou: Zhejiang University, 2022: 1-2 (in Chinese).
[5] 刘瑞锦, 何章鸣. 基于YOLOv8的卫星遥感图像快速目标检测方法[J]. 空间控制技术与应用202349(5): 89-97.
  LIU R J, HE Z M. A fast target detection method for satellite remote sensing images based on YOLOv8[J]. Aerospace Control and Application202349(5): 89-97 (in Chinese).
[6] 赵其昌, 吴一全, 苑玉彬. 光学遥感图像舰船目标检测与识别方法研究进展[J]. 航空学报202445(8): 029025.
  ZHAO Q C, WU Y Q, YUAN Y B. Progress of ship detection and recognition methods in optical remote sensing images[J]. Acta Aeronautica et Astronautica Sinica202445(8): 029025 (in Chinese).
[7] XI Y, JIA W J, MIAO Q G, et al. CoDerainNet: Collaborative deraining network for drone-view object detection in rainy weather conditions[J]. Remote Sensing202315(6): 1487-1508.
[8] ASWINI N, UMA S V. Drone image de-noising and feature extraction[C]∥2020 IEEE International Conference for Innovation in Technology. Piscataway: IEEE Press, 2020: 1-6.
[9] KIM J I, HYUN C U, HAN H, et al. Digital surface model generation for drifting Arctic sea ice with low-textured surfaces based on drone images[J]. ISPRS Journal of Photogrammetry and Remote Sensing2021172: 147-159.
[10] QIAN G C, WANG Y H, GU J J, et al. Rethinking learning-based demosaicing, denoising, and super-resolution pipeline[C]∥2022 IEEE International Conference on Computational Photography. Piscataway: IEEE Press, 2022: 1-12.
[11] XING W Z, EGIAZARIAN K. End-to-end learning for joint image demosaicing, denoising and super-resolution[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2021: 3507-3516.
[12] SUGANUMA M, LIU X, OKATANI T. Attention-based adaptive selection of operations for image restoration in the presence of unknown combined distortions[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2019: 9039-9048.
[13] KIM C, KIM T H, BAIK S. LAN: Learning to adapt noise for image denoising[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2024: 25193-25202.
[14] LIU Y, LI W, GUAN J, et al. Effective cloud removal for remote sensing images by an improved mean-reverting denoising model with elucidated design space[C]∥Proceedings of the IEEE Computer Vision and Pattern Recognition Conference. Piscataway: IEEE Press, 2025: 17851-17861.
[15] ZHANG J, ZHANG Q, ZHAO X, et al. Boosting denoisers with reinforcement learning for image restoration[J]. Soft Computing202226(7): 3261-3272.
[16] FURUTA R, INOUE N, YAMASAKI T. PixelRL: Fully convolutional network with reinforcement learning for image processing[J]. IEEE Transactions on Multimedia202022(7): 1704-1719.
[17] YU K, DONG C, LIN L, et al. Crafting a toolchain for image restoration by deep reinforcement learning[C]∥Proceedings of the IEEE Computer Vision and Pattern Recognition Conference. Piscataway: IEEE Press, 2018: 2443-2452.
[18] SHIN U, LEE K, KWEON I S. DRL-ISP: Multi-objective camera ISP with deep reinforcement learning[C]∥2022 IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway: IEEE Press, 2022: 7044-7051.
[19] YU K, WANG X T, DONG C, et al. Path-restore: Learning network path selection for image restoration[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence202244(10): 7078-7092.
[20] WEI Z Y, CHEN H H, NAN L L, et al. PathNet: Path-selective point cloud denoising[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence202446(6): 4426-4442.
[21] 范天麒, 邹征夏, 史振威. 基于强化学习数据合成的典型遥感目标检测[J]. 航空学报202546(23): 631955.
  FAN T Q, ZOU Z X, SHI Z W. Typical remote sensing target detection with data synthesis based on reinforcement learning[J]. Acta Aeronautica et Astronautica Sinica202546(23): 631955 (in Chinese).
[22] HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 770-778.
[23] RONNEBERGER O, FISCHER P, BROX T. U-Net: Convolutional networks for biomedical image segmentation[C]∥International Conference on Medical Image Computing and Computer-Assisted Intervention. 2015: 234-241.
[24] HAARNOJA T, ZHOU A, ABBEEL P, et al. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor[C]∥Proceedings of the 35th International Conference on Machine Learning. Piscataway: IEEE Press, 2018: 1861-1870.
[25] MNIH V, KAVUKCUOGLU K, SILVER D, et al. Human-level control through deep reinforcement learning[J]. Nature2015518(7540): 529-533.
[26] SCHULMAN J, WOLSKI F, DHARIWAL P, et al. Proximal policy optimization algorithms[DB/OL]. arXiv preprint: 1707.06347, 2017.
[27] REN S, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence201739(6): 1137-1149.
[28] JOCHER G, QIU J. Ultralytics YOLO11[EB/OL]. (2025-08-30) [2025-09-30]. .
[29] XIE X X, CHENG G, WANG J B, et al. Oriented R-CNN for object detection[C]∥Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2021: 3500-3509.
[30] LI K, WAN G, CHENG G, et al. Object detection in optical remote sensing images: A survey and a new benchmark[J]. ISPRS Journal of Photogrammetry and Remote Sensing2020159: 296-307.
[31] XIA G S, BAI X, DING J, et al. DOTA: A large-scale dataset for object detection in aerial images[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 3974-3983.
[32] LIU Z, LIN Y T, CAO Y, et al. Swin Transformer: Hierarchical vision transformer using shifted windows[C]∥ Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2021: 9992-10002.
[33] LIN X Q, YU F H, HU J F, et al. Harnessing diffusion-yielded score priors for image restoration[J]. ACM Transactions on Graphics202544(6): 1-21.
[34] HUANG X H, LIU S Q, ZHANG K, et al. Reverse convolution and its applications to image restoration[C]∥ Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2025: 10507-10516.
[35] TIAN Y, YE Q, DOERMANN D. YOLOv12: Attention-centric real-time object detectors[C]∥Advances in Neural Information Processing Systems 38 (NeurIPS 2025). 2025: 1-12.
文章导航

/