首页 >

强化学习驱动的复杂遥感场景目标检测方法-航天遥感图像智能处理与分析

刘文林,胡锡坤,钟平   

  1. 国防科技大学电子科学学院
  • 收稿日期:2025-10-09 修回日期:2025-11-27 发布日期:2025-11-28
  • 通讯作者: 胡锡坤
  • 基金资助:
    国家自然科学基金

Reinforcement learning-driven object detection method for complex remote sensing scenes

  • Received:2025-10-09 Revised:2025-11-27 Published:2025-11-28
  • Contact: Xi-kun Hu
  • Supported by:
    National Natural Science Foundation of China

摘要: 卫星遥感图像目标检测技术是当前对地观测与智能解译的重要手段。然而,现有研究多集中于理想成像条件下,对于复杂天气、大气扰动及噪声干扰等复杂退化条件下的目标检测能力仍显不足。针对这一问题,本文提出了一种强化学习驱动的退化遥感图像目标检测方法,通过动态编排图像预处理算子以实现复杂场景下的鲁棒检测。该方法的核心思想是以目标检测性能为优化目标,利用强化学习的决策优势,自适应地迭代选择并组合图像去噪、去模糊、对比度增强等预处理操作,从而提升遥感影像的质量与检测精度。在使用YOLO11-OBB检测器基于DIOR和DOTA卫星遥感数据集构建的退化场景上进行的实验表明,所提方法均取得优异表现:在DIOR数据集上,相较于Raw-Syn(原始数据训练、退化场景数据验证)和Syn-Syn(退化场景数据训练与验证)方案,mAP50分别提升11.1%和2.5%,最终达80.8%;在DOTA数据集上,mAP50较Raw-Syn和Syn-Syn分别提升7.2%和2.8%,最终达76.6%。同时,处理后遥感影像的图像质量明显提升(PSNR>25dB),充分验证了所提方法在复杂环境下的有效性与适用性。

关键词: 卫星遥感, 目标检测, 复杂场景, 图像预处理, 强化学习, 自适应方法, 鲁棒性

Abstract: Satellite remote sensing image object detection constitutes a pivotal technique for earth observation and intelligent interpretation. However, most existing research has concentrated on ideal imaging conditions, and the capability for object detection under complex degradations, such as adverse weather, atmospheric turbulence, and noise interference, remains notably inadequate. To address this limitation, this paper proposes a reinforcement learning-based adaptive object detection methodology that dynamically orchestrates image preprocessing operators to achieve robust detection in complex scenarios. The core principle is to optimize object detection performance by leveraging reinforcement learning's decision-making capability to adaptively and iteratively select and compose preprocessing operations, including denoising, deblurring, and contrast enhancement, thereby enhancing both remote sensing imagery quality and detection precision. Experiments on the degraded scenarios constructed from the DIOR and DOTA satellite remote-sensing datasets with the YOLO11-OBB detector demonstrate that the proposed method achieves superior performance in all cases: on DIOR, the proposed method achieves mAP50 improvements of 11.1 and 2.5 points over Raw-Syn (trained on pristine data, validated on degraded data) and Syn-Syn (trained and validated on degraded data) baselines, respectively, attaining a final mAP50 of 80.8; on DOTA, it improves mAP50 by 7.2 and 2.8 points over the same baselines, reaching 76.6. Furthermore, the quality of processed remote sensing imagery is significantly enhanced (PSNR > 25 dB), substantiating the efficacy and applicability of the proposed approach in challenging environments.

Key words: Satellite remote sensing, Object detection, Complex scenarios, Image pre-processing, Reinforcement learning, Adaptive methods, Robustness

中图分类号: