| [1] Guo L, Wang Y, Liu Y, et al. Ultralight convolutional neural network for automatic modulation classification in internet of unmanned aerial vehicles[J]. IEEE Internet of Things Journal, 2024, 11(11): 20831-20839.[2] Liang P P, Ling C K, Cheng Y, et al. Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications[C]//ICLR. 2024.[3] 欧阳昱中,韩锐,刘驰.边缘侧领域自适应中长尾视觉识别技术研究[J/OL].计算机工程,1-10[2025-05-20]. https://doi.org/10.19678/j.issn.1000-3428.0069287.[4] He X, Wang Y, Zhao S, et al. Co-attention fusion network for multimodal skin cancer diagnosis[J]. Pattern Recognition, 2023, 133: 108990. [5] Lu Y, Zhao W, Sun N, et al. Enhancing multimodal knowledge graph representation learning through triple contrastive learning[C]//Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence. 2024: 5963-5971.[6] Zhang X, Demiris Y. Visible and infrared image fusion using deep learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(8): 10535-10554.[7] Tu Y, Lin Y, Hou C, et al. Complex-valued networks for automatic modulation classification[J]. IEEE Transactions on Vehicular Technology, 2020, 69(9): 10085-10089.[8] Guo L, Liu C, Liu Y, et al. Toward open-set specific emitter identification using auxiliary classifier generative adversarial network and OpenMax[J]. IEEE Transactions on Cognitive Communications and Networking, 2024, 10(6): 2019-2028.[9] Zhang Y, Latham P E, Saxe A. Understanding unimodal bias in multimodal deep linear networks[J]. arXiv preprint arXiv:2312.00935, 2023.[10] Wang J, Xu C, Zhao C, et al. Multimodal object detection of UAV remote sensing based on joint representation optimization and specific information enhancement[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 17: 12364-12373.[11] Huang C, Cai W, Jiang Q, et al. Multimodal representation distribution learning for medical image segmentation[C]//Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence. 2024: 4156-4164.[12] 郭浩, 李欣奕, 唐九阳, 等. 自适应特征融合的多模态实体对齐研究[J]. 自动化学报, 2024, 50(4): 758-770.[13] Ma H, He D, Wang X, et al. Multi-modal sarcasm detection based on dual generative processes[C]//Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence. 2024: 2279-2287.[14] Zhang X, Yoon J, Bansal M, et al. Multimodal representation learning by alternating unimodal adaptation[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2024: 27456-27466.[15] 韩佳艺, 刘建伟, 陈德华, 等. 深度长尾学习研究综述[J]. 自动化学报, 2025, 51(5): 1-36.[16] Choo Y H, Cai Z, Le V, et al. Multi-objective flexible job-shop scheduling with an ensemble optimisation model[C]//2022 IEEE Industrial Electronics and Applications Conference (IEACon). IEEE, 2022: 229-234.[17] Zhang S, Li Z, Yan S, et al. Distribution alignment: A unified framework for long-tail visual recognition[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 2361-2370.[18] Wang Q, Qu X, Jin P, et al. ODinMJ: A red, green, blue-thermal dataset for mountain jungle object detection[J]. IEEE Geoscience and Remote Sensing Magazine, 2024.[19] 魏秀参, 许玉燕, 杨健. 网络监督数据下的细粒度图像识别综述[J]. 中国图象图形学报, 2022, 27(7): 2057-2077.[20] Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2980-2988.[21] Cao K, Wei C, Gaidon A, et al. Learning imbalanced datasets with label-distribution-aware margin loss[J]. Advances in neural information processing systems, 2019, 32.[22] Wang Q, Yin C, Song H, et al. UTFNet: Uncertainty-guided trustworthy fusion network for RGB-thermal semantic segmentation[J]. IEEE Geoscience and Remote Sensing Letters, 2023, 20: 1-5. |