Recurrent Residual Deformable Conv Unit and Multi-Head with Channel Self-Attention Based on U-Net for Building Extraction from Remote Sensing Images

Wenling Yu; Bo Liu; Hua Liu; Guohua Gou

2023

Considering the challenges associated with accurately identifying building shape features and distinguishing between building and non-building features during the extraction of buildings from remote sensing images using deep learning, we propose a novel method for building extraction based on U-Net, incorporating a recurrent residual deformable convolution unit (RDCU) module and augmented multi-head self-attention (AMSA). By replacing conventional convolution modules with an RDCU, which adopts a deformable convolutional neural network within a residual network structure, the proposed method enhances the module’s capacity to learn intricate details such as building shapes. Furthermore, AMSA is introduced into the skip connection function to enhance feature expression and positions through content–position enhancement operations and content–content enhancement operations. Moreover, AMSA integrates an additional fusion channel attention mechanism to aid in identifying cross-channel feature expression differences. For the Massachusetts dataset, the proposed method achieves an Intersection over Union (IoU) score of 89.99%, a Pixel Accuracy (PA) score of 93.62%, and a Recall score of 89.22%. For the WHU Satellite dataset I, the proposed method achieves an IoU score of 86.47%, a PA score of 92.45%, and a Recall score of 91.62%. For the INRIA dataset, the proposed method achieves an IoU score of 80.47%, a PA score of 90.15%, and a Recall score of 85.42%.
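The abstract reports IoU, Pixel Accuracy (PA), and Recall but this record does not give their formulas. The following is a minimal sketch that assumes the standard binary-segmentation definitions (building = 1, background = 0). The function names `confusion_counts` and `building_metrics` and the toy masks are hypothetical illustrations, not taken from the paper.

```python
from typing import Dict, Sequence, Tuple

Mask = Sequence[Sequence[int]]  # 2-D grid of 0 (background) / 1 (building)


def confusion_counts(pred: Mask, truth: Mask) -> Tuple[int, int, int, int]:
    """Count true positives, false positives, false negatives, true negatives."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def building_metrics(pred: Mask, truth: Mask) -> Dict[str, float]:
    """Assumed standard definitions; the paper may average differently."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    total = tp + fp + fn + tn
    union = tp + fp + fn
    return {
        # IoU = TP / (TP + FP + FN) for the building class
        "iou": tp / union if union else 0.0,
        # PA = (TP + TN) / all pixels
        "pa": (tp + tn) / total if total else 0.0,
        # Recall = TP / (TP + FN)
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Toy 3x4 masks, purely illustrative.
truth = [[1, 1, 0, 0],
         [1, 1, 0, 0],
         [0, 0, 0, 0]]
pred = [[1, 1, 1, 0],
        [1, 0, 0, 0],
        [0, 0, 0, 0]]

metrics = building_metrics(pred, truth)
print(metrics)
```

On the toy masks there are 3 true positives, 1 false positive, 1 false negative, and 7 true negatives, so the sketch gives IoU = 3/5, PA = 10/12, and Recall = 3/4. Whether the reported scores are computed per image or over a whole test set is not stated in this record.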

AGROVOC keywords

remote sensing

Bibliographic information

Published in

Remote Sensing

Volume 15, Issue 20, ISSN 2072-4292

Publisher

Multidisciplinary Digital Publishing Institute

Other subjects

Multi-head self-attention; Recurrent residual convolution; Building extraction; U-Net

Language

English

Type

Journal Article

In AGRIS since: 2025-07-18

Date modified: 2025-10-23

File type: AGRIS AP

Data provider

This record was provided by Multidisciplinary Digital Publishing Institute

Links

DOI https://www.mdpi.com/2072-4292/15/20/5048/pdf
