Home>Schools

  • Complex
  • Title
  • Author
  • Keyword
  • Abstract
  • Scholars
Search

[期刊]

Swin-Transformer-Enabled YOLOv5 with Attention Mechanism for Small Object Detection on Satellite Images

Share
Edit Delete Claim

Author:

Gong, Hang (Gong, Hang.) | Mu, Tingkui (Mu, Tingkui.) | Li, Qiuxia (Li, Qiuxia.) | Unfold

Indexed by:

SCIE Scopus Web of Science

Abstract:

Object detection has made tremendous progress in natural images over the last decade. However, the results are hardly satisfactory when the natural image object detection algorithm is directly applied to satellite images. This is due to the intrinsic differences in the scale and orientation of objects generated by the bird's-eye perspective of satellite photographs. Moreover, the background of satellite images is complex and the object area is small; as a result, small objects tend to be missing due to the challenge of feature extraction. Dense objects overlap and occlusion also affects the detection performance. Although the self-attention mechanism was introduced to detect small objects, the computational complexity increased with the image's resolution. We modified the general one-stage detector YOLOv5 to adapt the satellite images to resolve the above problems. First, new feature fusion layers and a prediction head are added from the shallow layer for small object detection for the first time because it can maximally preserve the feature information. Second, the original convolutional prediction heads are replaced with Swin Transformer Prediction Heads (SPHs) for the first time. SPH represents an advanced self-attention mechanism whose shifted window design can reduce the computational complexity to linearity. Finally, Normalization-based Attention Modules (NAMs) are integrated into YOLOv5 to improve attention performance in a normalized way. The improved YOLOv5 is termed SPH-YOLOv5. It is evaluated on the NWPU-VHR10 dataset and DOTA dataset, which are widely used for satellite image object detection evaluations. Compared with the basal YOLOv5, SPH-YOLOv5 improves the mean Average Precision (mAP) by 0.071 on the DOTA dataset.

Keyword:

deep learning object detection satellite images self-attention mechanism Swin transformer

Author Community:

  • [ 1 ] [Gong, Hang]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 2 ] [Mu, Tingkui]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 3 ] [Li, Qiuxia]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 4 ] [Wang, Wenjing]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 5 ] [Han, Feng]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 6 ] [Tuniyazi, Abudusalamu]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 7 ] [Li, Haoyang]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 8 ] [Lang, Xuechan]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 9 ] [Li, Zhiyuan]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 10 ] [Wang, Bin]Xi An Jiao Tong Univ, Res Ctr Space Opt & Astron, Sch Phys, MOE Key Lab Nonequilibrium Synth & Modulat Conden, Xian 710049, Peoples R China
  • [ 11 ] [Dai, Haishan]Shanghai Acad Spaceflight Technol, Shanghai Inst Satellite Engn, Shanghai 201109, Peoples R China
  • [ 12 ] [Li, Chunlai]Chinese Acad Sci, Shanghai Inst Tech Phys, Shanghai 200083, Peoples R China
  • [ 13 ] [He, Zhiping]Chinese Acad Sci, Shanghai Inst Tech Phys, Shanghai 200083, Peoples R China

Reprint Author's Address:

  • T. Mu;;MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Research Center for Space Optics and Astronomy, School of Physics, Xi’an Jiaotong University, Xi’an, 710049, China;;email: tkmu@mail.xjtu.edu.cn;;

Show more details

Source :

REMOTE SENSING

Year: 2022

Issue: 12

Volume: 14

4 . 8 4 8

JCR@2020

ESI Discipline: GEOSCIENCES;

ESI HC Threshold:6

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 121

30 Days PV: 3

FAQ| About| Online/Total:309/217716829
Address:XI'AN JIAOTONG UNIVERSITY LIBRARY(No.28, Xianning West Road, Xi'an, Shaanxi Post Code:710049) Contact Us:029-82667865
Copyright:XI'AN JIAOTONG UNIVERSITY LIBRARY Technical Support:Beijing Aegean Software Co., Ltd.