Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
UAV detection in airborne optic videos using dilated convolutions
KTO Karatay University, TUR.
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0001-7536-3349
KTO Karatay University, TUR.
2021 (English)In: Journal of Optics (India), ISSN 0972-8821, Vol. 50, no 4, p. 569-582Article in journal (Refereed) Published
Abstract [en]

In this paper, a real-time unmanned aerial vehicles (UAVs) detection framework is proposed for GPU embedded applications. To achieve this, this paper proposed a new modified model based on You Only Look Once (YOLO) to detect multi-UAV in aerial images with a complex background. The proposed CNN architecture uses five inception module and dilated convolution with two different factors to increase detection accuracy of YOLOv3 tiny, while preserving its computational time. To further improve the detection accuracy, Generalized Intersection over Union loss is replaced with the bounding box regression loss in the original YOLOv3 tiny loss function. To obtain higher frame rate, scalable kernel correlation filter (sKCF) algorithm is integrated to the detection model. More specifically, the proposed UAV detection method is used to initialize the sKCF tracker at every nth frame. Thus, the detected UAVs can be detected in intermediate frames with a low memory footprint. The proposed model and compared models are trained and tested on a variety of training and test airborne videos, captured under different outdoor scenarios. The average precision of the proposed model is 0.8265 and achieves 32 FPS performance on Jetson-TX2. The results show that the proposed model which is widened with inception module using dilated convolutions has a very high performance in terms of accuracy and speed even when it is compared with deep CNN architectures such as Darknet 53. © 2021, The Optical Society of India.

Place, publisher, year, edition, pages
Springer, 2021. Vol. 50, no 4, p. 569-582
Keywords [en]
CNN, Deep learning, Detection, Dilated convolution, GIoU, Unmanned aerial vehicle, Antennas, Convolution, Unmanned aerial vehicles (UAV), Complex background, Computational time, Correlation filters, Detection accuracy, Detection framework, Detection methods, Embedded application, Generalized intersection, Aircraft detection
National Category
Computer Sciences Control Engineering
Identifiers
URN: urn:nbn:se:bth-22173DOI: 10.1007/s12596-021-00770-3ISI: 000696751200001Scopus ID: 2-s2.0-85115090468OAI: oai:DiVA.org:bth-22173DiVA, id: diva2:1599554
Available from: 2021-10-01 Created: 2021-10-01 Last updated: 2022-01-11Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Kusetogullari, Hüseyin

Search in DiVA

By author/editor
Kusetogullari, Hüseyin
By organisation
Department of Computer Science
Computer SciencesControl Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 59 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf