Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
AI-Driven Pattern Recognition and Object Detection: Emphasis on Precision Agriculture
Blekinge Institute of Technology, Faculty of Engineering, Department of Mathematics and Natural Sciences.ORCID iD: 0000-0001-9054-4746
2025 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Precision agriculture utilizes Artificial Intelligence (AI) and remote sensing to address challenges in sustainable food production. A significant obstacle is effective weed management, particularly for species like black-grass (Alopecurus myosuroides), which visually resembles wheat. This thesis introduces AI-driven methods using Unmanned Aerial Vehicle (UAV)-based multispectral image analysis to improve weed detection, addressing key challenges such as limited datasets, sensor misalignment, and the need for robust algorithms.

The first part of the thesis addresses data scarcity using synthetic data augmentation. A study using a Mask Region-based Convolutional Neural Network (Mask R-CNN) model for black-grass detection shows that scaling foreground objects is a particularly effective technique for improving performance, providing a basis for adapting models to new field conditions with less data.

The second part focuses on multispectral data quality, where sensor misalignment can corrupt Vegetation Indices (VIs) like Normalized Difference Vegetation Index (NDVI). A method for sensor calibration and image registration is presented to correct for these errors. The study then evaluates various VIs, finding that models integrating indices like Triangular Greenness Index (TGI) and Excess Green Index (ExG) show improved detection performance compared to using only the basic spectral bands, particularly between crop rows.

Finally, the third part draws parallels to biometrics to explore the robustness of different network architectures. By evaluating triplet-based and Siamese networks for fingerprint recognition, it is demonstrated that clustering-based approaches using contrastive loss offer better performance with incomplete and variable data. This insight highlights broader principles for creating robust recognition systems.

Overall, this thesis contributes methods for addressing key obstacles in precision agriculture, spanning data augmentation, sensor calibration, and the optimization of deep learning models. The findings of this work aim to contribute to the ongoing development of more effective agricultural practices.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2025. , p. 96
Series
Blekinge Institute of Technology Licentiate Dissertation Series, ISSN 1650-2140 ; 2025:10
Keywords [en]
Precision agriculture, Weed detection, Deep learning, Object detection, Multispectral imaging, Vegetation indices, Unmanned Aerial Vehicles (UAVs), Synthetic data augmentation
National Category
Computer and Information Sciences Agriculture, Forestry and Fisheries Artificial Intelligence
Research subject
Systems Engineering
Identifiers
URN: urn:nbn:se:bth-28605ISBN: 978-91-7295-510-3 (print)OAI: oai:DiVA.org:bth-28605DiVA, id: diva2:1996628
Presentation
2025-10-08, C413A, Valhallavägen 10, Karlskrona, 10:00 (English)
Opponent
Supervisors
Note

Paper III is excluded from the attached file because of being submitted for publication in a journal.

Available from: 2025-09-11 Created: 2025-09-10 Last updated: 2025-09-30Bibliographically approved
List of papers
1. Effects of Foreground Augmentations in Synthetic Training Data on the Use of UAVs for Weed Detection
Open this publication in new window or tab >>Effects of Foreground Augmentations in Synthetic Training Data on the Use of UAVs for Weed Detection
2024 (English)In: Proceedings of Machine Learning Research / [ed] Lutchyn T., Rivera A.R., Ricaud B., ML Research Press , 2024, Vol. 233Conference paper, Published paper (Refereed)
Abstract [en]

This study addresses the issue of black-grass, a herbicide-resistant weed that threatens wheat yields in Western Europe, through the use of high- resolution Unmanned Aerial Vehicles (UAVs) and synthetic data augmentation in precision agriculture. We mitigate challenges such as the need for large labeled datasets and environmental variability by employing synthetic data augmentations in training a Mask R-CNN model. Using a minimal dataset of 43 black-grass and 12 wheat field images, we achieved a 37% increase in Area Under the Curve (AUC) over the non-augmented baseline, with scaling as the most effective augmentation. The best model attained a recall of 53% at a precision of 64%, offering a promising approach for future precision agriculture applications. © NLDL 2024. All rights reserved.

Place, publisher, year, edition, pages
ML Research Press, 2024
Series
Proceedings of Machine Learning Research (PMLR), E-ISSN 2640-3498 ; 233
Keywords
Aircraft detection, Antennas, Large datasets, Unmanned aerial vehicles (UAV), Weed control, Aerial vehicle, Data augmentation, Herbicide resistant weeds, High resolution, Precision Agriculture, Synthetic data, Synthetic training data, Weed detection, Western Europe, Wheat yield
National Category
Computer graphics and computer vision Agricultural Science
Identifiers
urn:nbn:se:bth-26100 (URN)001221156400012 ()2-s2.0-85189301466 (Scopus ID)
Conference
5th Northern Lights Deep Learning Conference, NLDL 2024, Tromso, 9 January through 11 January 2024
Available from: 2024-04-12 Created: 2024-04-12 Last updated: 2025-09-30Bibliographically approved
2. Multispectral Image Registration and Sensor Calibration for Low-Altitude Agricultural Drones
Open this publication in new window or tab >>Multispectral Image Registration and Sensor Calibration for Low-Altitude Agricultural Drones
2024 (English)In: International Geoscience and Remote Sensing Symposium (IGARSS), Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 6209-6213Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents a crucial multispectral image registration and sensor calibration method for an agricultural application. The multispectral images are obtained using a special drone equipped with multiple cameras flying at low altitudes. However, the distance between lenses, the lens distortions and the low-altitude flights lead to a lack of alignment in the built-in normalized difference vegetation index (NDVI). This lack of alignment results in a very poor performance in further analysis, especially for image segmentation and target detection to distinguish crops from invasive plants. In this work, we point out the importance of reducing this misalignment. To do so, the near-infrared and red sensors are first calibrated to remove the lens distortions. Then, the corresponding keypoints are utilized to calculate the transformation matrix and to minimize the back-projection error. The registered near-infrared and red images are then used to compute NDVI. The experimental results show higher alignment and F1-score of 0.73 which is a significant improvement in the performance of a trained deep neural network using NDVI in the detection of invasive plants. This is particularly a challenging task as the invasive plants resemble the desired crops.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Series
IEEE International Geoscience and Remote Sensing Symposium proceedings, ISSN 2153-6996, E-ISSN 2153-7003
Keywords
Multispectral image registration, near-infrared image, normalized difference vegetation index (NDVI), sensor calibration, unmanned aerial vehicles, Aircraft detection, Drones, Image enhancement, Image registration, Image segmentation, Aerial vehicle, Images registration, Invasive plants, Multispectral images, Near- infrared images, Normalized difference vegetation index, Unmanned aerial vehicle, Deep neural networks
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:bth-27104 (URN)10.1109/IGARSS53475.2024.10642360 (DOI)001415226901020 ()2-s2.0-85208505152 (Scopus ID)
Conference
IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024, Athens, July 7-12, 2024
Available from: 2024-11-18 Created: 2024-11-18 Last updated: 2026-01-07Bibliographically approved
3. Impact of Multispectral Imaging and Vegetation Indices on Neural Network Performance for Weed Detection Using Agricultural UAVs
Open this publication in new window or tab >>Impact of Multispectral Imaging and Vegetation Indices on Neural Network Performance for Weed Detection Using Agricultural UAVs
(English)Manuscript (preprint) (Other academic)
National Category
Computer and Information Sciences Agriculture, Forestry and Fisheries
Research subject
Systems Engineering
Identifiers
urn:nbn:se:bth-28604 (URN)
Available from: 2025-09-10 Created: 2025-09-10 Last updated: 2025-09-30Bibliographically approved
4. Impact of Neural Network Architecture for Fingerprint Recognition
Open this publication in new window or tab >>Impact of Neural Network Architecture for Fingerprint Recognition
2024 (English)In: Intelligent Systems and Pattern Recognition: Third International Conference, ISPR 2023, Hammamet, Tunisia, May 11–13, 2023, Revised Selected Papers, Part I / [ed] Akram Bennour, Ahmed Bouridane, Lotfi Chaari, Springer, 2024, Vol. 1940, p. 3-14Conference paper, Published paper (Refereed)
Abstract [en]

This work investigates the impact of the neural networks architecture when performing fingerprint recognition. Three networks are studied; a Triplet network and two Siamese networks. They are evaluated on datasets with specified amounts of relative translation between fingerprints. The results show that the Siamese model based on contrastive loss performed best in all evaluated metrics. Moreover, the results indicate that the network with a categorical scheme performed inferior to the other models, especially in recognizing images with high confidence. The Equal Error Rate (EER) of the best model ranged between 4%−11% which was on average 6.5 percentage points lower than the categorical schemed model. When increasing the translation between images, the networks were predominantly affected once the translation reached a fourth of the image. Our work concludes that architectures designed to cluster data have an advantage when designing an authentication system based on neural networks.

Place, publisher, year, edition, pages
Springer, 2024
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 1940
Keywords
Fingerprint recognition, Neural network architecture, Siamese network
National Category
Computer graphics and computer vision
Research subject
Systems Engineering
Identifiers
urn:nbn:se:bth-25604 (URN)10.1007/978-3-031-46335-8_1 (DOI)2-s2.0-85177185075 (Scopus ID)978-3-031-46334-1 (ISBN)978-3-031-46335-8 (ISBN)
Conference
3rd International Conference on Intelligent Systems & Pattern Recognition, ISPR2023, Hammamet, 11/5 - 13/5 2023
Available from: 2023-11-14 Created: 2023-11-14 Last updated: 2025-09-30Bibliographically approved

Open Access in DiVA

fulltext(83028 kB)126 downloads
File information
File name FULLTEXT01.pdfFile size 83028 kBChecksum SHA-512
9f5707f777aa4c9ade01ca1d83af0b6b777b2921f87c6dd4376b940723e3391f42346d36e68277dc9e0585e774b858c99d757ef2af61d8b57dfe225836f61f89
Type fulltextMimetype application/pdf

Authority records

Hallösta, Simon

Search in DiVA

By author/editor
Hallösta, Simon
By organisation
Department of Mathematics and Natural Sciences
Computer and Information SciencesAgriculture, Forestry and FisheriesArtificial Intelligence

Search outside of DiVA

GoogleGoogle Scholar
Total: 126 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1879 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf