Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
A Comparison of Advanced DeepLearning Algorithms for Multi-digit Detection in Historical Documents
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datavetenskap.
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datavetenskap.
2023 (Engelska)Självständigt arbete på avancerad nivå (masterexamen), 20 poäng / 30 hpStudentuppsats (Examensarbete)
Abstract [en]

Background: Historical handwritten documents are assets for future generations and should be appropriately secured, so handwritten digit detection plays an important role to preserve them. Handwritten digit detection is a fundamental problem and has been studied extensively for many years. While earlier methods were effective to some extent, they often required domain expertise and extensive parameter tuning, making them time-consuming and difficult to generalize to new data. The development of deep learning techniques has led to significant improvements in handwritten digit recognition. They can automatically learn relevant features from raw image data, making them more robust to variations in handwriting styles.

Objectives: This study first considers a few deep learning algorithms for detecting the digits, considering the different challenges of the handwritten digits, and then finds the best algorithm among them using metrics to know which is the best-performing deep learning model.

Methods: Literature Review and experimentation are the research methodologies employed in this study. We have chosen four advanced deep-learning methods(YOLOV5, Faster R-CNN, RetinaNet, YOLOV7) to identify handwritten digits in digit string images. Each method is trained and tested using the DIDA dataset. Performance evaluation is conducted to determine the best method based on all analyses from experiments on all selected DL methods.

Results: The augmented and digit string datasets are used for training and testing the deep learning models. The chosen models are evaluated using metrics for an efficient model. The results from the experimental evaluation show the best deep learning model among the selected models for detecting multi-digit strings in historical handwritten digit images.

Conclusions: Results obtained from the performance metrics of the respective algorithm justify that the YOLOv7 algorithm has more efficiency and accuracy compared to YOLOv5, RetinaNet, and Faster R-CNN for the detection of handwritten digits in historical document images.

Ort, förlag, år, upplaga, sidor
2023.
Nyckelord [en]
Deep Learning Methods, Handwritten digits, Digit Detection, Image processing on document images, YOLOV5, Faster R-CNN, RetinaNet, YOLOV7
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:bth-24323OAI: oai:DiVA.org:bth-24323DiVA, id: diva2:1740872
Ämne / kurs
DV2572 Masterarbete i Datavetenskap
Utbildningsprogram
DVADA Plan för kvalifikation till masterexamen inom datavetenskap
Presentation
2023-01-23, Karlskrona, 13:00 (Engelska)
Handledare
Examinatorer
Tillgänglig från: 2023-03-02 Skapad: 2023-03-02 Senast uppdaterad: 2025-09-30Bibliografiskt granskad

Open Access i DiVA

A comparison of Advanced deep learning algorithms for multi-digit recognition in historical documents.(2695 kB)1264 nedladdningar
Filinformation
Filnamn FULLTEXT02.pdfFilstorlek 2695 kBChecksumma SHA-512
db5fe1ecff502ac34a9d2eec58859dd88b3d3ab8967e278bea7ab966236448ddfa40041b698c8f88f5f1fe7e9b29471e3328fe629d812ec6a8f652e93cf8bdd6
Typ fulltextMimetyp application/pdf

Av organisationen
Institutionen för datavetenskap
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 1265 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 1087 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf