A Comparison of Advanced Deep Learning Algorithms for Multi-digit Detection in Historical Documents
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datavetenskap.
2023 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Background: Historical handwritten documents are valuable assets for future generations and should be preserved appropriately, and handwritten digit detection plays an important role in that preservation. Handwritten digit detection is a fundamental problem that has been studied extensively for many years. Earlier methods were effective to some extent, but they often required domain expertise and extensive parameter tuning, which made them time-consuming and difficult to generalize to new data. Deep learning techniques have led to significant improvements in handwritten digit recognition: they learn relevant features automatically from raw image data, which makes them more robust to variations in handwriting style.

Objectives: This study first selects a small set of deep learning algorithms for detecting digits, taking into account the challenges specific to handwritten digits, and then compares them using performance metrics to identify the best-performing model.

Methods: The research methods employed in this study are a literature review and experimentation. We chose four advanced deep learning methods (YOLOv5, Faster R-CNN, RetinaNet, and YOLOv7) to identify handwritten digits in digit string images. Each method is trained and tested on the DIDA dataset. Performance is then evaluated across the experiments on all selected methods to determine the best one.
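
To illustrate what detecting digits in a digit string image involves downstream, the following is a minimal sketch of how per-digit detections could be assembled into a multi-digit string. It is not taken from the thesis: the `Detection` tuple layout, the `detections_to_string` helper, and the 0.5 confidence cutoff are all hypothetical and only assume that a detector returns one box, class label, and confidence score per digit.

```python
from typing import List, Tuple

# Hypothetical detection layout (an assumption, not the thesis format):
# (x_min, y_min, x_max, y_max, digit_label, confidence)
Detection = Tuple[float, float, float, float, int, float]


def detections_to_string(detections: List[Detection], min_conf: float = 0.5) -> str:
    """Order detected digits left to right and join them into a string."""
    # Drop low-confidence detections; the cutoff value is illustrative.
    kept = [d for d in detections if d[5] >= min_conf]
    # Sort by the horizontal centre of each box so reading order is left to right.
    kept.sort(key=lambda d: (d[0] + d[2]) / 2.0)
    return "".join(str(d[4]) for d in kept)


example = [
    (30.0, 0.0, 40.0, 10.0, 8, 0.90),
    (0.0, 0.0, 10.0, 10.0, 1, 0.80),
    (15.0, 0.0, 25.0, 10.0, 7, 0.95),
    (50.0, 0.0, 60.0, 10.0, 3, 0.20),  # below the cutoff, so it is ignored
]
print(detections_to_string(example))
```

Sorting by box centre is the simplest choice for a single-line digit string. Overlapping or stacked digits, which can occur in historical handwriting, would need a more careful ordering rule.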

Results: The augmented and digit string datasets are used to train and test the deep learning models. The chosen models are evaluated using performance metrics. The experimental evaluation identifies the best of the selected models for detecting multi-digit strings in historical handwritten digit images.
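
The abstract does not name the metrics used. As a hedged illustration of how object detectors are commonly scored, the sketch below computes intersection over union (IoU) between boxes and derives precision and recall by greedily matching predictions to ground-truth digits. The function names, the box layout, and the 0.5 IoU threshold are assumptions for this example and may differ from what the thesis does.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed layout


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall(
    preds: List[Tuple[Box, int, float]],   # (box, digit_label, confidence)
    truths: List[Tuple[Box, int]],         # (box, digit_label)
    iou_threshold: float = 0.5,            # illustrative threshold
) -> Tuple[float, float]:
    """Greedy matching: each ground-truth digit can be matched at most once."""
    matched = set()
    true_positives = 0
    # Visit predictions from most to least confident.
    for box, label, _conf in sorted(preds, key=lambda p: p[2], reverse=True):
        best_j, best_iou = -1, iou_threshold
        for j, (t_box, t_label) in enumerate(truths):
            if j in matched or t_label != label:
                continue
            overlap = iou(box, t_box)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched.add(best_j)
            true_positives += 1
    precision = true_positives / len(preds) if preds else 0.0
    recall = true_positives / len(truths) if truths else 0.0
    return precision, recall


truths = [((0.0, 0.0, 10.0, 10.0), 1), ((20.0, 0.0, 30.0, 10.0), 7)]
preds = [
    ((0.0, 0.0, 10.0, 10.0), 1, 0.9),
    ((21.0, 0.0, 31.0, 10.0), 7, 0.8),
    ((50.0, 0.0, 60.0, 10.0), 3, 0.7),  # no matching ground truth
]
precision, recall = precision_recall(preds, truths)
print(precision, recall)
```

Here two of the three predictions match a ground-truth digit, so precision is 2/3 and recall is 1.0. Metrics such as mean average precision build on the same matching step by sweeping the confidence threshold.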

Conclusions: The performance metrics obtained for each algorithm indicate that YOLOv7 is more efficient and accurate than YOLOv5, RetinaNet, and Faster R-CNN for detecting handwritten digits in historical document images.

Place, publisher, year, edition, pages
2023.
Keywords [en]
Deep Learning Methods, Handwritten digits, Digit Detection, Image processing on document images, YOLOv5, Faster R-CNN, RetinaNet, YOLOv7
National Category
Identifiers
URN: urn:nbn:se:bth-24323
OAI: oai:DiVA.org:bth-24323
DiVA, id: diva2:1740872
Subject / course
DV2572 Master's Thesis in Computer Science
Educational program
DVADA Master Qualification Plan in Computer Science
Presentation
2023-01-23, Karlskrona, 13:00 (English)
Supervisors
Examiner
Available from: 2023-03-02. Created: 2023-03-02. Last updated: 2025-09-30. Bibliographically approved.

Open Access in DiVA

A comparison of Advanced deep learning algorithms for multi-digit recognition in historical documents. (2695 kB) 1262 downloads
File information
File name: FULLTEXT02.pdf
File size: 2695 kB
Checksum (SHA-512):
db5fe1ecff502ac34a9d2eec58859dd88b3d3ab8967e278bea7ab966236448ddfa40041b698c8f88f5f1fe7e9b29471e3328fe629d812ec6a8f652e93cf8bdd6
Type: fulltext
Mimetype: application/pdf

Total: 1263 downloads
The number of downloads is the sum of all downloads of all full texts. It may include, for example, earlier versions that are no longer available.

Total: 1079 hits