A Comparison of Advanced Deep Learning Algorithms for Multi-digit Detection in Historical Documents
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.
2023 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Background: Historical handwritten documents are assets for future generations and should be appropriately preserved, and handwritten digit detection plays an important role in preserving them. Handwritten digit detection is a fundamental problem that has been studied extensively for many years. While earlier methods were effective to some extent, they often required domain expertise and extensive parameter tuning, which made them time-consuming to apply and difficult to generalize to new data. The development of deep learning techniques has led to significant improvements in handwritten digit recognition. Deep learning models can learn relevant features automatically from raw image data, which makes them more robust to variations in handwriting style.

Objectives: This study first selects several deep learning algorithms for detecting digits, taking into account the particular challenges of handwritten digits, and then compares them using performance metrics to determine which model performs best.

Methods: A literature review and experimentation are the research methods employed in this study. We chose four advanced deep learning methods (YOLOv5, Faster R-CNN, RetinaNet, and YOLOv7) to identify handwritten digits in digit string images. Each method is trained and tested on the DIDA dataset. The best method is then determined from a performance evaluation covering the experiments on all selected methods.

Results: The augmented and digit string datasets are used to train and test the deep learning models. The chosen models are evaluated using performance metrics, and the experimental evaluation identifies the best of the selected models for detecting multi-digit strings in historical handwritten digit images.
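
The abstract does not name the metrics used. As an illustration only, the sketch below shows one common way to score a digit detector: predicted boxes are matched to ground-truth boxes by intersection over union (IoU), and precision and recall are computed from the matches. The box format, the 0.5 threshold, and the function names `iou` and `precision_recall` are assumptions made for this example and are not taken from the thesis.

```python
from typing import List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall(
    preds: List[Tuple[Box, int, float]],   # (box, digit label, confidence)
    truths: List[Tuple[Box, int]],         # (box, digit label)
    threshold: float = 0.5,                # assumed IoU threshold
) -> Tuple[float, float]:
    """Greedy matching: highest-confidence predictions are matched first."""
    matched = set()
    true_positives = 0
    for box, label, _score in sorted(preds, key=lambda p: -p[2]):
        best_iou, best_idx = 0.0, None
        for i, (t_box, t_label) in enumerate(truths):
            # Each ground-truth digit can be matched at most once,
            # and only by a prediction with the same digit label.
            if i in matched or t_label != label:
                continue
            overlap = iou(box, t_box)
            if overlap > best_iou:
                best_iou, best_idx = overlap, i
        if best_idx is not None and best_iou >= threshold:
            matched.add(best_idx)
            true_positives += 1
    precision = true_positives / len(preds) if preds else 0.0
    recall = true_positives / len(truths) if truths else 0.0
    return precision, recall
```

Averaging such scores over a test set, or sweeping the confidence cut-off to obtain average precision, would give one basis for ranking detectors; whether the thesis follows this exact procedure is not stated in the abstract.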

Conclusions: The performance metrics obtained for each algorithm indicate that YOLOv7 is more efficient and more accurate than YOLOv5, RetinaNet, and Faster R-CNN for detecting handwritten digits in historical document images.

Place, publisher, year, edition, pages
2023.
Keywords [en]
Deep Learning Methods, Handwritten digits, Digit Detection, Image processing on document images, YOLOV5, Faster R-CNN, RetinaNet, YOLOV7
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-24323; OAI: oai:DiVA.org:bth-24323; DiVA, id: diva2:1740872
Subject / course
DV2572 Master's Thesis in Computer Science
Educational program
DVADA Master Qualification Plan in Computer Science
Presentation
2023-01-23, Karlskrona, 13:00 (English)
Supervisors
Examiners
Available from: 2023-03-02 Created: 2023-03-02 Last updated: 2023-03-02. Bibliographically approved

Open Access in DiVA

A comparison of Advanced deep learning algorithms for multi-digit recognition in historical documents. (2695 kB), 499 downloads
File information
File name: FULLTEXT02.pdf
File size: 2695 kB
Checksum (SHA-512): db5fe1ecff502ac34a9d2eec58859dd88b3d3ab8967e278bea7ab966236448ddfa40041b698c8f88f5f1fe7e9b29471e3328fe629d812ec6a8f652e93cf8bdd6
Type: fulltext
Mimetype: application/pdf
