Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Cluster-based Sample Selection for Document Image Binarization
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0002-2161-7371
2019 (English)In: 2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, IEEE , 2019, p. 47-52Conference paper, Published paper (Refereed)
Abstract [en]

The current state-of-the-art, in terms of performance, for solving document image binarization is training artificial neural networks on pre-labelled ground truth data. As such, it faces the same issues as other, more conventional, classification problems; requiring a large amount of training data. However, unlike those conventional classification problems, document image binarization involves having to either manually craft or estimate the binarized ground truth data, which can be error-prone and time-consuming. This is where sample selection, the act of selecting training samples based on some method or metric, might help. By reducing the size of the training dataset in such a way that the binarization performance is not impacted, the required time spent creating the ground truth is also reduced. This paper proposes a cluster-based sample selection method that uses image similarity metrics and the relative neighbourhood graph to reduce the underlying redundancy of the dataset. The method, implemented with affinity propagation and the structural similarity index, reduces the training dataset on average by 49.57% while reducing the binarization performance only by 0.55%.

Place, publisher, year, edition, pages
IEEE , 2019. p. 47-52
Series
Proceedings of the International Conference on Document Analysis and Recognition, ISSN 1520-5363
Keywords [en]
document image binarization, sample selection, neural networks, computer vision
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-19355DOI: 10.1109/ICDARW.2019.40080ISI: 000518786800009ISBN: 978-1-7281-5054-3 OAI: oai:DiVA.org:bth-19355DiVA, id: diva2:1421150
Conference
15th IAPR International Conference on Document Analysis and Recognition (ICDAR) / 2nd Workshop of Machine Learning (WML), SEP 21-22, 2019, Sydney, AUSTRALIA
Part of project
Bigdata@BTH- Scalable resource-efficient systems for big data analytics, Knowledge Foundation
Funder
Knowledge Foundation, 20140032
Note

open access

Available from: 2020-04-02 Created: 2020-04-02 Last updated: 2021-10-07Bibliographically approved

Open Access in DiVA

fulltext(1050 kB)218 downloads
File information
File name FULLTEXT01.pdfFile size 1050 kBChecksum SHA-512
3757c88587b5c29d422334ec0fffead2d0c05ae6fd35dbdcc4bb6803f0aa17b2ca3c26307dc638d55df75e71a80ab16fa4478b58e4cb79ceeb7d87f7f97abf4b
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Westphal, Florian

Search in DiVA

By author/editor
Westphal, Florian
By organisation
Department of Computer Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 218 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 88 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf