Change search
ExportLink to record
Permanent link

Direct link
BETA

Project

Project type/Form of grant
International cooperation
Title [en]
DocPRESERV – Preserving & Processing Historical Document Images with Artificial Intelligence
Abstract [sv]
Att bevara och dela tillgång till vårt dokumentära arv, som representerar ett levande och kollektivt minne av våra samhällen, är mycket viktigt. Digitala arkiv av handskrivna formulär har för närvarande inga genomförbara eller praktiska sätt att söka efter åtkomst.SyfteProjektet syftar till att undersöka och implementera ett lämpligt IKT-baserat system för att ge verklig tillgång till de digitaliserade arkivdokumenten och ger en växande gemenskap av användare automatiserade verktyg för igenkänning, transkription och indexering av handskrivna arkivdokument.GenomförandeHuvudidén med projektet är att tillhandahålla AI-baserade lösningar för att göra digitala historiska handskrivna dokument mer tillgängliga.Utmaningen ligger i det faktum att digitaliserade dokument är av mycket komplex struktur och uppvisar varierande skrivstilar på grund av olika författare eller åldrar bland andra frågor.Projektet kommer att driva innovation inom historisk handskriven dokumentanalys och erkännande och kommer att utveckla innovativa verktyg för att förbättra kapaciteten för historisk dokumentbehandling och hämtning.
Abstract [en]
Preserving and sharing access to our documentary heritage, which represents a living and collective memory of our communities, is very important. Digital archives of handwritten forms currently have no feasible or practical means of searching for access.Aim of the projectThe project aims to investigate and implement a suitable ICT-based system to provide real access to the digitized archival documents and provide a growing community of users with automated tools for the recognition, transcription and indexing of handwritten archival documents.Implementation of the projectThe main idea of the project is to provide AI-based solutions to make digital historical handwritten documents more accessible.The challenge lies in the fact that digitized documents are of very complex structure and exhibit varying writing styles due to different authors or ages among other issues.The project will drive innovation in historical handwritten document analysis and recognition and will develop innovative tools to improve the capabilities of historical document processing and retrieval.
Publications (3 of 3) Show all publications
Khamekhem Jemni, S., Ammar, S., Souibgui, M. A., Kessentini, Y. & Cheddad, A. (2026). ST-KeyS: Self-supervised Transformer for Keyword Spotting in historical handwritten documents. Pattern Recognition, 170, Article ID 112036.
Open this publication in new window or tab >>ST-KeyS: Self-supervised Transformer for Keyword Spotting in historical handwritten documents
Show others...
2026 (English)In: Pattern Recognition, ISSN 0031-3203, E-ISSN 1873-5142, Vol. 170, article id 112036Article in journal (Refereed) Published
Abstract [en]

Keyword spotting (KWS) in historical documents is an important tool for the initial exploration of digitized collections. Nowadays, the most efficient KWS methods rely on machine learning techniques, which typically require a large amount of annotated training data. However, in the case of historical manuscripts, there is a lack of annotated corpora for training. To handle the data scarcity issue, we investigate the merits of self-supervised learning to extract useful representations of the input data without relying on human annotations and then use these representations in the downstream task. We propose ST-KeyS, a masked auto-encoder model based on vision transformers where the pretraining stage is based on the mask-and-predict paradigm without the need for labeled data. In the fine-tuning stage, the pre-trained encoder is integrated into a fine-tuned Siamese neural network model to improve feature embedding from the input images. We further improve the image representation using pyramidal histogram of characters (PHOC) embedding to create and exploit an intermediate representation of images based on text attributes. The proposed approach outperforms state-of-the-art methods trained on the same datasets in an exhaustive experimental evaluation of five widely used benchmark datasets (Botany, Alvermann Konzilsprotokolle, George Washington, Esposalles, and RIMES). 

Place, publisher, year, edition, pages
Elsevier, 2026
Keywords
Keyword spotting, Masked autoencoders, PHOC embedding, Self-supervised learning, Siamese neural networks, Visual transformers, Character recognition, History, Image representation, Labeled data, Learning algorithms, Learning systems, Neural networks, Supervised learning, Auto encoders, Embeddings, Handwritten document, Historical documents, Masked autoencoder, Neural-networks, Pyramidal histogram of character embedding, Siamese neural network, Visual transformer, Signal encoding
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:bth-28471 (URN)10.1016/j.patcog.2025.112036 (DOI)001528801600002 ()2-s2.0-105009722690 (Scopus ID)
Funder
The Swedish Foundation for International Cooperation in Research and Higher Education (STINT), AF2020-8892
Available from: 2025-08-11 Created: 2025-08-11 Last updated: 2025-09-30Bibliographically approved
Zhao, M., Hochuli, A. G. & Cheddad, A. (2021). End-to-End Approach for Recognition of Historical Digit Strings. In: Lladós J., Lopresti D., Uchida S. (Ed.), Lecture Notes in Computer Science: . Paper presented at 16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Online, 5 September 2021 - 10 September 2021 (pp. 595-609). Springer Science and Business Media Deutschland GmbH
Open this publication in new window or tab >>End-to-End Approach for Recognition of Historical Digit Strings
2021 (English)In: Lecture Notes in Computer Science / [ed] Lladós J., Lopresti D., Uchida S., Springer Science and Business Media Deutschland GmbH , 2021, p. 595-609Conference paper, Published paper (Refereed)
Abstract [en]

The plethora of digitalised historical document datasets released in recent years has rekindled interest in advancing the field of handwriting pattern recognition. In the same vein, a recently published data set, known as ARDIS, presents handwritten digits manually cropped from 15.000 scanned documents of Swedish churches’ books that exhibit various handwriting styles. To this end, we propose an end-to-end segmentation- free deep learning approach to handle this challenging ancient handwriting style of dates present in the ARDIS dataset (4-digits long strings). We show that with slight modifications in the VGG-16 deep model, the framework can achieve a recognition rate of 93.2%, resulting in a feasible solution free of heuristic methods, segmentation, and fusion methods. Moreover, the proposed approach outperforms the well-known CRNN method (a model widely applied in handwriting recognition tasks). © 2021, Springer Nature Switzerland AG.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2021
Series
Lecture Notes in Computer Science, ISSN 03029743, E-ISSN 16113349 ; 12823
Keywords
Handwriting digit string recognition, Historical document processing, Segmentation-free, Character recognition, Copying, Deep learning, History, Data set, Document datasets, End to end, Handwriting Styles, Handwritten digit, Historical documents, Swedishs, Heuristic methods
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:bth-22169 (URN)10.1007/978-3-030-86334-0_39 (DOI)000711646700039 ()2-s2.0-85115317825 (Scopus ID)9783030863333 (ISBN)
Conference
16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Online, 5 September 2021 - 10 September 2021
Funder
The Swedish Foundation for International Cooperation in Research and Higher Education (STINT), AF2020-8892
Note

open access

Available from: 2021-10-01 Created: 2021-10-01 Last updated: 2025-09-30Bibliographically approved
Cheddad, A., Kusetogullari, H., Hilmkil, A., Sundin, L., Yavariabdi, A., Aouache, M. & Hall, J. (2021). SHIBR-The Swedish Historical Birth Records: a semi-annotated dataset. Neural Computing & Applications, 33(22), 15863-15875
Open this publication in new window or tab >>SHIBR-The Swedish Historical Birth Records: a semi-annotated dataset
Show others...
2021 (English)In: Neural Computing & Applications, ISSN 0941-0643, E-ISSN 1433-3058, Vol. 33, no 22, p. 15863-15875Article in journal (Refereed) Published
Abstract [en]

This paper presents a digital image dataset of historical handwritten birth records stored in the archives of several parishes across Sweden, together with the corresponding metadata that supports the evaluation of document analysis algorithms' performance. The dataset is called SHIBR (the Swedish Historical Birth Records). The contribution of this paper is twofold. First, we believe it is the first and the largest Swedish dataset of its kind provided as open access (15,000 high-resolution colour images of the era between 1800 and 1840). We also perform some data mining of the dataset to uncover some statistics and facts that might be of interest and use to genealogists. Second, we provide a comprehensive survey of contemporary datasets in the field that are open to the public along with a compact review of word spotting techniques. The word transcription file contains 17 columns of information pertaining to each image (e.g., child's first name, birth date, date of baptism, father's first/last name, mother's first/last name, death records, town, job title of the father/mother, etc.). Moreover, we evaluate some deep learning models, pre-trained on two other renowned datasets, for word spotting in SHIBR. However, our dataset proved challenging due to the unique handwriting style. Therefore, the dataset could also be used for competitions dedicated to a large set of document analysis problems, including word spotting.

Place, publisher, year, edition, pages
Springer London, 2021
Keywords
Historical data of birth recordsHandwritten documentsPublic datasetWord spotting
National Category
Public Health, Global Health and Social Medicine Computer Sciences
Identifiers
urn:nbn:se:bth-22072 (URN)10.1007/s00521-021-06207-z (DOI)000667130400001 ()
Funder
Knowledge Foundation, 20140032The Swedish Foundation for International Cooperation in Research and Higher Education (STINT), AF2020-8892
Note

open access

Available from: 2021-09-02 Created: 2021-09-02 Last updated: 2025-09-30Bibliographically approved
Principal InvestigatorCheddad, Abbas
Coordinating organisation
Blekinge Institute of Technology
Funder
Period
2021-05-01 - 2022-04-29
National Category
Computer Sciences
Identifiers
DiVA, id: project:9548Project, id: AF2020-8892

Search in DiVA

Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

Link to external project page