Text Steganalysis based on Convolutional Neural Networks
2022 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE credits
Student thesis
Abstract [en]
The CNN-based steganalysis model is able to capture some complex statistical dependencies and also learn feature representations. The proposed model uses a word embedding layer to map the words into dense vectors thus, achieving more accurate representations of the words. The proposed model extracts both, the syntax and semantic features. Files having less than 200 words are referred to as short text. Preprocessing for short text is done through word segmenting and encoding the words into indexes according to the position of words in the dictionary. Once this is performed, the index sequences are fed to the CNN to learn the feature representations. Files containing over 200 words are considered as long texts. Considering the wide range of length variation of these long texts, the proposed model tokenized long texts into their sentence components with a relatively consistent length prior to preprocessing the data. Eventually, the proposed model uses a decision strategy to make the final decision to check if the text file contains stego text or not.
Place, publisher, year, edition, pages
2022.
Keywords [en]
CNN, text steganalysis, Deep Learning, Decision Strategy
National Category
Engineering and Technology Computer Sciences
Identifiers
URN: urn:nbn:se:bth-23405OAI: oai:DiVA.org:bth-23405DiVA, id: diva2:1679825
Subject / course
DV1478 Bachelor Thesis in Computer Science
Educational program
DVGDT Bachelor Qualification Plan in Computer Science 60.0 hp
Presentation
2022-05-25, J1640, Blekinge Institute of Technology, Karlskrona, 12:45 (English)
Supervisors
Examiners
2022-07-042022-07-012022-07-04Bibliographically approved