Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Analysis and text classification of privacy policies from rogue and top-100 fortune global companies
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datalogi och datorsystemteknik.ORCID-id: 0000-0002-9316-4842
Blekinge Tekniska Högskola, Fakulteten för datavetenskaper, Institutionen för datalogi och datorsystemteknik.
2019 (Engelska)Ingår i: International Journal of Information Security and Privacy, ISSN 1930-1650, E-ISSN 1930-1669, Vol. 13, nr 2, s. 47-66Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In the present article, the authors investigate to what extent supervised binary classification can be used to distinguish between legitimate and rogue privacy policies posted on web pages. 15 classification algorithms are evaluated using a data set that consists of 100 privacy policies from legitimate websites (belonging to companies that top the Fortune Global 500 list) as well as 67 policies from rogue websites. A manual analysis of all policy content was performed and clear statistical differences in terms of both length and adherence to seven general privacy principles are found. Privacy policies from legitimate companies have a 98% adherence to the seven privacy principles, which is significantly higher than the 45% associated with rogue companies. Out of the 15 evaluated classification algorithms, Naïve Bayes Multinomial is the most suitable candidate to solve the problem at hand. Its models show the best performance, with an AUC measure of 0.90 (0.08), which outperforms most of the other candidates in the statistical tests used. Copyright © 2019, IGI Global.

Ort, förlag, år, upplaga, sidor
IGI Global , 2019. Vol. 13, nr 2, s. 47-66
Nyckelord [en]
Classification, Classification algorithms, Information security, Machine learning, Privacy policies, Privacy policy data set, Data mining, Data privacy, Learning systems, Security of data, Text processing, Websites, Binary classification, Classification algorithm, Data set, Fortune global 500, Privacy principle, Statistical differences, Text classification, Classification (of information)
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:bth-17875DOI: 10.4018/IJISP.2019040104ISI: 000467764600004Scopus ID: 2-s2.0-85064536690OAI: oai:DiVA.org:bth-17875DiVA, id: diva2:1313083
Tillgänglig från: 2019-05-02 Skapad: 2019-05-02 Senast uppdaterad: 2025-09-30Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Boldt, Martin

Sök vidare i DiVA

Av författaren/redaktören
Boldt, MartinRekanar, Kaavya
Av organisationen
Institutionen för datalogi och datorsystemteknik
I samma tidskrift
International Journal of Information Security and Privacy
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 637 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf