Python Data Odyssey: Mining User feedback from Google Play store
Northwestern Polytechnical University, China.
Emerson University, Pakistan.
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering. ORCID iD: 0000-0001-9336-4361
Chinese Academy of Sciences, China.
2024 (English) In: Data in Brief, E-ISSN 2352-3409, Vol. 54, article id 110499
Article in journal (Refereed) Published
Abstract [en]

Context

The Google Play Store is widely recognized as one of the largest platforms for downloading applications, both free and paid. Every day, millions of users visit this marketplace and share their opinions through star ratings, comments, suggestions, and other feedback. This feedback is a valuable resource for organizations, competitors, and emerging companies seeking to expand their market presence: it reveals app deficiencies, requests for new features, reported issues, and potential enhancements. Unlocking the potential of this repository of suggestions holds significant value.

Objective

This study sought to gather and analyze user reviews from the Google Play store for leading game apps. The primary aim was to construct a dataset for subsequent analysis utilizing requirements engineering, machine learning, and competitive assessment.

Methodology

The authors employed a Python-based web scraping method to extract more than 429,000 reviews from the Google Play pages of selected apps. The scraped data encompassed reviewer names (removed for privacy reasons), ratings, and the textual content of the reviews.
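
The privacy step described above (dropping reviewer names from the scraped records) can be illustrated with a minimal sketch. It is not the authors' code: the field names `reviewer_name`, `rating`, and `review_text` are hypothetical, since the abstract does not give the dataset's column names, and the two sample records are made up for illustration.

```python
# Hypothetical field names; the abstract does not state the real column names.
PRIVATE_FIELDS = frozenset({"reviewer_name"})


def anonymize_reviews(records, private_fields=PRIVATE_FIELDS):
    """Return copies of the review records with the private fields left out.

    The input records are not modified.
    """
    return [
        {key: value for key, value in record.items() if key not in private_fields}
        for record in records
    ]


# Made-up sample records standing in for scraped reviews.
sample = [
    {"reviewer_name": "Example User", "rating": 5, "review_text": "Fun game."},
    {"reviewer_name": "Another User", "rating": 2, "review_text": "Too many ads."},
]

cleaned = anonymize_reviews(sample)
```

Building new dictionaries rather than deleting keys in place keeps the raw scrape intact, which may be convenient if the anonymized export needs to be regenerated later.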

Results

The outcome was a dataset comprising the extracted user reviews, ratings, and associated metadata. More than 429,000 reviews were acquired through the scraping process for popular apps such as Subway Surfers, Candy Crush Saga, and PUBG Mobile, among others. The dataset serves as an educational resource for instructors training students in data analysis, and it gives practitioners the opportunity to examine the historical review data of top apps in depth.

Place, publisher, year, edition, pages
Elsevier, 2024. Vol. 54, article id 110499
Keywords [en]
App reviews, Crowd-source data, Data mining, NLP, User reviews
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-26183
DOI: 10.1016/j.dib.2024.110499
ISI: 001266124000002
Scopus ID: 2-s2.0-85192669911
OAI: oai:DiVA.org:bth-26183
DiVA, id: diva2:1857025
Available from: 2024-05-09 Created: 2024-05-09 Last updated: 2024-08-12 Bibliographically approved

Open Access in DiVA

fulltext (1230 kB), 70 downloads
File information
File name: FULLTEXT01.pdf
File size: 1230 kB
Checksum SHA-512: 1674a60169cb3421d478cc06de0ce232d54cfdc8de9e2968584925ea2e8a58cdf074507cec08488f4ed0cf41a08bcdceb38060018d5fe190eff93f76ceb5d0fe
Type: fulltext
Mimetype: application/pdf

Authority records

Ghazi, Ahmad Nauman
