Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Predicting Airbnb Prices in European Cities Using Machine Learning
Blekinge Institute of Technology, Faculty of Computing.
Blekinge Institute of Technology, Faculty of Computing.
2023 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

Background: Machine learning is a field of computer science that focuses on creating models that can predict patterns and relations among data. In this thesis, we use machine learning to predict Airbnb prices in various European cities to help the hosts in setting reasonable prices for their properties. Different supervised machine learning algorithms will be used to determine which model will provide the highest accuracy so that hosts set profitable prices for their housing properties.

Objectives: The main goal of this thesis is to use machine learning algorithms to assist the hosts in setting reasonable rental prices for their properties so that they can keep their properties affordable for renters across Europe and achieve maximum occupancy.

Methods: The dataset for Airbnb in European cities is gathered from Kaggle and then has been pre-processed using techniques like one-hot encoding, label encoder, standardscaler and principle component analysis. The data set is divided into three parts for training, validation and testing. Next, feature selection is done to determine the most important features that contribute to the pricing, and the dimensionality of the dataset is reduced. Supervised machine learning algorithms are utilized for training. The models are evaluated with reliable performance estimates after tuning the hyperparameters using k-fold cross-validation.

Results: The feature_importance_ predicts that room capacity, type of room(shared or not), and the country appear in all three algorithms. Although scores vary between algorithms, these are among the top five attributes that influence the target variable. Day, cleanliness rating, and attr index are some other attributes that are among the top five characteristics. Among the chosen learning algorithms, the random forest regressor gave the best regression model with a R2 score of 0.70. The second best is the gradient boosting regressor with a R2 score of 0.32. While SVM gave the least score of 0.06.

Conclusions: Random forest regressor was the best algorithm for predicting the prices of Airbnb and suggests hosts setting reasonable rental prices for their properties with more accurate pricing for renters across Europe compared to other chosen models. Contrary to our expectations SVM had performed the least for this dataset.

Place, publisher, year, edition, pages
2023. , p. 41
Keywords [en]
Machine Learning, Supervised Learning, Regression Algorithms, Airbnb Price Prediction
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:bth-25055OAI: oai:DiVA.org:bth-25055DiVA, id: diva2:1777746
Subject / course
DV1478 Bachelor Thesis in Computer Science
Educational program
DVGDT Bachelor Qualification Plan in Computer Science 60.0 hp
Presentation
2023-05-23, J1650, Blekinge Tekniska Högskola, 371 79 Karlskrona, Karlskrona, 11:15 (English)
Supervisors
Examiners
Available from: 2023-07-03 Created: 2023-06-29 Last updated: 2023-07-03Bibliographically approved

Open Access in DiVA

Predicting Airbnb Prices in European Cities Using Machine Learning(1277 kB)1131 downloads
File information
File name FULLTEXT02.pdfFile size 1277 kBChecksum SHA-512
455535badc862b5ac814fb178dc36d84eb91afd7a06ec544cc13277ed413338a624428ebd18f6431d4d61b1398c17217e0efa432438480ca16ce2ca0b7edcfc7
Type fulltextMimetype application/pdf

By organisation
Faculty of Computing
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1131 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1160 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf