Performance Analysis of kNN Query Processing on large datasets using CUDA & Pthreads: comparing between CPU & GPU
2017 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Telecom companies do a lot of analytics to provide consumers a better service and to stay in competition. These companies accumulate special big data that has potential to provide inputs for business. Query processing is one of the major tool to fire analytics at their data.
Traditional query processing techniques which follow in-memory algorithm cannot cope up with the large amount of data of telecom operators. The k nearest neighbour technique(kNN) is best suitable method for classification and regression of large datasets. Our research is focussed on implementation of kNN as query processing algorithm and evaluate the performance of it on large datasets using single core, multi-core and on GPU.
This thesis shows an experimental implementation of kNN query processing on single core CPU, Multicore CPU and GPU using Python, P- threads and CUDA respectively. We considered different levels of sizes, dimensions and k as inputs to evaluate the performance. The experiment shows that GPU performs better than CPU single core on the order of 1.4 to 3 times and CPU multi-core on the order of 5.8 to 16 times for different levels of inputs.
Place, publisher, year, edition, pages
2017. , p. 60
Keywords [en]
GPU, Multicore CPU, Parallel computing, Performance, Single core CPU, kNN, Query Processing
National Category
Telecommunications
Identifiers
URN: urn:nbn:se:bth-14758OAI: oai:DiVA.org:bth-14758DiVA, id: diva2:1120683
Subject / course
ET2580 Master's Thesis (120 credits) in Electrical Engineering with emphasis on Telecommunication Systems
Educational program
ETATX Master of Science Programme in Electrical Engineering with emphasis on Telecommunication Systems
2017-07-072017-07-062017-07-07Bibliographically approved