Email Classification with Machine Learning and Word Embeddings for Improved Customer Support
2018 (English)Independent thesis Advanced level (professional degree), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Classifying emails into distinct labels can have a great impact on customer support. By using machine learning to label emails the system can set up queues containing emails of a specific category. This enables support personnel to handle request quicker and more easily by selecting a queue that match their expertise.
This study aims to improve the manually defined rule based algorithm, currently implemented at a large telecom company, by using machine learning. The proposed model should have higher F1-score and classification rate. Integrating or migrating from a manually defined rule based model to a machine learning model should also reduce the administrative and maintenance work. It should also make the model more flexible.
By using the frameworks, TensorFlow, Scikit-learn and Gensim, the authors conduct five experiments to test the performance of several common machine learning algorithms, text-representations, word embeddings and how they work together.
In this article a web based interface were implemented which can classify emails into 33 different labels with 0.91 F1-score using a Long Short Term Memory network.
The authors conclude that Long Short Term Memory networks outperform other non-sequential models such as Support Vector Machines and ADABoost when predicting labels for emails.
Place, publisher, year, edition, pages
2018.
Keywords [en]
Email Classification, Machine Learning, Long Short Term Memory, Natural Language Processing
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:bth-15946OAI: oai:DiVA.org:bth-15946DiVA, id: diva2:1189491
External cooperation
Telenor
Subject / course
Degree Project in Master of Science in Engineering 30.0
Educational program
DVACD Master of Science in Computer Security
Supervisors
Examiners
2018-03-122018-03-112022-05-12Bibliographically approved