Customer churn prediction in telecom using machine learning and social network analysis in big data platform

AI-generated keywords: Customer churn, Predictive analytics, Machine learning, Social network analysis, Big data

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Customer churn is a major concern for large companies in the telecom industry as it directly impacts revenues.
  • Companies are increasingly using predictive analytics to identify at-risk customers to combat customer churn.
  • The study focuses on developing a churn prediction model using machine learning techniques and social network analysis (SNA) on a big data platform.
  • The key contribution of the research is the development of a predictive model that assists telecom operators in identifying customers most likely to churn, achieving an AUC value of 93.3%.
  • The 93.3% figure includes SNA features: adding them raised the model's AUC from 84% to 93.3%.
  • The study used a large dataset provided by the SyriaTel telecom company, covering nine months of customer information, to train, test, and evaluate the model.
  • Of the four algorithms tested (Decision Tree, Random Forest, GBM, and XGBOOST), Extreme Gradient Boosting (XGBOOST) was the most effective classifier for churn prediction.
  • Advanced analytics techniques can effectively address customer churn in the telecom sector by leveraging big data and machine learning.

Authors: Abdelrahim Kasem Ahmad, Assef Jafar, Kadan Aljoumaa

Journal of Big Data 2019 6:28
24 pages, 14 figures. PDF https://rdcu.be/budKg

Abstract: Customer churn is a major problem and one of the most important concerns for large companies. Because churn directly affects revenue, especially in the telecom field, companies are seeking ways to predict which customers are likely to churn. Finding the factors that increase customer churn is therefore important for taking the actions needed to reduce it. The main contribution of our work is a churn prediction model that helps telecom operators predict which customers are most likely to churn. The model uses machine learning techniques on a big data platform and introduces a new approach to feature engineering and selection. Model performance is measured with the standard Area Under Curve (AUC) metric, and the AUC value obtained is 93.3%. Another main contribution is the use of the customer social network in the prediction model, through extracted Social Network Analysis (SNA) features. Using SNA improved the model's AUC from 84% to 93.3%. The model was prepared and tested in a Spark environment on a large dataset created by transforming big raw data provided by the SyriaTel telecom company. The dataset contained all customers' information over 9 months and was used to train, test, and evaluate the system at SyriaTel. Four algorithms were tested: Decision Tree, Random Forest, Gradient Boosted Machine Tree (GBM), and Extreme Gradient Boosting (XGBOOST). The best results were obtained with the XGBOOST algorithm, which was used for classification in this churn prediction model.
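
The abstract reports model quality as AUC. As a rough illustration of what that number means, the sketch below computes AUC as the probability that a randomly chosen churner receives a higher score than a randomly chosen non-churner. The function name `auc_from_scores` and the toy labels and scores are assumptions made for this example. They are not taken from the paper or from the SyriaTel data.

```python
def auc_from_scores(labels, scores):
    """Return the AUC for binary labels (1 = churned, 0 = stayed).

    Computed as the fraction of (churner, non-churner) pairs in which the
    churner gets the higher score. Tied scores count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only.
labels = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.3, 0.6, 0.7, 0.2, 0.8]
auc = auc_from_scores(labels, scores)
print(f"AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to a perfect ranking of churners above non-churners. The pairwise loop is quadratic, so it suits small examples only. For real datasets a library routine would be the usual choice.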

Submitted to arXiv on 01 Apr. 2019

Results of the summarizing process for the arXiv paper: 1904.00690v1

This paper's license does not allow us to build upon its content, so the summary below was generated from the paper's metadata rather than the full article.

Customer churn is a major concern for large companies in the telecom industry because it directly impacts revenues. To combat it, companies are increasingly turning to predictive analytics to identify at-risk customers. This study develops a churn prediction model using machine learning techniques and social network analysis (SNA) on a big data platform.

The key contribution is a predictive model that helps telecom operators identify the customers most likely to churn. Using machine learning algorithms together with new feature engineering and selection methods, the model achieves an Area Under Curve (AUC) value of 93.3%. This figure includes SNA features: adding them raised the AUC from 84% to 93.3%.

The model was trained, tested, and evaluated on a large dataset provided by the SyriaTel telecom company, containing customer information spanning nine months. The study experimented with four algorithms (Decision Tree, Random Forest, Gradient Boosted Machine Tree (GBM), and Extreme Gradient Boosting (XGBOOST)), and XGBOOST was the most effective classifier for churn prediction.

Overall, the research shows that machine learning on a big data platform can address customer churn in the telecom sector: operators can identify at-risk customers early and apply targeted retention strategies to limit revenue loss.
Created on 24 Apr. 2024
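
The summary above credits SNA features for the gain from 84% to 93.3% AUC, but the metadata does not say which features were extracted. The sketch below is only an illustration of the general idea: it derives per-customer in-degree, out-degree, and a basic PageRank score from a list of (caller, callee) call records. The function name `sna_features`, the choice of these three features, the damping factor, and the toy call list are all assumptions and should not be read as the authors' method.

```python
from collections import defaultdict


def sna_features(calls, damping=0.85, iterations=50):
    """Build simple graph features from (caller, callee) pairs.

    Returns {customer: {"in_degree": int, "out_degree": int, "pagerank": float}}.
    Self-calls are ignored and repeated calls between a pair count once.
    """
    out_neighbors = defaultdict(set)
    in_neighbors = defaultdict(set)
    nodes = set()
    for caller, callee in calls:
        if caller == callee:
            continue
        out_neighbors[caller].add(callee)
        in_neighbors[callee].add(caller)
        nodes.update((caller, callee))
    if not nodes:
        return {}

    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by customers with no outgoing calls is spread evenly.
        dangling = sum(rank[v] for v in nodes if not out_neighbors[v])
        new_rank = {}
        for v in nodes:
            incoming = sum(rank[u] / len(out_neighbors[u]) for u in in_neighbors[v])
            new_rank[v] = (1.0 - damping) / n + damping * (incoming + dangling / n)
        rank = new_rank

    return {
        v: {
            "in_degree": len(in_neighbors[v]),
            "out_degree": len(out_neighbors[v]),
            "pagerank": rank[v],
        }
        for v in nodes
    }


# Toy call records, invented for illustration only.
calls = [("a", "b"), ("b", "c"), ("c", "a"), ("d", "a")]
features = sna_features(calls)
for customer in sorted(features):
    print(customer, features[customer])
```

In a churn pipeline, features like these would be joined to each customer's other attributes before training a classifier. The pure-Python loop is meant for readability. At the scale the summary describes, a distributed graph library would be the more realistic choice.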
