Forecasting the movements of Bitcoin prices: an application of machine learning algorithms

AI-generated keywords: Cryptocurrencies, Bitcoin, Machine Learning, Forecasting, Performance

AI-generated Key Points

  • Cryptocurrencies, particularly Bitcoin, are controversial and complex technological innovations in today's financial system
  • Forecasting the movements of Bitcoin prices with a high degree of accuracy is an important area of research
  • Four different Machine Learning (ML) algorithms were applied simultaneously: Support Vector Machines (SVM), Artificial Neural Network (ANN), Naive Bayes (NB), and Random Forest (RF)
  • Logistic regression (LR) was used as a benchmark model for comparison purposes
  • Nine technical input parameters were used; closing, high, and low prices were used to compute technical indicators such as the simple 14-day moving average, the 14-day weighted moving average, momentum, stochastic K%, stochastic D%, and the relative strength index (RSI)
  • The study applies four different ML algorithms simultaneously to compare their performances and uses both existing continuous datasets and discrete datasets created specifically for this study
  • The study aims to understand which ML algorithm offers higher forecasting performance in continuous and discrete datasets separately and comparatively
  • RF has the highest forecasting performance in the continuous dataset while NB has the lowest performance. On the other hand, ANN has the highest performance while NB has the lowest in the discrete dataset.
  • Using a discrete dataset improves overall forecasting performance across all models estimated.
  • A validation dataset consisting of 100 Bitcoin series from June 2020 to October 2020 was used to compare each model's performance with the benchmark LR statistics using t-tests.
  • Accuracy statistics were calculated for both continuous and discrete datasets.
  • This study provides valuable insights into the forecasting of Bitcoin prices using ML algorithms.
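
The technical indicators named in the key points can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' code: the function names, the 3-period smoothing used for stochastic D%, and the simple-average form of RSI are assumptions, since the summary does not give the exact formulas.

```python
def sma(values, n=14):
    """Simple moving average; None until n observations are available."""
    return [None if i + 1 < n else sum(values[i + 1 - n:i + 1]) / n
            for i in range(len(values))]


def wma(values, n=14):
    """Weighted moving average: newest value has weight n, oldest weight 1."""
    denom = n * (n + 1) / 2
    out = []
    for i in range(len(values)):
        if i + 1 < n:
            out.append(None)
        else:
            window = values[i + 1 - n:i + 1]
            out.append(sum(w * v for w, v in zip(range(1, n + 1), window)) / denom)
    return out


def momentum(close, n=14):
    """Current close minus the close n periods earlier."""
    return [None if i < n else close[i] - close[i - n] for i in range(len(close))]


def stochastic_k(close, high, low, n=14):
    """Position of the close inside the n-period high-low range, 0 to 100."""
    out = []
    for i in range(len(close)):
        if i + 1 < n:
            out.append(None)
            continue
        hh = max(high[i + 1 - n:i + 1])
        ll = min(low[i + 1 - n:i + 1])
        # A flat range has no defined position; 50.0 is an arbitrary neutral value.
        out.append(50.0 if hh == ll else 100.0 * (close[i] - ll) / (hh - ll))
    return out


def stochastic_d(k_values, m=3):
    """Moving average of %K over m periods (m=3 is an assumed default)."""
    out = []
    for i in range(len(k_values)):
        window = k_values[max(0, i + 1 - m):i + 1]
        if i + 1 < m or any(v is None for v in window):
            out.append(None)
        else:
            out.append(sum(window) / m)
    return out


def rsi(close, n=14):
    """RSI from simple sums of gains and losses over the last n changes."""
    out = [None] * len(close)
    for i in range(n, len(close)):
        changes = [close[j] - close[j - 1] for j in range(i - n + 1, i + 1)]
        gain = sum(c for c in changes if c > 0)
        loss = sum(-c for c in changes if c < 0)
        # With no losses in the window the index is set to its upper bound.
        out[i] = 100.0 if loss == 0 else 100.0 - 100.0 / (1.0 + gain / loss)
    return out


if __name__ == "__main__":
    # Made-up prices, for illustration only.
    close = [10.0, 11.0, 10.5, 11.5, 12.0, 11.0]
    high = [c + 0.5 for c in close]
    low = [c - 0.5 for c in close]
    k = stochastic_k(close, high, low, n=3)
    print(sma(close, 3), wma(close, 3), momentum(close, 3))
    print(k, stochastic_d(k), rsi(close, 3))
```

Each function returns a list aligned with the input series, with None wherever the look-back window is not yet full, so the indicator columns can be joined row by row into a feature table.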

Authors: Hakan Pabuccu, Serdar Ongan, Ayse Ongan

arXiv: 2303.04642v1 (q-fin.CP)
14 pages, 2 figures and 15 tables
License: CC BY 4.0

Abstract: Cryptocurrencies, such as Bitcoin, are one of the most controversial and complex technological innovations in today's financial system. This study aims to forecast the movements of Bitcoin prices at a high degree of accuracy. To this aim, four different Machine Learning (ML) algorithms are applied, namely, the Support Vector Machines (SVM), the Artificial Neural Network (ANN), the Naive Bayes (NB) and the Random Forest (RF), besides the logistic regression (LR) as a benchmark model. In order to test these algorithms, a discrete dataset was created and used alongside the existing continuous dataset. For the evaluations of algorithm performances, the F statistic, accuracy statistic, the Mean Absolute Error (MAE), the Root Mean Square Error (RMSE) and the Root Absolute Error (RAE) metrics were used. The t test was used to compare the performances of the SVM, ANN, NB and RF with the performance of the LR. Empirical findings reveal that, while the RF has the highest forecasting performance in the continuous dataset, the NB has the lowest. On the other hand, the ANN has the highest and the NB the lowest performance in the discrete dataset. Furthermore, the discrete dataset improves the overall forecasting performance in all algorithms (models) estimated.
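
The evaluation metrics listed in the abstract can be illustrated with a short Python sketch. Several points are assumptions rather than details from the paper: the "F statistic" is read here as the F-measure (harmonic mean of precision and recall), RAE is computed as the relative absolute error (a common expansion of that acronym, whereas the abstract expands it differently), and the error metrics are applied to predicted probabilities of an "up" move. All function names are hypothetical.

```python
import math


def accuracy(y_true, y_pred):
    """Share of predictions that match the true class label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f_measure(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def mae(y_true, y_score):
    """Mean absolute error between true labels and predicted scores."""
    return sum(abs(t - s) for t, s in zip(y_true, y_score)) / len(y_true)


def rmse(y_true, y_score):
    """Root mean square error between true labels and predicted scores."""
    return math.sqrt(sum((t - s) ** 2 for t, s in zip(y_true, y_score)) / len(y_true))


def rae(y_true, y_score):
    """Relative absolute error: total error divided by the error of always
    predicting the mean of the true labels (assumed reading of 'RAE')."""
    mean_true = sum(y_true) / len(y_true)
    baseline = sum(abs(t - mean_true) for t in y_true)
    return sum(abs(t - s) for t, s in zip(y_true, y_score)) / baseline


if __name__ == "__main__":
    # Made-up labels and scores: 1 = price moved up, 0 = price moved down.
    y_true = [1, 0, 1, 1]
    y_pred = [1, 0, 0, 1]
    y_score = [0.9, 0.2, 0.4, 0.8]
    print(accuracy(y_true, y_pred), f_measure(y_true, y_pred))
    print(mae(y_true, y_score), rmse(y_true, y_score), rae(y_true, y_score))
```

Accuracy and the F-measure work on hard class labels, while MAE, RMSE, and RAE work on numeric scores, so a classifier that outputs probabilities can be scored with all five.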

Submitted to arXiv on 08 Mar. 2023

Results of the summarizing process for the arXiv paper: 2303.04642v1

Cryptocurrencies, particularly Bitcoin, have emerged as one of the most controversial and complex technological innovations in today's financial system, and forecasting the movements of Bitcoin prices with a high degree of accuracy has become an important area of research. This study pursues that objective by applying four Machine Learning (ML) algorithms simultaneously: Support Vector Machines (SVM), Artificial Neural Network (ANN), Naive Bayes (NB), and Random Forest (RF). Logistic regression (LR) is used as a benchmark model for comparison.

The study uses nine technical input parameters that were previously identified by Armano et al. (2005), Atsalakis & Valavanis (2009), Kara et al. (2011), and Kim (2003). Closing, high, and low prices are used to compute technical indicators such as the simple 14-day moving average, the 14-day weighted moving average, momentum, stochastic K%, stochastic D%, and the relative strength index (RSI).

The study differs from previous studies in three respects. First, it applies four different ML algorithms simultaneously to compare their performances. Second, it uses both existing continuous datasets and discrete datasets created specifically for this study. Finally, it aims to understand which ML algorithm offers higher forecasting performance in continuous and discrete datasets, separately and comparatively.

The empirical findings reveal that RF has the highest forecasting performance in the continuous dataset, while NB has the lowest. In the discrete dataset, ANN has the highest performance and NB again has the lowest. Furthermore, using a discrete dataset improves overall forecasting performance across all models estimated. To validate these findings further, a validation dataset consisting of 100 Bitcoin series from June 2020 to October 2020 was used to compare each model's performance with the benchmark LR statistics using t-tests. Accuracy statistics were calculated for both continuous and discrete datasets.

In conclusion, this study provides valuable insights into the forecasting of Bitcoin prices using ML algorithms. The use of both continuous and discrete datasets, along with the application of multiple ML algorithms, contributes to a better understanding of which algorithm offers higher forecasting performance.
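
The comparison of each model against the LR benchmark with a t-test can be sketched as follows. The summary does not say which form of t-test the authors used, so the paired version here is an assumption, as are the function name and the accuracy figures, which are made up purely for illustration.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(model_scores, benchmark_scores):
    """Return (t, degrees_of_freedom) for a paired t-test on two score lists.

    Each position is assumed to hold the scores of both models on the same
    validation series, so the test works on the per-series differences.
    """
    if len(model_scores) != len(benchmark_scores):
        raise ValueError("score lists must have the same length")
    diffs = [m - b for m, b in zip(model_scores, benchmark_scores)]
    n = len(diffs)
    if n < 2:
        raise ValueError("at least two paired scores are required")
    sd = stdev(diffs)  # sample standard deviation of the differences
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = mean(diffs) / (sd / math.sqrt(n))
    return t, n - 1


if __name__ == "__main__":
    # Hypothetical per-series accuracies, not results from the paper.
    rf_accuracy = [0.62, 0.58, 0.65, 0.60, 0.63]
    lr_accuracy = [0.55, 0.57, 0.59, 0.54, 0.58]
    t, df = paired_t_statistic(rf_accuracy, lr_accuracy)
    print(f"t = {t:.3f} with {df} degrees of freedom")
```

The resulting statistic would then be compared with the critical value of the t distribution for the reported degrees of freedom; a large positive t suggests the model outperforms the benchmark on the validation series.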
Created on 17 May. 2023