Technical ReportPDF Available

Fundamental Research Questions and Proposals on Predicting Cryptocurrency Prices using DNNs

Authors:

Abstract and Figures

In last decade, cryptocurrency has emerged in financial area as a key factor in businesses and financial market opportunities. Accurate predictions can assist cryptocurrency investors towards right investing decisions and lead to potential increased profits. Additionally, they can also support policy makers and financial researchers in studying cryptocurrency markets behavior. Nevertheless, cryptocurrency price prediction is considered a very challenging task, due to its chaotic and very complex nature. In this study we investigate three major research questions: i) Can deep learning efficiently predict cryp-tocurrency prices? ii) Are cryptocurrency prices a random walk process? iii) Is there a proper validation method of cryptocurrency price prediction models? To this end, we evaluate some of the most successful and widely used in bibliography deep learning algorithms forecasting cryptocurrency prices. The results obtained, provide significant evidence that deep learning models are not able to solve this problem efficiently and effectively. Following detailed experimentation and results analysis, we conclude that it is essential to invent and incorporate new techniques, strategies and alternative approaches such as more sophisticated prediction algorithms, advanced ensemble methods, feature engineering techniques and other validation metrics.
Content may be subject to copyright.
TECHNICAL
RE P O R T
University of Patras
Department of Mathematics
GR-265 00 Patras, Greece.
http://www.math.upatras.gr/
Fundamental Research Questions and
Proposals on Predicting Cryptocurrency
Prices using DNNs
E. Pintelas1, I.E. Livieris1†, S. Stavroyiannis2,
T. Kotsilieris3, P. Pintelas1
NO. TR20-01
1University of Patras, Department of Mathematics, GR-265 00, Patras, Greece.
2Department of Accounting & Finance, University of the Peloponesse, GR 241 00,
Antikalamos, Greece.
3Department of Business Administration, University of the Peloponnese, GR 241 00,
Antikalamos, Greece.
corresponding author
Part of this work has been submitted in Artificial Intelligence Application and Innova-
tions (AIAI) 2020.
TECHNICAL REPORT
NO. TR20-01
Fundamental Research Questions and Proposals on
Predicting Cryptocurrency Prices using DNNs
E. Pintelas, I.E. Livieris, S. Stavroyiannis, T. Kotsilieris and P. Pintelas
February 2020
Abstract. In last decade, cryptocurrency has emerged in financial area as a key factor in
businesses and financial market opportunities. Accurate predictions can assist cryptocur-
rency investors towards right investing decisions and lead to potential increased profits.
Additionally, they can also support policy makers and financial researchers in studying
cryptocurrency markets behavior. Nevertheless, cryptocurrency price prediction is consid-
ered a very challenging task, due to its chaotic and very complex nature. In this study we
investigate three major research questions: i) Can deep learning efficiently predict cryp-
tocurrency prices? ii) Are cryptocurrency prices a random walk process? iii) Is there a
proper validation method of cryptocurrency price prediction models? To this end, we eval-
uate some of the most successful and widely used in bibliography deep learning algorithms
forecasting cryptocurrency prices. The results obtained, provide significant evidence that
deep learning models are not able to solve this problem efficiently and effectively. Follow-
ing detailed experimentation and results analysis, we conclude that it is essential to invent
and incorporate new techniques, strategies and alternative approaches such as: more sophis-
ticated prediction algorithms, advanced ensemble methods, feature engineering techniques
and other validation metrics.
keywords. Deep learning; CNN; LSTM; BiLSTM; Cryptocurrency price prediction; time
series.
PAGE 2 NO. TR20-01
Contents
1 Introduction 3
2 Brief description of advanced deep learning models 4
3 Experimental methodology 5
3.1 Description of the dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2 Validation metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3 Experimental results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4 Discussion 13
4.1 Can deep learning efficiently predict cryptocurrency prices? ............ 13
4.2 Are cryptocurrency prices a random walk process? . . . . . . . . . . . . . . . . . 14
4.3 What is a proper validation method of cryptocurrency price prediction models? . . 15
5 Revisiting the problem 16
6 Conclusions 17
NO. TR20-01 PAGE 3
1 Introduction
Cryptocurrency is a new type of digital currency which utilizes blockchain technology and crypto-
graphic functions to gain transparency, decentralization and immutability [27,28]. Bitcoin (BTC)
is considered the first and the most popular cryptocurrency, which was invented by an anonymous
group or person in 2009. Since then, 4000 alternative cryptocurrencies like Etherium (ETH) and
Ripple (XRP) were created proving that the cryptocurrency market has emerged in financial area.
BTC, ETH and XRP are the three most popular cryptocurrencies, since they almost hold the 79.5%
of the global cryptocurrency market capitalization.
Cryptocurrency price prediction can provide a lending hand to cryptocurrency investors for
making proper investment decisions in order to acquire higher profits while it can also support
policy decision-making and financial researchers for studying cryptocurrency markets behavior.
Cryptocurrency price prediction can be considered as a common type of time series problems, like
the stock price prediction. Traditional time series methods such as the well-known AutoRegressive
Integrated Moving Average (ARIMA) model, have been applied for cryptocurrencies price and
movement prediction [5,8,29,36,38]. However, these models are not able to capture non-linear
patterns of very complicated prediction problems in contrast to Deep Learning algorithms which
achieve greater performance on forecasting time series problems [17,34].
Deep Learning algorithms (DL) are powerful machine learning algorithms which specialize in
solving nonlinear and complex problems exploiting most of the times big amounts of data in order
to become efficient predictor models. The accurate cryptocurrency price prediction is by nature
a significantly challenging and complex problem since its values have very big fluctuations over
time following an almost chaotic and unpredictable behavior. Therefore, deep learning techniques
may constitute the proper methodology to solve this problem.
Recent research efforts have adopted deep learning techniques for predicting cryptocurrency
price. Ji et al. [16] conducted a comparison of state-of-the-art deep neural networks such as Long
Short-Term Memory (LSTM), Deep Neural Networks (DNNs), deep residual network and their
combinations for predicting Bitcoin price. Their results demonstrated slightly better accuracy of
LSTM compared to other models for regression problem while DNNs outperformed all models
on price movement prediction. Shintate and Pichl [33] developed a trend prediction classifica-
tion framework for predicting non-stationary cryptocurrency time series utilizing deep learning.
Their results revealed that their proposed model outperformed LSTM baseline model while the
profitability analysis showed that simple buy-and-hold strategy was superior to their model and
thus it cannot yet be used for algorithmic trading. Lahmiri and Bekiros [21] utilized LSTM and
generalized regression neural networks for cryptocurrency price prediction. Their results showed
that LSTM was superior to the generalized regression neural architecture concluding that deep
learning is a very efficient method in predicting the inherent chaotic dynamics of cryptocurrency
prices. Amjad and Shah [4] used live streaming Bitcoin data for predicting price changes (in-
crease, decrease or no-change), building a model based on the most confident predictions, in order
to perform profitable trades. The classification algorithms which they used were Random Forest,
Logistic Regression and Linear Discriminant Analysis. Their results seem to be very impressive
PAGE 4 NO. TR20-01
since they achieved a high prediction accuracy (>6070%) and about 5.33xaverage return on
investments on a test set.
In this work, we evaluatethe performance of advanced deep learning algorithms for predicting
the price and movement of the three most popular cryptocurrencies (BTC, ETH and XRP). The
main contribution of this research lies in investigating three major questions:
i) Can deep learning efficiently predict cryptocurrency prices?
ii) Are cryptocurrency prices a random walk process?
iii) Is there a proper validation method of cryptocurrency price prediction models?
Furthermore, it also lies in the recommendation for new algorithms and alternative approaches for
the cryptocurrency prediction problem.
The remainder of this research is organized as follows: Section 2performs a brief introduc-
tion to the advanced deep learning models utilized in our experiments. Section 3presents our
research methodology and experimental results. Section 4discusses and answers the three re-
search questions, while Section 5presents our suggestions on possible alternative solutions for the
cryptocurrency prediction problem. Finally, Section 6presents our concluding remarks.
2 Brief description of advanced deep learning models
Deep learning algorithms constitute one of the most powerful machine learning algorithms cat-
egories which have been successfully applied on a multitude of commercial applications. Long
Short Term Memory and Convolutional Neural Networks are probably the most popular, success-
ful and widely used deep learning techniques [7,11,22].
Long Short-Term Memory (LSTM) [12,13] constitute a special type of deep neural networks,
which are able to learn long-term dependencies by utilizing feedback connections in order to “re-
member” past network cell states. These networks have become very popular since they have been
successfully applied on a wide range of applications and have shown remarkable performance on
time series forecasting [10,18,19,25,31]. More specifically, LSTM networks are composed by a
memory cell, an input, output and forget gate. The input gate controls the new stored information
into the memory cell, while the forget gate controls the information which must be vanished. Fi-
nally, the output gate controls the final output information value which is given after a delay into
the forget, input gate utilizing a feedback connection loop. In this way, LSTM is able to create
a controlled information flow filtering unnecessary information and thus achieving to learn long
term dependencies.
Bidirectional Long Short-Term Memory (BiLSTM) [12,32] are a special type of recurrent
neural networks which connect two LSTM layers of converse directions to the same output, in
order to remember past and future network cell states. The principle idea is that each training
sequence is presented forwards and backwards into two separate LSTM layers aiming in accessing
both past and future contexts for a given time. More specifically, the first hidden layer possesses
NO. TR20-01 PAGE 5
recurrent connections from the past time steps; while in the second one, the recurrent connections
are reversed, transferring activation backwards along the sequence.
Convolutional Neural Networks (CNN) [1,3,20] constitute another type of deep neural net-
works which utilize convolution and pooling layers in order to filter the raw input data and extract
valuable features, which will feed a fully connected layer in order to produce the final output. More
specifically, they apply convolution operations in the input data and in order to produce new more
useful features. The convolutional layers are usually followed by a pooling layer which extracts
values from the convolved features producing a lower dimension instance. In fact, a pooling layer
produces new features which can be considered as summarized versions of the convolved features
produced by the convolutional layer. This implies that pooling operations can significantly assist
the network to be more robust since small changes of the inputs, which are usually detected by the
convolutional layers, will become approximately invariant.
3 Experimental methodology
In this work, we evaluate the performance of advanced DL models for predicting the price of BTC,
ETH and XRP. The evaluated DL models are constituted by CNN, LSTM, BiLSTM and dense
layers. Table 1depicts our DL models for the best identified topologies. We have to mention that
exhaustive and thorough experiments were performed in order to identify the DL topologies which
incur the best performance results.
We recall that the basic idea of utilizing LSTM and BiLSTM on cryptocurrency price prediction
problems, is that they might be able to capture useful long or short sequence pattern dependencies,
due to their special architecture design, assisting on prediction performance, while the convolu-
tional layers of a CNN model might filter out the noise of the raw input data and extract valuable
features producing a less complicated dataset which would be more useful for the final prediction
model [23]. Therefore, we expect that a noticeable performance increase will be achieved by the
incorporation of these advanced models comparing to classic machine learning algorithms.
Additionally, the performance of the DL models was compared against that of traditional state-
of-the-art ML models: Support Vector Regressor (SVR) [9], 3-Nearest Neighbors (3NN) [2] and
Decision Tree Regressor (DTR) [24]. The implementation code was written in Python 3.4 while for
all deep learning models we utilized Keras library [14] and Theano as back-end while Scikit-learn
library [30] was used for the machine learning models.
3.1 Description of the dataset
For the purpose of this research, we utilized data from Jan-2018 to Aug-2019, concerning the
hourly prices in USD and were divided into training set consisting of data from Jan-2018 to Feb-
2019 (10176 values) and testing set from Mar-2019 to Aug-2019 (4416 values).
The data were obtained from www.kraken.com website, which is a trading platform for cryp-
tocurrency exchanges. Also, we utilized four different values for the forecasting horizon (number
PAGE 6 NO. TR20-01
of past prices taken into consideration), i.e., 4, 9, 12 and 16 hours. From a forecasting aspect, the
value of the forecasting horizon Fis considered essential crucial for the prediction accuracy of an
intelligent model. The forecasting horizon stands for the number of hourly prices which are taken
into consideration by a forecasting model for predicting the next price. More specifically, in case
the forecasting horizon is equal to 12 means that the model takes into account prices collected from
12 past hours and for predicting the price on the next hour.
Model Description
LSTM1LSTM layer with 50 units.
Output layer of 1 neuron.
LSTM2Two LSTM layers with 30 and 15 units, respectively.
Output layer of 1 neuron.
BiLSTM1BiLSTM layer with 60 units.
Output layer of 1 neuron.
BiLSTM2TwoBiLSTM layers with 40 and 20 units, respectively.
Output layer of 1 neuron.
CNN-LSTM1Convolutional layer with 64 of filters of size (2, ).
Convolutional layer with 128 of filters of size (2, ).
Max pooling layer with size (2, ).
LSTM layer with 100 units.
Output layer of 1 neuron.
CNN-LSTM2Convolutional layer with 64 of filters of size (2, ).
Convolutional layer with 128 of filters of size (2, ).
Max pooling layer with size (2, ).
LSTM layer with 70 units.
Dense layer with 16 neurons.
Output layer of 1 neuron.
CNN-BiLSTM1Convolutional layer with 64 of filters of size (2, ).
Convolutional layer with 128 of filters of size (2, ).
Max pooling layer with size (2, ).
LSTM layer with 100 units.
CNN-BiLSTM2Convolutional layer with 64 of filters of size (2, ).
Convolutional layer with 128 of filters of size (2, ).
Max pooling layer with size (2, ).
BiLSTM layer with 70 units.
Dense layer with 16 neurons.
Table 1: Best identified topologies for our deep learning models
3.2 Validation metrics
The most common validation metrics for measuring the performance of regression problems are
Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) [6]. Since the cryptocurrency
price prediction problem can be considered a regression problem, in our experiments we utilized
these two evaluation metrics. Moreover, by comparing the predicted prices of our models, with
the real ones, we managed to compute the classification accuracy of price movement direction pre-
NO. TR20-01 PAGE 7
diction (if the price will increase or decrease). Therefore, we utilized two additional performance
metrics: Accuracy (Acc) and F1-score (F1).
Finally, in order to check the reliability of our prediction models, we have utilized a test of
autocorrelation in the residuals [26,37]. This test examines the presence of autocorrelation be-
tween the residuals (differences between predicted and real values). In case autocorrelation exists,
then the prediction model may be inefficient since it did not manage to capture all the possible
information which lies into the data [26].
3.3 Experimental results
Tables 2,3,4and 5present the experimental results of our DL models ML models for forecasting
horizon 4, 9, 12 and 16, respectively. CNN-LSTM and CNN-BiLSTM models exhibited the best
overall performance among all prediction models. In particular, the CNN-LSTM exhibited the
highest RMSE performance for all datasets for every forecasting horizon comparing to other DL
models, while the CNN-BiLSTM exhibited the best Acc and F1score in most cases. Nevertheless,
the performance variations for all DL models seem to be minimal. The 3NN model reported
the highest RMSE performance for forecasting horizon 4 among all the ML models on BTC and
ETH datasets, while the DTR exhibited the highest RMSE on XRP. For forecasting horizon 9 the
DTR outperformed all ML models for all dataset, regarding to RMSE score. Furthermore, the
3NN model managed to achieve the best overall performance in Acc score almost in all cases. In
summary, advanced DL models seem to slightly outperform ML models while they did not manage
to achieve a noticeable performance increase comparing to our ML models.
BTC ETH XRP
Model MAE RMSE Acc F1MAE RMSE Acc F1MAE RMSE Acc F1
SVR 0.0178 0.0209 50.80% 0.484 0.0131 0.0163 49.14% 0.482 0.0323 0.0370 47.67% 0.644
3NN 0.0102 0.0161 49.51% 0.469 0.0106 0.0149 51.56% 0.481 0.0073 0.0115 52.33% 0.492
DTR 0.0119 0.0170 49.22% 0.465 0.0127 0.0199 50.11% 0.467 0.0099 0.0157 50.14% 0.504
LSTM10.0089 0.0138 52.86% 0.524 0.0086 0.0130 53.59% 0.531 0.0065 0.0106 53.40% 0.509
LSTM20.0092 0.0144 53.63% 0.522 0.0117 0.0165 52.68% 0.501 0.0072 0.0116 53.13% 0.514
BiLSTM10.0096 0.0142 52.05% 0.508 0.0084 0.0130 53.89% 0.538 0.0069 0.0114 53.21% 0.519
BiLSTM20.0090 0.0132 52.56% 0.519 0.0090 0.0134 53.30% 0.502 0.0074 0.0118 52.75% 0.518
CNN-LSTM10.0072 0.0109 53.21% 0.530 0.0066 0.0106 53.77% 0.533 0.0059 0.0097 52.42% 0.472
CNN-LSTM20.0061 0.0099 52.50% 0.502 0.0067 0.0106 54.20% 0.524 0.0061 0.0097 53.03% 0.436
CNN-BiLSTM10.0064 0.0107 54.51% 0.544 0.0078 0.0121 53.91% 0.524 0.0076 0.0119 53.61% 0.534
CNN-BiLSTM20.0060 0.0101 55.43% 0.548 0.0072 0.0115 54.18% 0.541 0.0076 0.0117 53.46% 0.506
Table 2: Performance of DL and ML forecasting models for F=4
PAGE 8 NO. TR20-01
BTC ETH XRP
Model MAE RMSE Acc F1MAE RMSE Acc F1MAE RMSE Acc F1
SVR 0.0152 0.0192 52.57% 0.546 0.0101 0.0146 49.30% 0.501 0.0214 0.0292 47.92% 0.457
3NN 0.0139 0.0197 51.57% 0.484 0.0144 0.0195 51.50% 0.504 0.0088 0.0132 54.35% 0.485
DTR 0.0134 0.0179 49.54% 0.459 0.0148 0.0228 49.64% 0.465 0.0100 0.0161 49.89% 0.504
LSTM10.0108 0.0159 51.99% 0.479 0.0113 0.0158 53.06% 0.500 0.0083 0.0117 51.34% 0.413
LSTM20.0141 0.0207 52.45% 0.499 0.0153 0.0208 52.75% 0.527 0.0077 0.0115 51.66% 0.456
BiLSTM10.0117 0.0170 52.00% 0.489 0.0111 0.0165 53.31% 0.512 0.0078 0.0126 52.73% 0.517
BiLSTM20.0098 0.0168 52.97% 0.530 0.0109 0.0166 53.80% 0.527 0.0074 0.0121 55.22% 0.534
CNN-LSTM10.0074 0.0119 53.92% 0.536 0.0086 0.0130 53.92% 0.530 0.0058 0.0096 51.06% 0.453
CNN-LSTM20.0065 0.0107 54.20% 0.532 0.0081 0.0124 54.45% 0.537 0.0065 0.0100 51.54% 0.493
CNN-BiLSTM10.0092 0.0149 53.44% 0.533 0.0108 0.0158 53.10% 0.522 0.0097 0.0148 54.01% 0.540
CNN-BiLSTM20.0077 0.0125 54.89% 0.541 0.0101 0.0152 53.95% 0.533 0.0104 0.0157 53.95% 0.532
Table 3: Performance of DL and ML forecasting models for F=9
BTC ETH XRP
Model MAE RMSE Acc F1MAE RMSE Acc F1MAE RMSE Acc F1
SVR 0.0133 0.0179 51.97% 0.512 0.0177 0.0218 51.99% 0.511 0.0201 0.0285 49.86% 0.518
3NN 0.0168 0.0238 50.77% 0.512 0.0198 0.0277 51.04% 0.509 0.0102 0.0153 53.91% 0.535
DTR 0.0173 0.0263 49.95% 0.503 0.0157 0.0227 49.04% 0.490 0.0100 0.0159 51.39% 0.514
LSTM10.0119 0.0178 52.75% 0.511 0.0085 0.0135 53.76% 0.528 0.0071 0.0109 52.71% 0.489
LSTM20.0155 0.0217 51.81% 0.489 0.0151 0.0213 52.76% 0.521 0.0086 0.0123 52.11% 0.449
BiLSTM10.0119 0.0179 52.46% 0.516 0.0135 0.0192 52.70% 0.499 0.0087 0.0127 51.60% 0.460
BiLSTM20.0135 0.0200 52.77% 0.523 0.0141 0.0201 53.31% 0.521 0.0081 0.0126 52.49% 0.487
CNN-LSTM10.0095 0.0145 53.38% 0.525 0.0108 0.0160 53.62% 0.528 0.0068 0.0103 50.51% 0.426
CNN-LSTM20.0075 0.0121 53.26% 0.528 0.0098 0.0148 53.81% 0.525 0.0073 0.0106 50.95% 0.441
CNN-BiLSTM10.0104 0.0161 53.37% 0.519 0.0110 0.0166 53.80% 0.533 0.0068 0.0103 50.06% 0.459
CNN-BiLSTM20.0110 0.0167 53.49% 0.516 0.0112 0.0168 53.68% 0.525 0.0080 0.0112 49.94% 0.499
Table 4: Performance of DL and ML forecasting models for F=12
BTC ETH XRP
Model MAE RMSE Acc F1MAE RMSE Acc F1MAE RMSE Acc F1
SVR 0.0139 0.0185 50.70% 0.512 0.0208 0.0267 51.97% 0.512 0.0188 0.0279 49.45% 0.514
3NN 0.0194 0.0280 50.40% 0.509 0.0211 0.0291 50.99% 0.508 0.0120 0.0176 53.42% 0.530
DTR 0.0172 0.0262 49.05% 0.494 0.0161 0.0226 48.30% 0.482 0.0102 0.0159 51.44% 0.513
LSTM10.0143 0.0198 51.29% 0.465 0.0145 0.0196 51.74% 0.464 0.0090 0.0122 49.90% 0.407
LSTM20.0145 0.0199 51.80% 0.473 0.0213 0.0294 52.49% 0.507 0.0114 0.0148 52.22% 0.430
BiLSTM10.0154 0.0221 52.20% 0.509 0.0142 0.0202 53.38% 0.508 0.0088 0.0127 54.36% 0.489
BiLSTM20.0154 0.0222 52.27% 0.514 0.0148 0.0213 53.27% 0.523 0.0087 0.0130 55.90% 0.506
CNN-LSTM10.0092 0.0147 53.31% 0.531 0.0119 0.0172 53.61% 0.517 0.0074 0.0108 52.00% 0.419
CNN-LSTM20.0094 0.0140 51.30% 0.501 0.0113 0.0166 53.20% 0.527 0.0066 0.0102 53.17% 0.444
CNN-BiLSTM10.0129 0.0195 51.99% 0.506 0.0134 0.0199 53.22% 0.529 0.0066 0.0102 51.71% 0.445
CNN-BiLSTM20.0133 0.0200 51.51% 0.501 0.0138 0.0203 53.38% 0.524 0.0066 0.0101 50.90% 0.466
Table 5: Performance of DL and ML forecasting models for F=16
NO. TR20-01 PAGE 9
In the sequel, in order to identify the reliability of models’ predictions, we applied a residual
autocorrelation test for the models which achieved the best overall scores regarding all performance
metrics and dataset, namely CNN-LSTM and CNN-BiLSTM for forecasting horizons 4 and 9.
Figures 1-12 present the Auto-Correlation Function (ACF) plot of both selected convolutional-
based models for BTC, ETH and XRP. Notice that the confident limits (blue dashed line) are con-
structed assuming that the residuals follow a Gaussian probability distribution. Clearly, all present
ACF plots reveal that some correlation coefficients were not within the confidence limits (dashed
lines), violating the assumption of no auto-correlation in the errors. Therefore, the presence of cor-
relation indicates that the advanced DL models are unreliable for cryptocurrency price predictors
since there exists some significant information left over which should be taken into account for
obtaining better predictions.
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 1: ACF plots on the residuals for BTC using CNN-LSTM for F=4
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 2: ACF plots on the residuals for BTC using CNN-LSTM for F=9
PAGE 10 NO. TR20-01
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 3: ACF plots on the residuals for BTC using CNN-BiLSTM for F=4
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 4: ACF plots on the residuals for BTC using CNN-BiLSTM for F=9
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 5: ACF plots on the residuals for ETH using CNN-LSTM for F=4
NO. TR20-01 PAGE 11
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 6: ACF plots on the residuals for ETH using CNN-LSTM for F=9
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 7: ACF plots on the residuals for ETH using CNN-BiLSTM for F=4
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 8: ACF plots on the residuals for ETH using CNN-BiLSTM for F=9
PAGE 12 NO. TR20-01
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 9: ACF plots on the residuals for XRP using CNN-LSTM for F=4
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 10: ACF plots on the residuals for XRP using CNN-LSTM for F=9
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 11: ACF plots on the residuals for XRP using CNN-BiLSTM for F=4
NO. TR20-01 PAGE 13
-0.2
0
0.2
0.4
0.6
0.8
1
Auto-Correlation
0 1 2 3 4 5 6 7 8 9 10
Lag
Figure 12: ACF plots on the residuals for XRP using CNN-BiLSTM for F=9
Regarding BTC, the ACF plots reveal that some correlation coefficients were not within the
confidence limits, violating the assumption of no auto-correlation in the errors. More specifically,
the ACF plots of CNN-LSTM for F=4 and F=9 as well as the ACF plot of CNN-BiLSTM for
F=9 presented that there are significant spikes at lags 1 5 while the ACF plot of CNN-BiLSTM
presented that there are significant spikes at lags 1 and 2. Regarding ETH, both ACF plots of
CNN-LSTM and CNN-BiLSTM for F=4 revealed that there are significant spikes at lags 1 and 2
while for F=9 revealed that that there exist significant spikes at lags 1-5. Finally, regarding XRP,
both ACF plots of CNN-LSTM and the ACF-plot of CNN-BiLSTM for F=4 presented that there
exist some small spikes at lag 1 and 2 while the ACF-plot of CNN-BiLSTM for F=9 presented a
negligible spike at lags 1 and 2.
Summarizing, the interpretation of Figures 1-12 illustrated the presence of auto-correlation in
the residuals of all advanced DL models. Thus, we concluded that these models are unreliable
for cryptocurrency price predictors since there exists some significant information left over which
should be taken into account for obtaining better predictions.
4 Discussion
Following our experiments, this section is dedicated in providing a thorough and sufficiently de-
tailed discussion of our findings with regard to the predefined three research questions: Can deep
learning algorithms efficiently predict cryptocurrency prices? Are cryptocurrency prices a random
walk process? Which is a proper validation method of cryptocurrency price prediction models?
4.1 Can deep learning efficiently predict cryptocurrency prices?
Deep learning algorithms are considered to be the most powerful and the most effective methods in
approximating extremely complex and non-linear classification and regression problems, therefore
it was expected that a noticeable performance increase will be achieved by the incorporation of
PAGE 14 NO. TR20-01
these models comparing to classic machine learning algorithms. Surprisingly, our results demon-
strated that the utilized DL algorithms, slightly outperformed the other ML algorithms utilized
in our experiments, whereas instead a noticeable performance increase was anticipated. So, it is
paramount importance to investigate the reason why that happened. To this end, we summarize
two possible reasons: The problem we are trying to solve is a random walk process or very close
to it, thus any attempt for prediction might be of poor quality or the problem is just too compli-
cated that even advanced deep learning methods cannot find any pattern that would lead to any
reliable prediction. Thus, more sophisticated methodologies, techniques and innovative strategies
are needed to be investigated.
When a time series prediction problem follows a random walk process or it is so complicated
that most models face it as a random process, then the more efficient method to face it, is the
employment of present values as the prediction values for the next state [26]. That is exactly what
a persistence model does and maybe what most prediction models really do and possibly that’s
the reason why ML models used in our experiments achieve almost the same performance score
compared to the deep learning models used in our experiments. In contrast, the deep learning
models may attempt forecasting based on patterns that were traced and as a result are unable to
achieve high performance because either those patterns are false or because there exist no such
patterns at all, in the case that the cryptocurrency price prediction problem is a random walk
process.
Nevertheless, as mentioned before, the DL models did not manage to achieve a noticeable
performance score in our experiments, since their score was almost the same with the ML models.
Thus, we conclude that these advanced DL models cannot efficiently predict cryptocurrency prices
because the utilized datasets with the specific form which we fed” them to our prediction models,
probably follow almost a random walk process and thus not sufficient information lies on them in
order to perform accurate and reliable future predictions.
4.2 Are cryptocurrency prices a random walk process?
Towards the construction of a model which performs reliable and accurate predictions, firstly, we
have to identify if the cryptocurrency price prediction problem is a random walk process. In a
recent study, Stavroyiannis et al. [35] proved that Bitcoin prices follow a random walk process
since their experiments revealed the presence of unit roots, for several time intervals from 1-min to
180-min, and thus reliable profitable trading opportunities may not be possible in Bitcoin markets.
However, since this problem is highly affected by time evolution and external changes, these results
maybe temporary and reverse in future.
However, there are numerous technical strategies that the majority of the professional traders
utilize in order to make trading decisions in stock market and cryptocurrency investments. Most
of them seem to be heuristic and empirical strategies which are based on various technical indica-
tors and patterns such as the “Engulfing Pattern” and the “Evening Star”. A recent study utilized
those technical indicators and trading patterns strategies in order to predict stock market and cryp-
tocurrency prices [15]. Their results provide evidence that technical analysis strategies have strong
NO. TR20-01 PAGE 15
predictive power and thus can be useful in cryptocurrencies markets like Bitcoin.
Therefore, we conclude that the cryptocurrency prices in general are not totally a random walk
process but they may be close to it, which means that probably exist some actual patterns on historic
data that could assist on forecasting attempts. In other words, we liken this problem as a “huge
sea of random walk points where small hidden islands (patterns) may exist in”. As a result, more
research is required for the discovery of alternative, innovative and more sophisticated methods
such as the incorporation of new feature engineering strategies and the creation of new algorithmic
and ensemble methods.
4.3 What is a proper validation method of cryptocurrency price prediction
models?
As mentioned above, the most common validation metrics for measuring the performance of most
regression algorithms are MAE and RMSE. However, finding a proper validation metric for cryp-
tocurrency price prediction models can be a very complicated and tricky task and cannot be con-
sidered an easy and straightforward process. The MAE and RMSE may constitute an incomplete
way for validating cryptocurrency price prediction problems since a prediction model may have
excellent MAE and RMSE performance but cannot properly predict the cryptocurrency price di-
rection move (classification problem). A cryptocurrency trader or investor may be more interested
in the future price direction movement rather than knowing the exact future cryptocurrency price.
Profitability analysis for algorithmic trading strategies reveal that classification prediction models
were more effective than regression models [16].
Even if we utilize a third evaluation metric which will measure the performance accuracy of
cryptocurrency price direction movement, that may still constitute an incomplete method for val-
idating cryptocurrency prediction algorithms. Consider the following example: Suppose we wish
to validate 2 cryptocurrency prediction models utilizing a test set of 100 questions, e.g. what is the
future price direction movement on the next 100 time steps? The first model answers (predicts) all
questions while it answers correctly 52 questions achieving an accuracy score of 52%. The second
model answers only 5 from 100 questions but it cannot answer the other 95 questions, while these
5 answers are correct. So, the second model achieves a score of 5%. Thus, an important question
is raised, “which is the best model”? A cryptocurrency trader or investor will probably choose the
second model since it acts in a more reliable way and it would be more valuable for him to possess
a model which performs accurate predictions on random times (specified by the model), rather
than possessing a model which performs unreliable predictions on every moment (specified by the
user).
Therefore, we conclude that finding a proper validation metric for cryptocurrency price pre-
diction models is a very challenging task and thus alternative and new methods for evaluating
cryptocurrency prediction models are essential.
PAGE 16 NO. TR20-01
5 Revisiting the problem
One of the most significant steps in order to solve any problem, especially the really hard and chal-
lenging ones, lies in finding a proper strategy approach and securing the complete understanding of
the problem we try to solve. A proper strategy approach should answer questions such as: should
we have to predict prices, price movement direction, price trends, price spikes and so on. Next,
should we apply data preprocessing and feature engineering strategies (e.g. which features should
we use in order to efficiently train a prediction model ?) Also, what is the best prediction model to
apply (e.g. DNNs, other sophisticated prediction models, ensemble models and so on) and finally,
which is a proper method to validate this model? All these issues, considered as discrete steps in
the process, should be taken into serious consideration since each one of them can significantly
contribute to any prediction attempt in order to efficiently approximate the problem.
These steps are not a straightforward process, since we should always have to consider its
chaotic and extremely complicated nature with respect to its practical contribution after a possible
solution. For example, it may be an easier task to solve and possibly more beneficial for the invest-
ment and trading world to predict if the price will just increase or decrease (classification problem
for price direction movement prediction) rather than predicting the exact value of cryptocurrency
price. Some strategy approaches examples are presented in brief below.
Instead of adopting a specific time interval, one could utilize various time intervals of higher
and lower frequency historic datasets for predicting the prices on a specific future interval in order
to utilize and exploit in a more efficient way all possible information that a historic dataset may
contain. Another approach could be instead of predicting the price or the movement direction on
one discrete future time value, to predict the average and movement direction price or peak price
inside a future time window frame (this approach would be more similar to a trend prediction
problem).
Pattern identification and recognition could be another approach. This approach would be more
similar to a pattern detection framework in which the model would detect specific pattern areas in
order to perform a prediction. More specifically, if we are able to identify the feature characteristics
of possible useful patterns that a prediction model found, then we could filter out useless sequence
inputs which have no predictive information and then utilize only the useful sequence inputs which
will possibly assist on reliable and accurate predictions. In this case the prediction model will
perform prediction operations only when the input sequence falls into the same category with the
chosen patterns. This framework would be more similar to the way that a professional trader often
acts, who performs investment decisions based on his/her personal chosen patterns and indicators
recipes on technical analysis of historic price charts.
Finally, another approach could be the investigation of heuristic patterns and other financial
indicators which professional traders and bankers utilize in their trading and financial technical
analysis. It is essential to identify how these methods actually assist predictions and investment de-
cisions in a more mathematic way (if they actually work) and maybe incorporate these techniques
in a machine learning framework for developing co-operative prediction models. That could be an
effective cryptocurrency prediction framework.
NO. TR20-01 PAGE 17
6 Conclusions
In this work, we evaluated advanced DL models for predicting cryptocurrency prices and also
investigated three research question concerning this problem in a review and discussion approach.
Our results revealed that the presented models are inefficient and unreliable cryptocurrency price
predictors, probably due to the fact that this problem is a very complicated one, that even advanced
deep learning techniques such as LSTM and CNNs are not able to solve efficiently. Also, based on
our experimental results and investigation regarding to our research questions about cryptocurrency
price problem, we conclude that cryptocurrency prices follow almost a random walk process while
few hidden patterns may probably exist in, where an intelligent framework has to identify them in
order a prediction model to make accurate and reliable forecasts. Therefore, new sophisticated
algorithmic methods, alternative approaches, new validation methods and the incorporation of
ensemble methods should be explored.
References
[1] Hamed Habibi Aghdam and Elnaz Jahani Heravi. Guide to convolutional neural networks.
New York, NY: Springer. doi, 10:978–3, 2017.
[2] David W Aha. Lazy learning. Springer Science & Business Media, 2013.
[3] Neena Aloysius and M Geetha. A review on deep convolutional neural networks. In 2017
International Conference on Communication and Signal Processing, pages 588–592. IEEE,
2017.
[4] MuhammadAmjad and Devavrat Shah. Trading bitcoin and online time series prediction. In
NIPS 2016 Time Series Workshop, pages 1–15, 2017.
[5] Nashirah Abu Bakar and Sofian Rosbi. Autoregressive integrated moving average (ARIMA)
model for forecasting cryptocurrency exchange rate in high volatility environment: A new
insight of bitcoin transaction. International Journal of Advanced Engineering Research and
Science, 4(11), 2017.
[6] Alexei Botchkarev. Performance metrics (error measures) in machine learning regression,
forecasting and prognostics: Properties and typology. arXiv preprint arXiv:1809.03006,
2018.
[7] Eugene Charniak. Introduction to deep learning. The MIT Press, 2019.
[8] Shi Chen, Cathy Yi-Hsuan Chen, Wolfgang Karl H¨ardle, TM Lee, and Bobby Ong.
Econometric analysis of a cryptocurrency index for portfolio investment. In Handbook of
Blockchain, Digital Finance, and Inclusion, Volume 1, pages 175–206. Elsevier, 2018.
PAGE 18 NO. TR20-01
[9] N. Deng, Y. Tian, and C. Zhang. Support vector machines: optimization based theory, algo-
rithms, and extensions. Chapman and Hall/CRC, 2012.
[10] Hassan Ismail Fawaz, Germain Forestier, Jonathan Weber, Lhassane Idoumghar, and Pierre-
Alain Muller. Deep learning for time series classification: a review. Data Mining and Knowl-
edge Discovery, 33(4):917–963, 2019.
[11] Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep learning book. MIT Press,
521(7553):800, 2016.
[12] Alex Graves. Long short-term memory. In Supervised sequence labelling with recurrent
neural networks, pages 37–45. Springer, 2012.
[13] Klaus Greff, Rupesh K Srivastava, Jan Koutn´ık, Bas R Steunebrink, and J¨urgen Schmidhu-
ber. LSTM: A search space odyssey. IEEE Transactions on Neural Networks and Learning
Systems, 28(10):2222–2232, 2016.
[14] A. Gulli and S. Pal. Deep Learning with Keras. Packt Publishing Ltd, 2017.
[15] Jing-Zhi Huang, William Huang, and Jun Ni. Predicting Bitcoin returns using high-
dimensional technical indicators. The Journal of Finance and Data Science, 5(3):140–155,
2019.
[16] Suhwan Ji, Jongmin Kim, and Hyeonseung Im. A comparative study of Bitcoin price predic-
tion using deep learning. Mathematics, 7(10):898, 2019.
[17] ES Karakoyun and AO Cibikdiken. Comparison of ARIMA time series model and LSTM
deep learning algorithm for bitcoin price forecasting. In The 13th MultidisciplinaryAcademic
Conference in Prague, pages 171–180, 2018.
[18] Fazle Karim, Somshubra Majumdar, Houshang Darabi, and Shun Chen. Lstm fully convolu-
tional networks for time series classification. IEEE Access, 6:1662–1669, 2017.
[19] Fazle Karim, Somshubra Majumdar, Houshang Darabi, and Samuel Harford. Multivariate
LSTM-FCNS for time series classification. Neural Networks, 116:237–245, 2019.
[20] Salman Khan, Hossein Rahmani, Syed Afaq Ali Shah, and Mohammed Bennamoun. A
guide to convolutional neural networks for computer vision. Synthesis Lectures on Computer
Vision, 8(1):1–207, 2018.
[21] Salim Lahmiri and Stelios Bekiros. Cryptocurrency forecasting with deep learning chaotic
neural networks. Chaos, Solitons & Fractals, 118:35–40, 2019.
[22] Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. Nature, 521(7553):436–
444, 2015.
NO. TR20-01 PAGE 19
[23] Ioannis E Livieris, Emmanuel Pintelas, and Panagiotis Pintelas. A CNN-LSTM model for
gold price time series forecasting. Neural Computing and Applications, 2020 (under revi-
sion).
[24] Wei-Yin Loh. Classification and regression tree methods. Wiley StatsRef: Statistics Reference
Online, 2014.
[25] Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, and Puneet Agarwal. Long short term mem-
ory networks for anomaly detection in time series. In Proceedings, volume 89. Presses uni-
versitaires de Louvain, 2015.
[26] Douglas C Montgomery, Cheryl L Jennings, and Murat Kulahci. Introduction to time series
analysis and forecasting. John Wiley & Sons, 2015.
[27] Ujan Mukhopadhyay, Anthony Skjellum, Oluwakemi Hambolu, Jon Oakley, Lu Yu, and
Richard Brooks. A brief survey of cryptocurrency systems. In 14th Annual Conference
on Privacy, Security and Trust, pages 745–752. IEEE, 2016.
[28] Arvind Narayanan, Joseph Bonneau, Edward Felten, Andrew Miller, and Steven Goldfeder.
Bitcoin and cryptocurrency technologies: a comprehensive introduction. Princeton Univer-
sity Press, 2016.
[29] D Olvera-Juarez and E Huerta-Manzanilla. Forecasting bitcoin pricing with hybrid models:
A review of the literature. International Journal of Advanced Engineering Research and
Science, 6(9), 2019.
[30] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel,
P. Prettenhofer, R. Weiss, and V. Dubourg. Scikit-learn: Machine learning in Python. Journal
of Machine Learning Research, 12:2825–2830, 2011.
[31] Alaa Sagheer and Mostafa Kotb. Time series forecasting of petroleum production using deep
LSTM recurrent networks. Neurocomputing, 323:203–213, 2019.
[32] Mike Schuster and Kuldip K Paliwal. Bidirectional recurrent neural networks. IEEE trans-
actions on Signal Processing, 45(11):2673–2681, 1997.
[33] Takuya Shintate and Luk´s Pichl. Trend prediction classification for high frequency bitcoin
time series with deep learning. Journal of Risk and Financial Management, 12(1):17, 2019.
[34] Sima Siami-Namini, Neda Tavakoli, and Akbar Siami Namin. A comparison of ARIMA and
LSTM in forecasting time series. In 2018 17th IEEE International Conference on Machine
Learning and Applications, pages 1394–1401. IEEE, 2018.
[35] Stavros Stavroyiannis, Vassilios Babalos, Stelios Bekiros, Salim Lahmiri, and Gazi Salah Ud-
din. The high frequency multifractal properties of Bitcoin. Physica A: Statistical Mechanics
and its Applications, 520:62–71, 2019.
PAGE 20 NO. TR20-01
[36] Gergana Taneva. An analysis and a forecast of the cryptomarket based on the ARIMA model.
Economic Thought journal, (4):66–84, 2019.
[37] Ruey S Tsay. Analysis of financial time series, volume 543. John Wiley & Sons, 2005.
[38] Is¸il Yenido˘gan, Aykut C¸ ayir, Ozan Kozan, Tu˘gc¸e Da˘g, and C¸ i˘gdem Arslan. Bitcoin forecast-
ing using ARIMA and prophet. In 3rd International Conference on Computer Science and
Engineering, pages 621–624. IEEE, 2018.
... After the model structure has been defined, function f (·) can be determined by traditional time-series methods such as ARIMA (Auto-Regressive Integrated Moving Average) and GARCH-type models and their variations [2][3][4] or by machine learning methods such as Artificial Neural Networks (ANNs) [5,6]. However, both mentioned approaches fail to depict the stochastic and chaotic nature of cryptocurrency time-series and be successfully effective for accurate forecasting [7]. To this end, more sophisticated algorithmic approaches have to be applied such as deep learning and ensemble learning methods. ...
... Along this line, Livieris et al. [10] and Pintelas et al. [11] proposed CNN-LSTM models with various architectures for efficiently forecasting gold and cryptocurrency time-series price and movement, reporting some interesting results. Nevertheless, although deep learning models are tailored to cope with temporal correlations and efficiently extract more valuable information from the training set, they failed to generate reliable forecasting models [7,11]; while in contrast ensemble learning models although they are an elegant solution to develop stable models and address the high variance of individual forecasting models, their performance heavily depends on the diversity and accuracy of the component learners. Therefore, a time-series prediction model, which exploits the benefits of both mentioned methodologies may significantly improve the prediction performance. ...
... Pintelas et al. [7,11] conducted a detailed research, evaluating advanced deep learning models for predicting major cryptocurrency prices and movements. Additionally, they conducted a detailed discussion regarding the fundamental research questions: Can deep learning algorithms efficiently predict cryptocurrency prices? ...
Article
Full-text available
Nowadays, cryptocurrency has infiltrated almost all financial transactions; thus, it is generally recognized as an alternative method for paying and exchanging currency. Cryptocurrency trade constitutes a constantly increasing financial market and a promising type of profitable investment; however, it is characterized by high volatility and strong fluctuations of prices over time. Therefore, the development of an intelligent forecasting model is considered essential for portfolio optimization and decision making. The main contribution of this research is the combination of three of the most widely employed ensemble learning strategies: ensemble-averaging, bagging and stacking with advanced deep learning models for forecasting major cryptocurrency hourly prices. The proposed ensemble models were evaluated utilizing state-of-the-art deep learning models as component learners, which were comprised by combinations of long short-term memory (LSTM), Bi-directional LSTM and convolutional layers. The ensemble models were evaluated on prediction of the cryptocurrency price on the following hour (regression) and also on the prediction if the price on the following hour will increase or decrease with respect to the current price (classification). Additionally, the reliability of each forecasting model and the efficiency of its predictions is evaluated by examining for autocorrelation of the errors. Our detailed experimental analysis indicates that ensemble learning and deep learning can be efficiently beneficial to each other, for developing strong, stable, and reliable forecasting models.
... The results showed that LSTM-NN models performed better than the other ANN models. However, the recent researches ( [19,20]) reported that the advanced deep learning methods (e.g., LSTM, convolutional neural networks (CNN)-LSTM, bidirectional LSSTM (BiLSTM) and CNN-BiLSTM) produced inefficient and unreliable predictions for the complicated times series (e.g., cryptocurrency price and movement prediction, or streamflow). ...
... Streamflow phenomenon is affected from many factors such as precipitation, air temperature, evapotranspiration, infiltration, slope of the basin and vegetation coverage, and it involves deterministic, stochastic and chaotic parts. For this reason, new sophisticated methods or alternatives with new validation techniques are needed for prediction of complicated time series like streamflow ( [19,20]). ...
Article
Full-text available
Accurate estimation of streamflow has a vital importance in water resources engineering, management and planning. In the present study, the abilities of group method of data handling-neural networks (GMDH-NN), dynamic evolving neural-fuzzy inference system (DENFIS) and multivariate adaptive regression spline (MARS) methods are investigated for monthly streamflow prediction. Precipitation, temperature and streamflows from Kalam and Chakdara stations at Swat River basin (mountainous basin), Pakistan, are used as inputs to the applied models in the form of different input scenarios, and models’ performances are evaluated on the basis of root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe efficiency (NSE) and combined accuracy (CA) indexes. Test results of the Kalam Station show that the DENFIS model provides more accurate prediction results in comparison of GMDH-NN and MARS models with the lowest RMSE (18.9 m3/s), MAE (13.1 m3/s), CA (10.6 m3/s) and the highest NSE (0.941). For the Chakdara Station, the MARS outperforms the GMDH-NN and DENFIS models with the lowest RMSE (47.5 m3/s), MAE (31.6 m3/s), CA (26.1 m3/s) and the highest NSE (0.905). Periodicity (month number of the year) effect on models’ accuracies in predicting monthly streamflow is also examined. Obtained results demonstrate that the periodicity improves the models’ accuracies in general but not necessarily in every case. In addition, the results also show that the monthly streamflow could be successfully predicted using only precipitation and temperature variables as inputs.
... Recently, Pintelas at al. [24,25] evaluated the performance of several deep learning models for price and movement forecasting of major cryptocurrencies. The major novelty in their work was the application of a series of tests for examining the prediction efficiency but mostly the reliability of the models. ...
... Pintelas et al. [24,25] performed a comprehensive research, evaluating advanced deep learning models for forecasting the prices and directional movements of major cryptocurrencies. Furthermore, the authors conducted a detailed discussion, concerning if deep learning models can be trusted as reliable predictors and if the cryptocurrencies prices follow a random walk process. ...
Article
Full-text available
Time-series analysis and forecasting is generally considered as one of the most challenging problems in data mining. During the last decade, powerful deep learning methodologies have been efficiently applied for time-series forecasting; however, they cannot guarantee the development of reliable prediction models. In this work, we introduce a novel framework for supporting deep learning in enhancing accurate, efficient and reliable time-series models. The major novelty of our proposed methodology is that it ensures a time-series to be "suitable" for fitting a deep learning model by performing a series transformations in order to satisfy the stationarity property. The enforcement of stationarity is performed by the application of Augmented Dickey-Fuller test and transformations based on first differences or returns, without the loss of any embedded information. The reliability of the deep learning model's predictions is guaranteed by rejecting the hypothesis of auto-correlation in the model's errors, which is demonstrated by autocorrelation function plots and Ljung-Box Q-test. Our numerical experiments were performed utilizing time-series from three real-world application domains (financial market, energy sector, cryptocurrency area), which incorporate most of global research interest. The performance of all forecasting models were compared on both problems of forecasting time-series price (regression) and time-series directional movements (classification). Additionally, the reliability of the models' forecasts was evaluated by examining the existence of autocorrelation in the errors. Our numerical experiments indicate that our proposed methodology considerably improves the forecasting performance of a deep learning model, in terms of efficiency, accuracy and reliability.
... Deep learning networks are used extensively to forecast the movement of the crypto market. They are "considered to be the most powerful and the most effective methods in approximating extremely complex and non-linear classification and regression problems" [Pintelas et al. 2020b]. According to this author, deep neural networks can deliver modest improvements over traditional machine learning methods. ...
Conference Paper
Full-text available
Cryptocurrency has become a popular asset in global financial markets, meaning that individual investors and asset management companies worldwide are considering this new investment class. The main contribution of this research is to address an intra-day forecasting problem with hourly granularity by comparing deep network architectures, including ones with attention mechanisms for the Ethereum intrinsic cryptocurrency (ETH). Since variations on the deep learning model parameter values may also introduce variability in the results produced by the models, different statistical validations were considered part of the comparison process. Finally, this work shows that the Temporal Convolutional Network model (TCN) outperformed other architectures considered for a short-term forecast period in terms of processing time. The TCN deep learning model is also amongst the most accurate models, using an auto-regressive integrated moving average model (ARIMA) as a baseline.
... Deep learning networks are used extensively to forecast the movement of the crypto market and "considered to be the most powerful and the most effective methods in approximating extremely complex and non-linear classification and regression problems" [Pintelas et al. 2020b]. Albeit modest, DNNs did deliver an improvement over traditional machine learning methods. ...
Conference Paper
Full-text available
The main contribution of this research is to investigate whether an Artificial Neural Network is an option to predict Ethereum cryptocurrency close price on a time constrained scenario. The ANN training time and time lagged data availability are considered as constraints on finding the fastest and the most accurate regression model using ARIMA results as a baseline. As part of the study, hourly aggregated data is processed to generate a step-ahead forecast and then processing time is compared for each architecture. Previous work related to cryptocurrency forecasting usually focus the analysis only on accuracy, and use coarser data granularity. Results have shown that convolutional neural networks over performed other architectures for accuracy and time objectives.
... Pintelas et al. [98] investigated the efficiency of DL in predicting time-series cryptocurrency prices and whether there is a proper validation method of the prediction models. Their results provided evidence that DL models are not able to solve this problem effectively. ...
Article
Full-text available
Time-series forecasting is a significant discipline of data modeling where past observations of the same variable are analyzed to predict the future values of the time series. Its prominence lies in different use cases where it is required, including economic, weather, stock price, business development, and other use cases. In this work, a review was conducted on the methods of analyzing time series starting from the traditional linear modeling techniques until the automated machine learning (AutoML) frameworks, including deep learning models. The objective of this review article is to support identifying the time-series forecasting challenge and the different techniques to meet the challenge. This work can be additionally an assist and a reference for researchers and industries demanding to use AutoML to solve the problem of forecasting. It identifies the gaps of the previous works and techniques used to solve the problem of forecasting time series.
... In a recent study, Pintelas et al. [43,44] evaluated the performance of some robust DL models and identified the drawbacks of these models for time-series forecasting as well as the way for reliable model evaluation. More specifically, the authors stated that the achievement of good prediction accuracy based on classic regression metrics, can guarantee only how close is the actual value to the the predicted value and not the ability to conduct reliable predictions. ...
Article
Full-text available
Multi-step forecasting is considered as an open challenge in time-series analysis. Although several approaches were proposed to address this complex prediction problem, none of them could secure the development of an efficient as well as a reliable multi-step forecasting model. In this research , we present a novel strategy for the development of accurate, robust and reliable multi-step deep learning models. The proposed strategy is based on a sophisticated al-gorithmic framework, which is able to process, transform and deliver "high-quality" and "suitable" time-series training data. The suitability of the transformed data is secured by taking into consideration and exploiting the dynamics and the sampling of the time-series in conjunction with the forecasting horizon as well as the imposition of the station-arity property. The conducted numerical experiments performed on challenging real-world time-series datasets from the application domains: finance, commodity, climate and air quality, which demonstrate the efficacy, robustness and reliability of the proposed multi-step strategy.
... LSTM is the most successful and widely used algorithm for prediction, so many authors have used it and improved the prediction accuracy [72]. For example, Pintelas et al. [73] employed three models based on DNN, namely LSTM, the classical machine learning-based models, and an MSE is considered for evaluation purposes. Another finding from the literature survey is that no one has used the direction of the parent's coin to predict the price of the dependent cryptocurrency. ...
Article
Full-text available
Blockchain technology is becoming increasingly popular because of its applications in various fields. It gives an edge over the traditional centralized methods as it provides decentralization, immutability, integrity, and anonymity. The most popular application of this technology is cryptocurrencies, which showed a massive rise in their popularity and market capitalization in recent years. Individual investors, big institutions, and corporate firms are investing heavily in it. However, the crypto market is less stable than traditional commodity markets. It can be affected by many technical, sentimental, and legal factors, so it is highly volatile, uncertain, and unpredictable. Plenty of research has been done on various cryptocurrencies to forecast accurate prices, but the majority of these approaches can not be applied in real-time. Motivated from the aforementioned discussion, in this paper, we propose a deep-learning-based hybrid model (includes Gated Recurrent Units (GRU) and Long Short Term Memory (LSTM)) to predict the price of Litecoin and Zcash with inter-dependency of the parent coin. The proposed model can be used in real-time scenarios and it is well trained and evaluated using standard data sets. Results illustrate that the proposed model forecasts the prices with high accuracy compared to existing models.
Article
Cryptocurrencies are becoming popular day by day and their use in financial applications has also increased; hence they are recognized as a method for payments. For decades, trading in cryptocurrency has been popular and constantly increasing in financial markets; hence investors are looking forward to their positive returns through these cryptocurrencies. However, volatility of cryptocurrency prices is very significant; the volatility in financial markets records the changes in the price of a cryptocurrency. Steady increases or decreases in price within a certain range can be healthy. There can also be rapidchanges in prices in either direction. Healthy volatility mainly opens profit opportunities. Investors can invest accordingly and earn maximum profit. So, there is a need to assess the investment concerning the profit scenario. An accurate price prediction model helps investors to make decisions and earn profits. Hence, there is a strong need for developing an accurate price prediction model. In this research work, an LSTM model is used to predict five major cryptocurrencies that are; Ethereum, Cardano ADA, EOS, NEO, and Tron. For measuring model performance, RMSE (Root Mean Squared Error) has been computed for all five cryptocurrencies. It has been observed that Tron cryptocurrency has the least RMSE and Ethereum has the worst RMSE. Hence, it can be concluded that the LSTM model is the best fit for the Tron dataset and can predict accurate prices for the future; therefore, the investor can make an investment and earnprofit.
Article
Full-text available
Gold price volatilities have a significant impact on many financial activities of the world. The development of a reliable prediction model could offer insights in gold price fluctuations, behavior and dynamics and ultimately provide the opportunity of gaining significant profits. In this work, we propose a new deep learning forecasting model for the accurate prediction of gold price and movement. The proposed model exploits the ability of convolutional layers for extracting useful knowledge and learning the internal representation of time-series data as well as the effectiveness of Long Short Term Memory (LSTM) layers for identifying short-term and long-term dependencies. We conducted a series of experiments and evaluated the proposed model against state-of-the-art deep learning and machine learning models. The preliminary experimental analysis illustrated that the utilization of LSTM layers along with additional convolutional layers could provides a significant boost in increasing the forecasting performance.
Article
Full-text available
Bitcoin has recently received a lot of attention from the media and the public due to its recent price surge and crash. Correspondingly, many researchers have investigated various factors that affect the Bitcoin price and the patterns behind its fluctuations, in particular, using various machine learning methods. In this paper, we study and compare various state-of-the-art deep learning methods such as a deep neural network (DNN), a long short-term memory (LSTM) model, a convolutional neural network, a deep residual network, and their combinations for Bitcoin price prediction. Experimental results showed that although LSTM-based prediction models slightly outperformed the other prediction models for Bitcoin price prediction (regression), DNN-based models performed the best for price ups and downs prediction (classification). In addition, a simple profitability analysis showed that classification models were more effective than regression models for algorithmic trading. Overall, the performances of the proposed deep learning-based prediction models were comparable.
Article
Full-text available
Time Series Classification (TSC) is an important and challenging problem in data mining. With the increase of time series data availability, hundreds of TSC algorithms have been proposed. Among these methods, only a few have considered Deep Neural Networks (DNNs) to perform this task. This is surprising as deep learning has seen very successful applications in the last years. DNNs have indeed revolutionized the field of computer vision especially with the advent of novel deeper architectures such as Residual and Convolutional Neural Networks. Apart from images, sequential data such as text and audio can also be processed with DNNs to reach state-of-the-art performance for document classification and speech recognition. In this article, we study the current state-of-the-art performance of deep learning algorithms for TSC by presenting an empirical study of the most recent DNN architectures for TSC. We give an overview of the most successful deep learning applications in various time series domains under a unified taxonomy of DNNs for TSC. We also provide an open source deep learning framework to the TSC community where we implemented each of the compared approaches and evaluated them on a univariate TSC benchmark (the UCR/UEA archive) and 12 multivariate time series datasets. By training 8730 deep learning models on 97 time series datasets, we propose the most exhaustive study of DNNs for TSC to date.
Article
Full-text available
We provide a trend prediction classification framework named the random sampling method (RSM) for cryptocurrency time series that are non-stationary. This framework is based on deep learning (DL). We compare the performance of our approach to two classical baseline methods in the case of the prediction of unstable Bitcoin prices in the OkCoin market and show that the baseline approaches are easily biased by class imbalance, whereas our model mitigates this problem. We also show that the classification performance of our method expressed as the F-measure substantially exceeds the odds of a uniform random process with three outcomes, proving that extraction of deterministic patterns for trend classification, and hence market prediction, is possible to some degree. The profit rates based on RSM outperformed those based on LSTM, although they did not exceed those of the buy-and-hold strategy within the testing data period, and thus do not provide a basis for algorithmic trading.
Article
The investment potential of the cryptocurrency market is examined through an analysis of the factors that lead to sharp fluctuations and a forecast of its volatility is made with the use of the CRIX index. This index is dynamic and its structure changes every three months, which provides detailed information on the volatility of the cryptocurrency market. The CRIX index includes the most liquid cryptocurrencies, which makes it a representative indicator of the cryptocurrency market as well as a reliable indicator when it comes to making market forecasts in the future. The Autoregressive Integrated Moving Average (ARIMA) model of forecasting dynamic rows used in the research is a barometer of the cryptocurrency market. The research follows the changes in the monthly price of the CRIX index from January 2015 to January 2019.
Conference Paper
In this study, ARIMA Time Series Model and the LSTM Deep Learning Algorithm have been compared to estimate the future price of Bitcoin. The ARIMA model, which is widely used in the prediction of time series, is run in the R programming language. The LSTM model, which is the machine learning algorithm used in predicting the time series, is also built using the Keras framework in Python. The prices of the next 30 days were estimated with the obtained models. The results have been obtained approximately MAPE 11.86% with ARIMA and MAPE 1.40% with LSTM. Also according to other accuracy test results, it is observed that the LSTM Model is more successful when the results are compared.
Article
Following the new advances in encryption and network computing, Bitcoin emerged as a private sector system facilitating peer-to-peer exchange via distributed ledgers based on blockchains, driving a transformational change towards a global economy outside the core financial system. The main purpose of this paper is to examine the multifractal properties of the Bitcoin price using high frequency data. The methods used are the wavelet transform modulus maxima and the multifractal detrended fluctuation analysis. The results indicate that Bitcoin exhibits a large degree of multifractality in all examined time intervals, and the main source of multifractality is attributed to the high kurtosis and the fat distributional tails of the series returns.
Article
We implement deep learning techniques to forecast the price of the three most widely traded digital currencies i.e., Bitcoin, Digital Cash and Ripple. To the best of our knowledge, this is the first work to make use of deep learning in cryptocurrency prediction. The results from testing the existence of nonlinearity revealed that the time series of all digital currencies exhibit fractal dynamics, long memory and self-similarity. The predictability of long-short term memory neural network topologies (LSTM) is significantly higher when compared to the generalized regression neural architecture, set forth as our benchmark system. The latter failed to approximate global nonlinear hidden patterns regardless of the degree of contamination with noise, as they are based on Gaussian kernels suitable only for local approximation of non-stationary signals. Although the computational burden of the LSTM model is higher as opposed to brute force in nonlinear pattern recognition, eventually deep learning was found to be highly efficient in forecasting the inherent chaotic dynamics of cryptocurrency markets.