Multi-stage approach with DTW and clustering for forecasting of average deposit rate in Ukraine
DOI:
https://doi.org/10.17721/1812-5409.2022/4.7Keywords:
Dynamic Time Warping, ARIMA, Clustering, Random Forest, Deposit rates, Web-scrappingAbstract
The paper is dedicated to the development of the multi-stage forecasting method that is based on Dynamic Time Warping, Clustering and AutoARIMA techniques, which is compared with several traditional benchmarks on the unique dataset.
The goal is to forecast an average deposit rate in Ukraine using data that has been scrapped from banks' websites about their individual deposit rates on the daily basis. From this rich dataset the paper focuses only on 12-month deposits, UAH, for each bank. Most of the issues that are traditional for web-scraping approach are irrelevant in our case due to the dataset features.
These rates are aggregated into groups by similarity in dynamics, forecasted separately with an AutoARIMA routine and finally aggregated into the entire forecast using weights that have been obtained with an OLS estimation.
The paper presents the result and comparison with several benchmarks, starting from simple Random Walk, a few specifications of ARIMA and simple Random Forest. The multi-stage approach outperforms benchmarks by an RMSE and graphical analysis over the latter period of the data.
Pages of the article in the issue: 55 - 65
Language of the article: English
References
KHALIL, S., FAKIR, M. (2017): RCrawler: An R Package for Parallel Web Crawling and Scraping. "SoftwareX”, Volume 6, pp.98-106.
KHDER, M. (2021): Web Scraping or Web Crawling: State of Art, Techniques, Approaches and Application. International Journal of Advances in Soft Computing and its Applications, 13(3), pp.145-168.
PESARAN, H., SHIN, Y. (1995): An Autoregressive Distributed Lag Modeling Approach to Co-integration Analysis. “In S. Strøm (Ed.), Econometrics and Economic Theory in the 20th Century: The Ragnar Frisch Centennial Symposium (Econometric Society Monographs), Cambridge: Cambridge University Press”, pp. 371-413.
NATIONAL BANK OF UKRAINE (2022): Inflation Report, October 2022.
DSHALALOW, J., WHITE, R. (2021): Current Trends in Random Walks on Random Lattices. “Mathematics”, 9(10), pp. 11-48.
GANDHI, P. (2020): 7 Statistical Tests to validate and help to fit ARIMA model.
ALMASARWEH, M., WADI, S. (2018): ARIMA Model in Predicting Banking Stock Market Data. “Modern Applied Science”, Vol. 12, No. 11.
YOON, J. (2020): Forecasting of Real GDP Growth Using Machine Learning Models: Gradient Boosting and Random Forest Approach. “Computational Economics”, volume 57, pp. 247–265.
BIAU, G. (2012): Analysis of a Random Forests Model. “Journal of Machine Learning”, Volume 13, pp. 1063-1095.
KRUKOVETS, D. (2020): Analysis of similarity between artificially simulated time series with Dynamic Time Warping. “Proceedings of Workshop on Intelligent Information Systems WIIS2020”, pp.97-108.
SALVADOR, S., CHAN, P. (2004): FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space. “Intelligent Data Analysis”, 11(5), pp.70-80.
GIORGINO, T. (2009): Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package. “Journal of Statistical Software”, 31(7), pp. 1–24.
DATORRO, J. (2019): Convex optimization and Euclidean distance geometry.
JIE, C., JIYUE, Z., JUNHUI, W., YUSHENG, W., HUIPING, S., KAIYAN, L. (2020): Review on the Research of K-means Clustering Algorithm in Big Data. “2020 IEEE 3rd International Conference on Electronics and Communication Engineering (ICECE)”, Xi'An, China, 107-111.
HYNDMAN, R., KHANDAKAR, Y. (2008): Automatic Time Series Forecasting: The forecast Package for R. “Journal of Statistical Software”, 27(3), 1-22.
MCANDREW, T., WATTANACHIT, N., GIBSON, GC., REICH, NG. (2021): Aggregating predictions from experts: a review of statistical methods, experiments, and applications. Wiley Interdiscip Rev Comput Stat, 13(2), e1514.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 The Author(s)

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).