Multi-stage approach with DTW and clustering for forecasting of average deposit rate in Ukraine

Authors

  • D. Krukovets Taras Shevchenko National University of Kyiv

DOI:

https://doi.org/10.17721/1812-5409.2022/4.7

Keywords:

Dynamic Time Warping, ARIMA, Clustering, Random Forest, Deposit rates, Web-scrapping

Abstract

The paper is dedicated to the development of the multi-stage forecasting method that is based on Dynamic Time Warping, Clustering and AutoARIMA techniques, which is compared with several traditional benchmarks on the unique dataset.

The goal is to forecast an average deposit rate in Ukraine using data that has been scrapped from banks' websites about their individual deposit rates on the daily basis. From this rich dataset the paper focuses only on 12-month deposits, UAH, for each bank. Most of the issues that are traditional for web-scraping approach are irrelevant in our case due to the dataset features.

These rates are aggregated into groups by similarity in dynamics, forecasted separately with an AutoARIMA routine and finally aggregated into the entire forecast using weights that have been obtained with an OLS estimation.

The paper presents the result and comparison with several benchmarks, starting from simple Random Walk, a few specifications of ARIMA and simple Random Forest. The multi-stage approach outperforms benchmarks by an RMSE and graphical analysis over the latter period of the data.

Pages of the article in the issue: 55 - 65

Language of the article: English

References

KHALIL, S., FAKIR, M. (2017): RCrawler: An R Package for Parallel Web Crawling and Scraping. "SoftwareX”, Volume 6, pp.98-106.

KHDER, M. (2021): Web Scraping or Web Crawling: State of Art, Techniques, Approaches and Application. International Journal of Advances in Soft Computing and its Applications, 13(3), pp.145-168.

PESARAN, H., SHIN, Y. (1995): An Autoregressive Distributed Lag Modeling Approach to Co-integration Analysis. “In S. Strøm (Ed.), Econometrics and Economic Theory in the 20th Century: The Ragnar Frisch Centennial Symposium (Econometric Society Monographs), Cambridge: Cambridge University Press”, pp. 371-413.

NATIONAL BANK OF UKRAINE (2022): Inflation Report, October 2022.

DSHALALOW, J., WHITE, R. (2021): Current Trends in Random Walks on Random Lattices. “Mathematics”, 9(10), pp. 11-48.

GANDHI, P. (2020): 7 Statistical Tests to validate and help to fit ARIMA model.

ALMASARWEH, M., WADI, S. (2018): ARIMA Model in Predicting Banking Stock Market Data. “Modern Applied Science”, Vol. 12, No. 11.

YOON, J. (2020): Forecasting of Real GDP Growth Using Machine Learning Models: Gradient Boosting and Random Forest Approach. “Computational Economics”, volume 57, pp. 247–265.

BIAU, G. (2012): Analysis of a Random Forests Model. “Journal of Machine Learning”, Volume 13, pp. 1063-1095.

KRUKOVETS, D. (2020): Analysis of similarity between artificially simulated time series with Dynamic Time Warping. “Proceedings of Workshop on Intelligent Information Systems WIIS2020”, pp.97-108.

SALVADOR, S., CHAN, P. (2004): FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space. “Intelligent Data Analysis”, 11(5), pp.70-80.

GIORGINO, T. (2009): Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package. “Journal of Statistical Software”, 31(7), pp. 1–24.

DATORRO, J. (2019): Convex optimization and Euclidean distance geometry.

JIE, C., JIYUE, Z., JUNHUI, W., YUSHENG, W., HUIPING, S., KAIYAN, L. (2020): Review on the Research of K-means Clustering Algorithm in Big Data. “2020 IEEE 3rd International Conference on Electronics and Communication Engineering (ICECE)”, Xi'An, China, 107-111.

HYNDMAN, R., KHANDAKAR, Y. (2008): Automatic Time Series Forecasting: The forecast Package for R. “Journal of Statistical Software”, 27(3), 1-22.

MCANDREW, T., WATTANACHIT, N., GIBSON, GC., REICH, NG. (2021): Aggregating predictions from experts: a review of statistical methods, experiments, and applications. Wiley Interdiscip Rev Comput Stat, 13(2), e1514.

Downloads

Published

2023-04-03

How to Cite

Krukovets, D. (2023). Multi-stage approach with DTW and clustering for forecasting of average deposit rate in Ukraine. Bulletin of Taras Shevchenko National University of Kyiv. Physical and Mathematical Sciences, (4), 55–65. https://doi.org/10.17721/1812-5409.2022/4.7

Issue

Section

Computer Science and Informatics