ECONOMETRIC MODELS VERSUS DEEP LEARNING FOR STOCK RETURN FORECASTING: EVIDENCE FROM THE KAZAKHSTAN STOCK EXCHANGE

СРАВНИТЕЛЬНЫЙ АНАЛИЗ ЭКОНОМЕТРИЧЕСКИХ МОДЕЛЕЙ И МЕТОДОВ ГЛУБОКОГО ОБУЧЕНИЯ ДЛЯ ПРОГНОЗИРОВАНИЯ ДОХОДНОСТИ АКЦИЙ: ДАННЫЕ KASE

Kumarova I.

28.04.2026 103

4(145)

10. Информатика, вычислительная техника и управление

Цитировать:

Kumarova I. ECONOMETRIC MODELS VERSUS DEEP LEARNING FOR STOCK RETURN FORECASTING: EVIDENCE FROM THE KAZAKHSTAN STOCK EXCHANGE // Universum: технические науки : электрон. научн. журн. 2026. 4(145). URL: https://7universum.com/ru/tech/archive/item/22532 (дата обращения: 28.05.2026).

Прочитать статью:

DOI - 10.32743/UniTech.2026.145.4.22532

Статья поступила в редакцию: 31.03.2026

Принята к публикации: 14.04.2026

Опубликована: 28.04.2026

АННОТАЦИЯ

Целью исследовательской работы данной диссертации является сравнительная оценка моделей ARIMA, SARIMAX и LSTM для одношагового прогноза доходности семи акций, обращающихся на Казахстанской фондовой бирже (KASE) в период с 2020 по 2025 год. Методология включает walk-forward валидацию с разделением на тестовую и обучающую выборки в пропорциях 80:20 и девять технических индикаторов в качестве экзогенных регрессоров SARIMAX. Результаты показывают, что SARIMAX достигает наименьших RMSE и MAE для шести из семи активов и наивысшую направленческую точность (среднее DA=0,811) для всех семи. Тестом Диболда-Мариано подтверждается статистическое превосходство SARIMAX над LSTM (p<0,001). Выводы: эконометрические модели с экзогенными переменными формируют точные и легко интерпретируемые прогнозы в развивающихся рынках, где предоставлено очень ограниченное количество данных.

ABSTRACT

This research evaluates ARIMA, SARIMAX, and LSTM models for predicting one-step-ahead smoothed log-returns of seven stocks listed on the Kazakhstan Stock Exchange (KASE) over 2020-2025. The methodology employs walk-forward validation with an 80/20 split; SARIMAX model includes nine technical indicators as exogenous regressors. Results show that SARIMAX achieves the lowest RMSE and MAE for six of seven equities and the highest directional accuracy (mean DA = 0.811) for all seven. The Diebold-Mariano test validates the statistical advantage of SARIMAX compared to LSTM (p < 0.001). These results indicate that econometric models enhanced with structured exogenous variables deliver precise and comprehensible forecasts in data-limited emerging markets.

Ключевые слова: прогнозирование фондового рынка, ARIMA, SARIMAX, LSTM, направленческая точность, KASE, развивающиеся рынки.

Keywords: stock market forecasting, ARIMA, SARIMAX, LSTM, directional accuracy, KASE, emerging markets.

Введение

Forecasting equity returns remains one of the most challenging tasks in financial time-series analysis. Stock prices exhibit non-stationarity, volatility clustering, structural breaks, and characteristically low signal-to-noise ratios, all of which render accurate prediction exceptionally difficult [15]. Despite decades of research, no universally superior forecasting model has emerged-particularly for emerging markets, where lower liquidity, thinner order books, and idiosyncratic market microstructure further complicate the prediction problem [5], [12].

ARIMA models constitute the traditional baseline for financial time-series forecasting. However, their purely autoregressive structure constrains them to exploiting only the internal temporal dependence of the target series, potentially missing predictive signals embedded in complementary market indicators. In practice, equity returns are influenced by momentum, trend strength, volatility regimes, and trading activity-factors that can be quantified through well-established technical indicators and incorporated as exogenous regressors within the SARIMAX framework [21], [23].

Concurrently, deep learning architectures-particularly Long Short-Term Memory (LSTM) networks-have attracted substantial attention owing to their capacity to model complex nonlinear temporal dependencies [11], [20]. Nevertheless, the empirical evidence on their superiority over classical statistical models is mixed. Deep learning models typically require large datasets, substantial computational resources, and extensive hyperparameter tuning; in data-scarce environments, they may overfit or fail to generalize [12], [16]. Moreover, the majority of comparative studies focus exclusively on developed market equities, employ standard accuracy metrics without formal statistical testing, and neglect directional accuracy-a metric with direct relevance to trading signal generation.

A critical gap persists in the literature: few studies systematically compare econometric and deep learning models for emerging market equities under a consistent methodological framework that includes walk-forward validation, exogenous feature engineering, and rigorous statistical significance testing. In particular, the Kazakhstan Stock Exchange (KASE) has received virtually no attention in the forecasting competition literature, despite its growing importance as Central Asia’s principal equity market.

The goal of this study is to conduct a comprehensive comparative evaluation of ARIMA, SARIMAX, and LSTM for one-step-ahead prediction of smoothed log-returns for seven major KASE equities over the period 2020-2025. The specific objectives are: (1) to apply a strict walk-forward validation protocol with a time-ordered 80/20 split; (2) to evaluate forecasts using RMSE, MAE, and Directional Accuracy; (3) to test statistical significance via the Diebold-Mariano test. The following hypotheses are tested:

H1: The SARIMAX model achieves statistically significantly lower forecast errors (RMSE, MAE) than the ARIMA and LSTM models.

H2: The SARIMAX model demonstrates higher Directional Accuracy than the ARIMA and LSTM models across all examined equities.

H3: The observed performance differences between SARIMAX and LSTM are statistically significant as confirmed by the Diebold-Mariano test.

Financial time-series forecasting has evolved substantially over the past three decades. The earliest approaches relied on autoregressive stochastic processes-AR, ARMA, and ARIMA models [1], [21], [23]. However, financial time series frequently exhibit nonlinear dynamics, volatility clustering, and structural breaks [9], [15]. Nonlinear extensions have achieved improvements [5], [13], yet remain constrained by rapid regime transitions. Decomposition and hybrid models combining EMD with autoregressive components have outperformed standalone linear models [4], [6-7]. Machine learning algorithms-tree-based ensembles, boosting, and SVMs-have expanded the toolkit [2], [14], [16], though overfitting remains a concern [2], [7], [9], [15]. Deep learning, particularly LSTM networks, has been widely applied [3], [5], [8-10], but imposes substantial data and computational requirements that may not be met in emerging markets [12], [16]. This study addresses these gaps with a rigorous comparison under a unified walk-forward framework, focusing on KASE-an emerging market that has received negligible attention in the literature.

Materials and Methods

This study uses daily historical market data for seven equities listed on KASE: Air Astana (AIRA), Bank TsentrKredit (CCBN), Kaspi.kz (KSPI), Kazakhstanskaya Kompaniya (KEGC), Kazakhtelekom (KZTK), KazMunayGaz (KMGZ), and KazTransOil (KZTO). The sample period spans 2020-2025. Each dataset includes Date, Open, High, Low, Close, and Volume. After cleaning and feature construction, the final dataset comprises 8,090 observations. A time-ordered 80/20 split yields 1,620 out-of-sample test observations [5].

Table 1

Sample of raw KASE price data

Date	Ticker	Open	High	Low	Close	Volume
2024-01-03	AIRA	1104.90	1115.00	1083.75	1111.00	125,410
2020-01-06	CCBN	236.00	236.00	236.00	237.82	310
2020-01-04	KEGC	1650.00	1655.39	1639.00	1640.00	110
2022-09-12	KMGZ	8877.00	9048.00	8657.00	9030.00	261,500
2021-01-09	KSPI	49108.74	49108.74	48830.00	48830.00	20

Log-returns are computed as . The forecasting target is the three-period smoothed log-return Nine technical indicators serve as exogenous features: log_return, SMA_5, SMA_10, EMA_10, volatility_10, RSI_14, MACD, MACD_signal, and Volume_change. Standardization uses a StandardScaler fitted on the training segment only.

Figure 1. Methodology Pipeline

ARIMA serves as the univariate baseline with order (2, 0, 4) selected via AIC grid search. SARIMAX extends ARIMA with the nine technical indicators as exogenous regressors (). The LSTM architecture consists of two stacked layers (64 and 32 units), Dropout (0.2), Adam optimizer (lr = 5×10⁻⁴), Huber loss (δ = 0.5), up to 50 epochs with early stopping (patience = 10), and a rolling window of w = 40 days.

A one-step-ahead walk-forward protocol is adopted: ARIMA and SARIMAX are refit at each step; LSTM is trained once. Performance is measured by RMSE, MAE, and Directional Accuracy (DA). The Diebold-Mariano (DM) test with Newey-West HAC standard errors and Harvey-Leybourne-Newbold correction tests statistical significance of forecast differences.

Results and Discussion

Tables 2 and 3 present out-of-sample RMSE and MAE. SARIMAX achieves the lowest values for six of seven equities; ARIMA ranks first only for CCBN. Mean RMSE: ARIMA 0.0233, SARIMAX 0.0168, LSTM 0.0347. Mean MAE: ARIMA 0.0147, SARIMAX 0.0116, LSTM 0.0252. SARIMAX reduces mean RMSE by 28% relative to ARIMA and 52% relative to LSTM.

Table 2.

Out-of-sample RMSE (one-step-ahead, walk-forward)

Ticker	ARIMA	SARIMAX	LSTM
AIRA	0.0170	0.0085	0.0258
CCBN	0.0521	0.0525	0.0770
KSPI	0.0219	0.0126	0.0285
KEGC	0.0059	0.0030	0.0071
KZTK	0.0258	0.0200	0.0349
KMGZ	0.0324	0.0168	0.0588
KZTO	0.0079	0.0040	0.0108
Mean	0.0233	0.0168	0.0347

Table 3.

Out-of-sample MAE (one-step-ahead, walk-forward)

Ticker	ARIMA	SARIMAX	LSTM
AIRA	0.0125	0.0065	0.0215
CCBN	0.0294	0.0350	0.0574
KSPI	0.0141	0.0094	0.0215
KEGC	0.0041	0.0020	0.0052
KZTK	0.0162	0.0129	0.0242
KMGZ	0.0213	0.0122	0.0393
KZTO	0.0054	0.0031	0.0075
Mean	0.0147	0.0116	0.0252

Название: Figure 2. Actual vs Predicted Values for all seven KASE equities - описание: Figure 2. Actual vs Predicted Values for all seven KASE equities

Figure 2. Actual vs Predicted Values for all seven KASE equities

Название: Figure 3. Model Performance Metrics: RMSE, MAE, and Directional Accuracy - описание: Figure 3. Model Performance Metrics: RMSE, MAE, and Directional Accuracy

Figure 3. Model Performance Metrics: RMSE, MAE, and Directional Accuracy

Table 4 presents Directional Accuracy. SARIMAX attains the highest DA for all seven equities (0.774-0.846). Mean DA: ARIMA 0.694, SARIMAX 0.811, LSTM 0.565. The SARIMAX mean DA of 0.811 indicates ~81% of next-day return signs are correctly predicted.

Table 4.

Directional Accuracy (DA)

Ticker	ARIMA	SARIMAX	LSTM
AIRA	0.703	0.846	0.505
CCBN	0.753	0.774	0.630
KSPI	0.657	0.819	0.565
KEGC	0.661	0.842	0.562
KZTK	0.628	0.814	0.562
KMGZ	0.736	0.804	0.595
KZTO	0.722	0.780	0.533

Название: Figure 4. RMSE Heatmap by Ticker and Model - описание: Figure 4. RMSE Heatmap by Ticker and Model

Figure 4. RMSE Heatmap by Ticker and Model

Название: Figure 5. Directional Accuracy Heatmap by Ticker and Model - описание: Figure 5. Directional Accuracy Heatmap by Ticker and Model

Figure 5. Directional Accuracy Heatmap by Ticker and Model

The Diebold-Mariano test (Table 5) rejects equal predictive accuracy for all seven equities (p < 0.001). All DM statistics are negative (-9.679 to -5.167), confirming SARIMAX incurs lower squared-error loss than LSTM.

Table 5.

Diebold-Mariano test: SARIMAX vs. LSTM

Ticker	DM Statistic	p-value
AIRA	-7.230	< 0.001
CCBN	-5.180	< 0.001
KSPI	-7.597	< 0.001
KEGC	-9.679	< 0.001
KZTK	-5.167	< 0.001
KMGZ	-6.704	< 0.001
KZTO	-7.222	< 0.001

SARIMAX surprasses ARIMA by using exogenous regressors to capture trend, momentum, volatility, and volume dynamics. The only exception is CCBN (SARIMAX RMSE 0.0525 vs. ARIMA 0.0521), which is probably due to extremely low liquidity. SARIMAX’s DA dominance (mean 0.811 vs. 0.565 for LSTM) indicates robust directional forecasting skill, while LSTM’s DA slightly exceeds the random baseline of 0.50.

Several factors explain LSTM’s underperformance: (1) limited data (~1,500 observations per equity); (2) non-stationarity and regime shifts including the COVID-19 disruption; (3) asymmetric retraining-econometric models are refit at each step while LSTM is trained once; (4) low signal-to-noise ratio in daily returns. These findings are consistent with the broader literature suggesting model complexity should match data volume [2], [8].

Limitations include: DM test applied only with squared-error loss; asymmetric retraining protocol favoring SARIMAX; absence of strict close-to-close temporal alignment; non-exhaustive LSTM hyperparameter search; and exclusion of macroeconomic variables and transaction costs.

Conclusion

This study demonstrates that SARIMAX achieves the best out-of-sample accuracy for six of seven KASE equities (H1), the highest directional accuracy for all seven (H2, mean DA = 0.811), and statistically significant superiority over LSTM confirmed by the Diebold-Mariano test for all equities at p < 0.001 (H3). Mean RMSE is reduced by 28% relative to ARIMA and 52% relative to LSTM.

The inclusion of structured exogenous regressors provides substantial incremental predictive value beyond univariate autoregressive models. From a practical standpoint, parsimonious econometric models augmented with technical indicators offer a viable, interpretable, and computationally efficient alternative to deep learning in data-constrained emerging markets.

Future research should implement expanding-window retraining for LSTM, systematic Bayesian hyperparameter optimization, strict close-to-close temporal alignment, macroeconomic and commodity feature expansion, and formal trading simulation with transaction costs and risk-adjusted returns.

References:

Adebiyi, A.A. Stock price prediction using the ARIMA model / A.A. Adebiyi, A.O. Adewumi, C.K. Ayo // Proceedings UKSim 2014. - 2014. - P. 106-112.
Akhtar, M.M. Stock market prediction based on statistical data using machine learning algorithms / M.M. Akhtar, A.S. Zamani, S. Khan [et al.] // Journal of King Saud University - Science. - 2022. - Vol. 34.
Bustos, O. Machine learning, stock market forecasting, and market efficiency: a comparative study / O. Bustos, A. Pomares-Quimbaya, R. Stellian // International Journal of Data Science and Analytics. - 2025. - Vol. 20. - P. 6815-6839.
Casolaro, A. Deep Learning for Time Series Forecasting: Advances and Open Problems / A. Casolaro, V. Capone, G. Iannuzzo, F. Camastra // Information. - 2023. - Vol. 14.
Chatzis, S.P. Forecasting stock market crisis events using deep and statistical machine learning techniques / S.P. Chatzis, V. Siakoulis, A. Petropoulos [et al.] // Expert Systems with Applications. - 2018. - Vol. 112. - P. 353-371.
Çeli̇k, T.B. Extending machine learning prediction capabilities by explainable AI in financial time series prediction / T.B. Çeli̇k, Ö. İcan, E. Bulut // Applied Soft Computing. - 2023. - Vol. 132.
Cheng, D. Financial time series forecasting with multi-modality graph neural network / D. Cheng, F. Yang, S. Xiang, J. Liu // Pattern Recognition. - 2022. - Vol. 121.
Fathali, Z. Stock Market Prediction of NIFTY 50 Index Applying Machine Learning Techniques / Z. Fathali, Z. Kodia, L. Ben Said // Applied Artificial Intelligence. - 2022. - Vol. 36.
Ghosh, A. Stock price prediction using LSTM on Indian share market / A. Ghosh, S.B.G.M.N.D.S. Sen // Int. Conf. Comput. Appl. Ind. Eng. - 2019. - Vol. 63. - P. 101-110.
González-Núñez, E. Expanding a machine learning class towards its application to the stock market forecast / E. González-Núñez, L.A. Trejo, M. Kampouridis // Applied Intelligence. - 2025. - Vol. 55.
Gumelar, A.B. Boosting the Accuracy of Stock Market Prediction using XGBoost and Long Short-Term Memory / A.B. Gumelar, S. Nilowardono, D.P. Adi, H. Setyorini // Proc. iSemantic 2020. - 2020.
Henrique, B.M. Machine learning techniques applied to financial market prediction. Literature review / B.M. Henrique, V.A. Sobreiro, H. Kimura // Expert Systems with Applications. - 2019. - Vol. 124. - P. 226-251.
Hsu, T.-Y. Machine learning applied to stock index performance enhancement / T.-Y. Hsu // Journal of Banking and Financial Technology. - 2021. - Vol. 5. - P. 21-33.
Huang, J. Deep learning in finance and banking: A literature review and classification / J. Huang, J. Chai, S. Cho // Frontiers of Business Research in China. - 2020. - Vol. 14.
Lu, W. A CNN-LSTM-based model to forecast stock prices / W. Lu, J. Li, Y. Li, A. Sun, J. Wang // Complexity. - 2020.
Mukherjee, S. Stock market prediction using deep learning algorithms / S. Mukherjee, B. Sadhukhan, N. Sarkar [et al.] // CAAI Transactions on Intelligence Technology. - 2023. - Vol. 8. - P. 82-94.
Nazareth, N. Financial applications of machine learning: A literature review / N. Nazareth, Y.V. Ramana Reddy // Expert Systems with Applications. - 2023. - Vol. 219.
Phuoc, T. Applying machine learning algorithms to predict the stock price trend in the stock market / T. Phuoc, P.T.K. Anh, P.H. Tam, C.V. Nguyen // Humanities and Social Sciences Communications. - 2024. - Vol. 11.
Rodriguez, F.S. A machine learning approach to predict the S&P 500 absolute percent change / F.S. Rodriguez, P. Norouzzadeh, Z. Anwar [et al.] // Discover Artificial Intelligence. - 2024. - Vol. 4.
Sarika, V. An LSTM-Based Model for Stock Price Prediction / V. Sarika, G.V.S. Kamal, S.V. Pratham [et al.] // 2023 AICERA/ICIS. - 2023.
Taylor, M.P. The use of technical analysis in the foreign exchange market / M.P. Taylor, H. Allen // Journal of International Money and Finance. - 1992. - Vol. 11. - P. 304-314.
Wu, J.M.T. A graph-based CNN-LSTM stock price prediction algorithm with leading indicators / J.M.T. Wu, Z. Li, N. Herencsar [et al.] // Multimedia Systems. - 2021. - Vol. 29. - P. 1751-1770.
Zhong, X. Forecasting daily stock market return using dimensionality reduction / X. Zhong, D. Enke // Expert Systems with Applications. - 2017. - Vol. 67. - P. 126-139.