Articles producció científicaGestió d'Empreses

Data vs. information: Using clustering techniques to enhance stock returns forecasting

  • Identification data

    Identifier:  imarina:9295899
    Authors:  Saenz, JV; Quiroga, FM; Bariviera, AF
    Abstract:
    This paper explores the use of clustering models of stocks to improve both (a) the prediction of stock prices and (b) the returns of trading algorithms. We cluster stocks using k-means and several alternative distance metrics, using as features quarterly financial ratios, prices and daily returns. Then, for each cluster, we train ARIMA and LSTM forecasting models to predict the daily price of each stock in the cluster. Finally, we employ the clustering-empowered forecasting models to analyze the returns of different trading algorithms. We obtain three key results: (i) LSTM models outperform ARIMA and benchmark models, obtaining positive investment returns in several scenarios; (ii) forecasting is improved by using the additional information provided by the clustering methods, therefore selecting relevant data is an important preprocessing task in the forecasting process; (iii) using information from the whole sample of stocks deteriorates the forecasting ability of LSTM models. These results have been validated using data of 240 companies of the Russell 3000 index spanning 2017 to 2022, training and testing with different subperiods.
  • Others:

    APA: Saenz, JV; Quiroga, FM; Bariviera, AF (2023). Data vs. information: Using clustering techniques to enhance stock returns forecasting. International Review of Financial Analysis, 88(), 102657-. DOI: 10.1016/j.irfa.2023.102657
    Paper original source: International Review of Financial Analysis. 88 102657-
    Article's DOI: 10.1016/j.irfa.2023.102657
    Journal publication year: 2023-07-01
    Entity: Universitat Rovira i Virgili
    Paper version: info:eu-repo/semantics/publishedVersion
    Record's date: 2026-05-02
    URV's Author/s: Fernández Bariviera, Aurelio
    Department: Gestió d'Empreses
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication Type: Journal Publications
    Author, as appears in the article.: Saenz, JV; Quiroga, FM; Bariviera, AF
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Thematic Areas: Matemática / probabilidade e estatística, Interdisciplinar, Finance, Engenharias iv, Engenharias iii, Economics and econometrics, Economia, Direito, Ciencias sociales, Business, finance, Administração, ciências contábeis e turismo, Administração pública e de empresas, ciências contábeis e turismo
    Author's mail: aurelio.fernandez@urv.cat, aurelio.fernandez@urv.cat
  • Keywords:

    Trading
    Stock price forecast
    Investment algorithms
    Industry
    innovation and infrastructure
    Financial reports
    Deep learning
    Clustering
    Business
    Finance
    Economics and Econometrics
    Matemática / probabilidade e estatística
    Interdisciplinar
    Engenharias iv
    Engenharias iii
    Economia
    Direito
    Ciencias sociales
    Administração
    ciências contábeis e turismo
    Administração pública e de empresas
  • Documents:

  • Cerca a google

    Search to google scholar