Application of Unsupervised Feature Selection, Machine Learning and Evolutionary Algorithm in Predicting Stock Returns: A Study of Indian Firms

The IUP Journal of Financial Risk Management, Vol. XIII, No. 3, pp. 20-46, September 2016

Posted: 12 May 2017

See all articles by Tamal Chaudhuri

Tamal Chaudhuri

Calcutta Business School

Indranil Ghosh

Calcutta Business School

Shahira Eram

Calcutta Business School

Date Written: September 2016

Abstract

Prediction of stock prices has become an important area of research in the field of financial analytics and has garnered a lot of attention among academicians. Drawing on the literature on application of econometric tools and also machine learning techniques, this paper presents a framework for predicting stock returns using three unsupervised feature selection techniques, four predictive modeling techniques and finally an ensemble combining the four predictive modeling techniques. To design the ensemble, evolutionary algorithm is applied. In order to assess the results of our study, four different performance measures, namely, Mean Absolute Error (MAE), Mean Squared Error (MSE), Nash-Sutcliffe Efficiency (NSE) and Index of Agreement (IA) have been utilized. Our feature selection results indicate that all explanatory variables are not significant for different classes of companies and also for different time periods. This gives us insight into the fact that, for stock returns prediction, one has to be careful of the predictors to be chosen. Further, results indicate that for all the forecasting methods, namely, random forest, bagging, boosting and support vector regression, forecasting efficiency for large cap and mid-cap firms was better than that of small cap firms. Statistical analysis through Analysis of Variance (ANOVA) suggests that of all four predictive modeling techniques, boosting was the most efficient technique for forecasting the stock returns. We then proceeded to construct an ensemble of the above four methods. In terms of all four measurement metrics, performance of the proposed ensemble was better in both training and testing phase as compared to the efficiency of the individual predictive modeling techniques.

Suggested Citation

Chaudhuri, Tamal and Ghosh, Indranil and Eram, Shahira, Application of Unsupervised Feature Selection, Machine Learning and Evolutionary Algorithm in Predicting Stock Returns: A Study of Indian Firms (September 2016). The IUP Journal of Financial Risk Management, Vol. XIII, No. 3, pp. 20-46, September 2016, Available at SSRN: https://ssrn.com/abstract=2966591

Tamal Chaudhuri

Calcutta Business School ( email )

Bishnupur
South 24 Parganas
Kolkata, West Bengal 743503
India

Indranil Ghosh (Contact Author)

Calcutta Business School ( email )

Bishnupur
South 24 Parganas
Kolkata, West Bengal 743503
India

Shahira Eram

Calcutta Business School ( email )

Bishnupur
South 24 Parganas
Kolkata, West Bengal 743503
India

Do you have negative results from your research you’d like to share?

Paper statistics

Abstract Views
367
PlumX Metrics