An Automation Algorithm for Harvesting Capital Market Information from the Web

Managerial Finance, Vol. 35 No. 5, 2009

12 Pages Posted: 22 Jun 2015 Last revised: 9 Aug 2015

Date Written: March 21, 2009

Abstract

The purpose of this paper is to develop an algorithm to harvest user specified information on finance portals and compile it into machine‐readable datasets for quantitative analysis. The Visual Basic macro language in Microsoft Excel is applied to develop code that is not constrained by the single‐query function of Excel. The core of the algorithm is built around the splitting of the URL connector line and the placement of a continuously updating variable into which are looped as many tickers as there are in the input list. The output is then written to non‐overlapping cells. Numerical information placed on major finance websites can be harvested into structured machine‐readable datasets by applying this algorithm. This has been implemented in the Returnfinder App, which produces Total Return charts that include dividends.The algorithm extends user accessibility to websites that do not provide the facility of simultaneous downloading of information on multiple stock tickers. Furthermore, the procedure automates the downloading of multiple pieces of information (fields) and entire tables per ticker (record).

Keywords: web retrieval, harvesting algorithm program, download stock market data from the web, total return charts, total return dividend charts, capital market information

JEL Classification: G, C, Z

Suggested Citation

Agrrawal, Pankaj, An Automation Algorithm for Harvesting Capital Market Information from the Web (March 21, 2009). Managerial Finance, Vol. 35 No. 5, 2009, Available at SSRN: https://ssrn.com/abstract=2621156

Pankaj Agrrawal (Contact Author)

University of Maine ( email )

Orono, ME 04469
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
121
Abstract Views
1,483
Rank
433,736
PlumX Metrics