Index Arbitrage and Refresh Time Bias in Covariance Estimation

16 Pages Posted: 15 Jan 2012 Last revised: 23 Jan 2012

Date Written: January 14, 2011


Estimating covariance matrices using high-frequency data is crucial for market makers, investors in newly-issued securities, and risk managers. These estimations often handle the asynchrony of high-frequency trades by using returns for periods between when all instruments have traded (refresh times). We show that index arbitrage trading biases estimates of variances and covariances. The mean reversion of the index arbitrage spread adds a second data generating process which biases variance estimates. That second process creates refresh times simultaneous with trading-induced comovement of index members which bias covariance estimates. Initial results show there is a bias, that removing likely index arbitrage trades yields a lower estimate of covariances, and that estimators may converge sooner using such cleaned data. Our results suggest overestimates of variances and covariances of about 2%-3% -- equivalent to expected returns of 3%-6% higher and implying overly diversified portfolios.

Keywords: high-frequency volatility estimation, refresh times, bias, data cleaning

JEL Classification: C32, C31, C83

Suggested Citation

Rosenthal, Dale W. R. and Zhang, Jin, Index Arbitrage and Refresh Time Bias in Covariance Estimation (January 14, 2011). Available at SSRN: or

Dale W. R. Rosenthal (Contact Author)

Q36 LLC ( email )

332 S. Michigan Ave.
Suite 900
Chicago, IL 60604
United States


Jin Zhang

Bank of America ( email )

NEW YORK, NY 10281
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Abstract Views
PlumX Metrics