Public User

# Some Important Statistical Issues Courts Should Consider in Their Assessment of Statistical Analyses Submitted in Class Certification Motions: Implications for Dukes v. Wal-Mart

44 Pages Posted: 20 Mar 2011

## Joseph L. Gastwirth

George Washington University - Columbian College of Arts and Sciences

## Efstathia Bura

affiliation not provided to SSRN

## Weiwen Miao

Haverford College - Math Department

Date Written: March 16, 2011

### Abstract

When many individual plaintiffs have similar claims against the same defendant, often it is more efficient for them to be combined into a single class action. Due to their increased complexity and larger stakes, in the United States there are special criteria a party seeking to proceed as a class action needs to satisfy. Statistical evidence is often submitted to establish that the members of the proposed class were affected by a common event or policy. In equal employment cases involving an employer with a number of locations or sub-units, defendants may argue that the data should be examined separately for each unit while plaintiffs may pool the data into one or several large samples or focus on a few units in which statistical significance was observed. After describing the statistical issues involved, it will be seen that requiring plaintiffs to demonstrate a statistically significant disparity in a pre-set fraction, e.g. majority of the sub-units is too stringent as the power of the statistical test to detect a meaningful disparity in most sub-units is too small. On the other hand, when many statistical tests are calculated on data from a fair system, a small percentage of significant disparities will be obtained. Thus, allowing a class action to proceed if the plaintiffs can demonstrate a statistically significant difference in a few sub-units is too lax. The use of established methods for combining statistical tests for data organized by appropriate subgroups will be illustrated on data from two recent cases. Using the concept of power, the expected number, E, of sub-units in which a statistically significant result would occur if there were a legally meaningful disparity can be determined. Then the observed number, O, of units with a significant disparity can be compared to E, to see whether data are consistent with a pattern, O close to E, indicating unfairness or O clearly less than E, reflecting fairness. Without such a comparison, the number of units with a statistically significant disparity is not meaningful. Both parties in Dukes v. Wal-Mart introduced summaries of the p-values of many individual statistical tests that grouped them into a small number of categories. An appropriate overall procedure combines them into a single summary statistic. This analysis shows that the promotion data for the 40 or 41 regions in the Wal-Mart case are consistent with an overall system in which the odds an eligible female had of being promoted were about 70 to 80 percent of those of a male. A similar analysis of the p-values of Wal-Martâ€™s sub-unit regressions also is consistent with a general pattern of a degree of underpayment of female employees relative to that of similarly qualified males.

Keywords: class action, class certification, combination methods, discrimination, Dukes v. Wal-mart, equal employment, power of statistical tests, statistical proof, stratified data, trend tests

JEL Classification: C12, C13, J71, K31

Suggested Citation

Gastwirth, Joseph L. and Bura, Efstathia and Miao, Weiwen, Some Important Statistical Issues Courts Should Consider in Their Assessment of Statistical Analyses Submitted in Class Certification Motions: Implications for Dukes v. Wal-Mart (March 16, 2011). Available at SSRN: https://ssrn.com/abstract=1788331 or http://dx.doi.org/10.2139/ssrn.1788331