Explaining Rare Events in International Relations
International Organization, Vol. 55, No. 3, pp. 693-715, Summer 2001
23 pages. Posted: 17 Jan 2008
Abstract
Some of the most important phenomena in international conflict are coded as "rare events data," binary dependent variables with dozens to thousands of times fewer events, such as wars, coups, etc., than "nonevents." Unfortunately, rare events data are difficult to explain and predict, a problem that seems to have at least two sources. First, and most importantly, the data collection strategies used in international conflict are grossly inefficient. The fear of collecting data with too few events has led to data collections with huge numbers of observations but relatively few, and poorly measured, explanatory variables. As it turns out, more efficient sampling designs exist for making valid inferences, such as sampling all available events (e.g., wars) and a tiny fraction of nonevents (peace). This enables scholars to save as much as 99% of their (non-fixed) data collection costs, or to collect much more meaningful explanatory variables. Second, logistic regression, and other commonly used statistical procedures, can underestimate the probability of rare events. We introduce some corrections that outperform existing methods and change the estimates of absolute and relative risks by as much as some estimated effects reported in the literature. We also provide easy-to-use methods and software that link these two results, enabling both types of corrections to work simultaneously.
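The abstract does not spell out its corrections, so the following is only a minimal sketch of one standard adjustment for a sample that keeps all events and a small fraction of nonevents: the prior correction of the logit intercept. The population size, event count, and sampling design are hypothetical numbers chosen for illustration, and the function names are not taken from the authors' software. The sketch covers only the sampling-design adjustment, not the separate correction for underestimated rare event probabilities.

```python
import math


def logit(p):
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))


def inv_logit(x):
    """Probability corresponding to log-odds x."""
    return 1.0 / (1.0 + math.exp(-x))


def prior_correct_intercept(b0_hat, tau, ybar):
    """Adjust a logit intercept estimated on a sample that over-represents events.

    tau  : fraction of events in the population (assumed known)
    ybar : fraction of events in the collected sample
    """
    return b0_hat - math.log(((1.0 - tau) / tau) * (ybar / (1.0 - ybar)))


# Hypothetical design: 1,000 events among 1,000,000 observations.
population = 1_000_000
events = 1_000
tau = events / population

# Keep every event and an equal number of randomly drawn nonevents.
sampled_nonevents = 1_000
sample_size = events + sampled_nonevents
ybar = events / sample_size

# With no explanatory variables, the logit MLE of the intercept is logit(ybar).
b0_hat = logit(ybar)
b0_corrected = prior_correct_intercept(b0_hat, tau, ybar)

p_naive = inv_logit(b0_hat)            # reflects the sample, not the population
p_corrected = inv_logit(b0_corrected)  # recovers the population event rate
fraction_collected = sample_size / population

print(f"naive P(event)     = {p_naive:.4f}")
print(f"corrected P(event) = {p_corrected:.4f}")
print(f"share of observations collected = {fraction_collected:.3%}")
```

In this intercept-only case the corrected probability equals the population event rate, while the uncorrected estimate simply mirrors the 50/50 composition of the sample. With explanatory variables the same adjustment would apply to the intercept only, and it relies on tau being known from outside the sample.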