1932

Abstract

We survey the weak instrumental variables (IV) literature with the aim of giving simple advice to applied researchers. This literature focuses heavily on the problem of size inflation in two-stage least squares (2SLS) two-tailed -tests that arises if instruments are weak. A common standard for acceptable instrument strength is a first-stage of 10, which renders this size inflation modest. However, 2SLS suffers from other important problems that exist at much higher levels of instrument strength. In particular, 2SLS standard errors tend to be artificially small in samples where the 2SLS estimate is close to ordinary least squares (OLS). This power asymmetry means the -test has inflated power to detect false positive effects when the OLS bias is positive. The Anderson-Rubin (AR) test avoids this problem and should be used in lieu of the -test even with strong instruments. We illustrate the practical importance of this issue in IV papers published in the from 2011 to 2023. Use of the AR test often reverses -test results. In particular, IV estimates that are close to OLS and significant according to the -test are often insignificant according to AR. We also show that for first-stage in the 10–20 range there is a high probability that OLS estimates will be closer to the truth than 2SLS. Hence we advocate a higher standard of instrument strength in applied work.

Loading

Article metrics loading...

/content/journals/10.1146/annurev-economics-092123-111021
2024-04-26
2024-05-12
Loading full text...

Full text loading...

/content/journals/10.1146/annurev-economics-092123-111021
Loading

Supplemental Material

Supplemental Material

  • Article Type: Review Article
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error