Instrumental Variables Calculate LATE by Hand
Use this premium Wald estimator calculator to compute the Local Average Treatment Effect, also called LATE, from instrument level treatment rates and average outcomes. Enter the observed means for the encouraged group and the non-encouraged group, then compare the reduced form, first stage, and implied IV estimate instantly.
LATE Calculator
This tool implements the basic hand calculation for a binary instrument: LATE = difference in outcomes by instrument / difference in treatment take-up by instrument.
How to calculate LATE by hand using instrumental variables
When researchers talk about instrumental variables, they are often trying to estimate a causal effect in settings where treatment is not randomly assigned. People self select, physicians choose different treatment plans, schools admit students using multiple criteria, and firms adopt policies at different times. In these situations, a simple comparison between treated and untreated groups can be biased. Instrumental variables, often shortened to IV, offer a way to recover a causal effect if you have a variable that shifts treatment exposure without directly changing the outcome except through treatment.
The most common hand calculation in a binary instrument setting is the Wald estimator. This estimator is especially useful for teaching, checking software output, and understanding what your IV model is really doing. If the instrument takes values 0 and 1, the Local Average Treatment Effect can be computed as:
Here, Y is the outcome, D is the treatment indicator, and Z is the instrument. The numerator is the effect of the instrument on the outcome. The denominator is the effect of the instrument on treatment take up. Their ratio is the IV estimate. In the standard binary instrument case, this ratio is interpreted as the causal effect of treatment for compliers, meaning people whose treatment status changes when the instrument changes.
What LATE means in plain language
LATE does not usually estimate the treatment effect for everyone in the population. Instead, it estimates the effect for the subgroup that actually responds to the instrument. Suppose the instrument is an encouragement letter and the treatment is program participation. Some people would participate no matter what, some would never participate, and some participate only if encouraged. LATE targets that third group. This is one reason instrumental variables can be so powerful and also why interpretation matters.
- Always takers: treated regardless of the instrument.
- Never takers: untreated regardless of the instrument.
- Compliers: treatment status changes with the instrument.
- Defiers: treatment moves opposite the instrument. Standard IV generally rules these out through monotonicity.
The four assumptions behind the hand calculation
Before you trust a hand calculated LATE, you need to understand the assumptions. These are not optional details. They are the reason the ratio has a causal interpretation rather than being just a descriptive statistic.
- Relevance: the instrument must change treatment take up. In practical terms, the denominator cannot be zero, and it should be meaningfully different from zero. Weak instruments produce unstable estimates.
- Independence: the instrument must be as good as randomly assigned, or at least independent of unobserved determinants of the outcome after conditioning on controls in more advanced settings.
- Exclusion restriction: the instrument affects the outcome only through treatment. If the instrument changes the outcome directly, the ratio is biased.
- Monotonicity: there are no defiers. The instrument should not make some people less likely to take treatment while making others more likely in a way that violates the standard compliance framework.
If these assumptions hold, the Wald estimator has a clean causal interpretation. If one or more fail, the number can still be computed, but it should not be interpreted as a valid causal effect.
Step by step: calculate LATE by hand
Imagine you have two groups defined by a binary instrument. Group Z = 1 is encouraged, and group Z = 0 is not encouraged. For each group, you observe the average outcome and the treatment rate.
- Find the mean outcome in the Z = 1 group.
- Find the mean outcome in the Z = 0 group.
- Subtract them to get the reduced form.
- Find the treatment rate in the Z = 1 group.
- Find the treatment rate in the Z = 0 group.
- Subtract them to get the first stage.
- Divide the reduced form by the first stage.
Using the calculator defaults above:
- Outcome mean when Z = 1: 72
- Outcome mean when Z = 0: 68
- Treatment rate when Z = 1: 0.60
- Treatment rate when Z = 0: 0.20
The reduced form is 72 – 68 = 4. The first stage is 0.60 – 0.20 = 0.40. Therefore:
This means the estimated causal effect of treatment for compliers is 10 outcome units. If Y were annual test score points, that would mean treatment raises scores by 10 points for those whose treatment status is changed by the instrument.
Why the first stage matters so much
The denominator is not just a scaling term. It tells you how much the instrument actually moves treatment. When the first stage is tiny, the instrument is weak. In those cases, the Wald ratio can become very large in absolute value simply because you are dividing by a small number. This makes hand calculations especially useful because they force you to inspect the underlying ingredients before relying on a polished regression output.
| Scenario | Reduced Form | First Stage | Implied LATE | Interpretation |
|---|---|---|---|---|
| Strong instrument example | 0.050 | 0.250 | 0.200 | A 20 percentage point causal effect for compliers if assumptions hold. |
| Moderate instrument example | 0.030 | 0.100 | 0.300 | Still interpretable, but more sensitive to sampling noise. |
| Weak instrument example | 0.010 | 0.020 | 0.500 | Large ratio produced by a tiny first stage. Proceed carefully. |
Real world intuition with common empirical settings
Instrumental variables appear across economics, epidemiology, education, and public policy. A classic setup uses random encouragements or eligibility rules as instruments. In health research, physician prescribing preference or distance to a provider can sometimes serve as instruments, though the exclusion restriction must be defended carefully. In labor economics, quarter of birth and draft eligibility have historically been studied as instruments because they shift exposure to schooling or military service.
The idea is always the same: use a source of variation that changes treatment receipt while remaining plausibly unrelated to unobserved outcome determinants. That gives you a clean first stage. Then compare outcomes across instrument levels to get the reduced form. The ratio of those two pieces is the hand calculated IV estimate.
Binary outcomes and percentage point interpretation
If your outcome is binary, such as employment, hospitalization, graduation, or mortality, then the same formula applies. In that case the outcome means are proportions rather than continuous averages. The reduced form becomes a difference in outcome rates across instrument groups. Dividing by the difference in treatment rates gives a LATE that is often interpreted in percentage points.
For example, if mortality is 0.12 in the encouraged group and 0.15 in the non-encouraged group, the reduced form is -0.03. If treatment take up rises from 0.30 to 0.55, then the first stage is 0.25. The implied LATE is -0.12. That means treatment lowers mortality by 12 percentage points for compliers, under the standard IV assumptions.
| Empirical quantity | Notation | Example value | How to read it |
|---|---|---|---|
| Outcome mean under Z = 1 | E(Y|Z=1) | 0.12 | 12 percent mortality in the instrument = 1 group |
| Outcome mean under Z = 0 | E(Y|Z=0) | 0.15 | 15 percent mortality in the instrument = 0 group |
| Treatment rate under Z = 1 | E(D|Z=1) | 0.55 | 55 percent treated in the instrument = 1 group |
| Treatment rate under Z = 0 | E(D|Z=0) | 0.30 | 30 percent treated in the instrument = 0 group |
| Wald estimate | RF / FS | -0.12 | Treatment reduces mortality by 12 percentage points for compliers |
How this relates to two stage least squares
In a simple binary instrument and binary treatment setting without covariates, the hand calculated Wald estimator and the 2SLS coefficient are the same number. Two stage least squares becomes more useful when you have controls, multiple instruments, continuous treatments, or clustered standard errors. But conceptually, 2SLS is building on the same logic: isolate the part of treatment variation generated by the instrument, then use that variation to estimate the causal effect on the outcome.
If you ever run a software command and want to know whether the coefficient makes sense, compute the Wald ratio by hand from group means. It is one of the fastest ways to verify that your setup is behaving as expected.
Common mistakes when calculating IV LATE manually
- Using percentages like 60 instead of proportions like 0.60 for treatment rates, then forgetting to keep units consistent.
- Mixing adjusted and unadjusted means. A hand calculation should usually use a consistent set of simple group averages unless you are explicitly deriving a covariate adjusted estimator.
- Ignoring the sign of the first stage. If the instrument lowers treatment, the denominator may be negative, which changes interpretation.
- Claiming the estimate applies to everyone. LATE is local to compliers, not automatically the average treatment effect for the full population.
- Overlooking weak instruments. A tiny denominator can make the estimate unstable.
Interpreting magnitude and external validity
A large IV estimate is not automatically implausible, and a small one is not automatically weak. The key question is whose effect you are estimating. If compliers are especially responsive to treatment, their treatment effect may differ materially from the average effect in the overall sample. This is a feature of IV, not a bug, but it means you should always describe the instrument, the compliance margin, and the likely complier population.
For policy work, external validity often hinges on whether future target populations resemble the complier group induced by the instrument in your study. An admission cutoff, a transportation shock, and a physician preference instrument each define different complier groups. The hand calculation gives you the point estimate, but interpretation requires substantive knowledge of the setting.
When a hand calculation is enough and when it is not
The by hand Wald ratio is enough for intuition, quick validation, and simple binary instrument examples. It is not enough if you need standard errors, confidence intervals, covariate adjustment, multiple instruments, overidentification tests, weak instrument diagnostics, or clustered inference. In those cases, formal econometric software is essential. Still, even advanced users benefit from checking the raw means first. A clean hand calculation often reveals problems that would otherwise remain hidden inside a regression table.
Authoritative resources for deeper study
If you want a more formal treatment of instrumental variables and causal inference, these sources are excellent starting points:
- Harvard University causal inference materials
- Boston University School of Public Health IV methods guide
- National Institutes of Health discussion of causal inference methods
Final takeaway
To calculate LATE by hand, you only need four numbers in the simplest binary instrument design: the outcome mean for Z = 1 and Z = 0, plus the treatment rate for Z = 1 and Z = 0. Subtract outcomes to get the reduced form, subtract treatment rates to get the first stage, and divide. That ratio is the Wald estimator. It is one of the most important formulas in applied causal inference because it turns a difficult selection problem into a transparent comparison built on instrument induced variation. The calculator above helps you perform that computation instantly, but the real value is understanding what each component means and why the assumptions matter.