Why read this?
Questions of attribution are everywhere: i.e., did
One alternative to solve the problem of attribution is to reason in the following manner: if there is no possible alternative causal process, which does not involve
In the above alternative, however, the reasoning is tailored to a specific event under consideration. What if we are interested in studying a general tendency of a given effect? In this case, we are asking how sufficient is a cause,
In this blogpost, we will give counterfactual interpretations to both probabilities:
Finally, we will work through some examples to put what we have learnt into practice. This blogpost follows the notation of Pearl’s Causality, Chapter 9 and Pearl’s Causal Inference in Statistics: A primer.
Counterfactual definitions
Let
Probability of Necessity
The Probability of Necessity (PN) stands for the probability that the event
To gain some intuition, imagine Ms. Jones: a former cancer patient that underwent both a lumpectomy and irradiation. She speculates: do I owe my life to irradiation? We can study this question by figuring out how necessary was the lumpectomy for the remission to occur:
If
Probability of Sufficiency
On the other hand:
The Probability of Sufficiency (PS) measures the capacity of
to produce , and, since “production” implies a transition from absence to presence, we condition on situations where x and y are absent
Therefore:
The following example may clarify things. Imagine that, contrary to Ms. Jones above, Mrs. Smith had a lumpectomy alone and her tumor recurred. She speculates on her decision and concludes: I should have gone through irradiation. Is this regret warranted? We can quantify this by speaking about
Combining both probabilities
We can compute the probability that the cause is necessary and sufficient thus:
That is, the contribution of
Idenitfiability
In the general case, when we have a causal diagram and observed (and experimental) data, neither
In practice, if we don’t know the functional relationship, we must at least assume monotonicity of
What is Monotonicity?
Let
That is, exposure to treatment
Identifying the probabilities of causation
If we are willing to assume that Y is monotonic relative to X, then both
Moreover, if monotonicity does not hold, the above expression becomes a lower bound for
Equivalently,
Which becomes the lower bound if we are not willing to assume monotonicity:
Let’s use the estimators and the bounds in the following example.
A first example
The experimental data provide the estimates [A lawsuit is filed against the manufacturer of a drug X that was supposed to relieve back-pain. Was the drug a necessary cause for the the death of Mr. A?
]
Therefore, assuming that the drug could only cause (but never prevent death), monotonicity holds:
The plaintiff was correct; barring sampling errors, the data provide us with 100% assurance that drug x was in fact responsible for the death of Mr A.
A second example
Remember Ms. Jones? Is she right in attributing her recovery to the irradiation therapy. Suppose she gets her hands on the following data:
We can therefore start to bound
We don’t have data for
Therefore, irradiation was more likely than not necessary for her remission.
Conclusion
In this blogpost, we saw how we can analyze attribution problems by giving counterfactual interpretations to the probability that a cause is necessary and/or sufficient. These quantities turn out be generally not identifiable because they are sensitive to the specific functional relationships that connect
However, we can give theoretically sharp bounds for them by combining experimental and observational data. If we are willing to assume monotonicity, the bounds collapse to give a point estimate for both probabilities of causation,