graph LR
A(Outcomes &<br>Events) --> B(Probability) --> C(Random<br>Variables) --> D(Probability<br>Distributions)
POP88162 Introduction to Quantitative Research Methods
Department of Political Science, Trinity College Dublin
graph LR
A(Outcomes &<br>Events) --> B(Probability) --> C(Random<br>Variables) --> D(Probability<br>Distributions)
flowchart LR
A((Population))
B((Sample))
A-- Probability -->B
B-- Inference -->A
Imagine a world in which political candidates are selected by lot (sortition).
Three Parties:
Candidates can be of two genders:
Six possible candidates:
Hypothetical trial: roll a 🎲 to pick a candidate.
Uncertainty: we don’t know which candidate will be selected.
One possible outcome:
picking 👩🏽🌾 (♀️🌳)
Sample space \(S\):
{👩🏻🔧, 👨🏿⚕️, 🧑🏾⚖️, 🧑🏻💼, 👩🏽🌾, 👨🏿🎨}
An event \(A\):
selecting a ♀️
Any of these outcomes would make us say that an event \(A\) has occurred:
{👩🏻🔧, 🧑🏾⚖️, 👩🏽🌾}
\[P(A) = \frac{\text{Number of elements in A}}{\text{Number of elements in }S}\]
\[P(A) = \frac{\text{Number of elements in A}}{\text{Number of elements in }S}\]
is actually rather naive.
How do we map the possible outcomes of sortition to numbers in our data?
Using random variables.
Consider the sample space:
{👩🏻🔧, 👨🏿⚕️, 🧑🏾⚖️, 🧑🏻💼, 👩🏽🌾, 👨🏿🎨}
Let \(Y\) be the selection of a ♀️ candidate:
Y(👩🏻🔧) = Y(🧑🏾⚖️) = Y(👩🏽🌾) = 1
Y(👨🏿⚕️) = Y(🧑🏻💼) = Y(👨🏿🎨) = 0
These 0’s and 1’s are what we actually see in our data.
In other words, random variable \(Y\) provides the numerical summary of the candidate draw with our question (selection of a ♀️ candidate) in mind.
The source of randomness is that we don’t know which candidate will be selected.
Imagine that instead of being interested of the selection of a ♀️ candidate we are interested in selection of a ️🌳 candidate.
We have the same sample space: {👩🏻🔧, 👨🏿⚕️, 🧑🏾⚖️, 🧑🏻💼, 👩🏽🌾, 👨🏿🎨}
But another random variable \(X\) maps the same outcomes differently than \(Y\):
X(👩🏻🔧) = X(🧑🏾⚖️) = X(👨🏿⚕️) = X(🧑🏻💼) = 0
X(👩🏽🌾) = X(👨🏿🎨) = 1
Alternatively, rather than focussing on ️🌳, we may choose the variable \(X\) to map the selection of a candidate from any party such that:
X(👩🏻🔧) = X(👨🏿⚕️) = 1 for 🛠️
X(🧑🏾⚖️) = X(🧑🏻💼) = 2 for 🏦
X(👩🏽🌾) = X(👨🏿🎨) = 3 for 🌳
Note that since these all are categorical variables the actual numbers assigned by random variables are somewhat arbitrary.
| Party | \(P(Y)\) |
|---|---|
| 🛠️ | 0.33 |
| 🏦 | 0.33 |
| 🌳 | 0.33 |