POP77032 Quantitative Text Analysis for Social Scientists
402 - Incentives: Positive
402 - Incentives: Positive
403 - Market Regulation
A weight-judging competition was carried on at the annual show of the West of England Fat Stock and Poultry Exhibition recently held at Plymouth. A fat ox having been selected, competitors bought stamped and numbered cards, for 6d. each, on which to inscribe their respective names, addresses, and estimates of what the ox would weigh after it had been slaughtered and “dressed.” Those who guessed most successfully received prizes. About 800 tickets were issued, which were kindly lent me for examination after they had fulfilled their immediate purpose. These afforded excellent material. The judgments were unbiassed by passion and uninfluenced by oratory and the like. The sixpenny fee deterred practical joking, and the hope of a prize and the joy of competition prompted each competitor to do his best. The competitors included butchers and farmers, some of whom were highly expert in judging the weight of cattle; others were probably guided by such information as they might pick up, and by their own fancies. The average competitor was probably as well fitted for making a just estimate of the dressed weight of the ox, as an average voter is of judging the merits of most political issues on which he votes, and the variety among the voters to judge justly was probably much the same in either case.
According to the democratic principle of “one vote one value,” the middlemost estimate expresses the vox populi, every other estimate being condemned as too low or too high by a majority of the voters […]. Now the middlemost estimate is 1207 lb., and the weight of the dressed ox proved to be 1198 lb.; so the vox populi was in this case 9 lb., or 0.8 per cent. of the whole weight too high.
| Type | Test Design | Cause of Disagreement |
|---|---|---|
| Stability | rest-retest | intraobserver inconsistencies |
| Reliability | test-test | interobserver disagreements |
| Validity | test-standard | deviations from a standard |
| Unit | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Coder 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Coder 2 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 |
\[ \begin{array}{c|cc|c} & 0 & 1 & \text{Total} \\ \hline 0 & 10 & 4 & 14 \\ 1 & 4 & 2 & 6 \\ \hline \text{Total} & 14 & 6 & 20 \end{array} \]
\[ \begin{array}{c|cc|c} & 0 & 1 & \text{Total} \\ \hline 0 & 9.6 & 4.4 & 14 \\ 1 & 4.4 & 1.6 & 6 \\ \hline \text{Total} & 14 & 6 & 20 \end{array} \]
where \(e_{01} = e_{10} = n_0 \times n_1/(n-1)\)
Thus, \(\text{Krippendorff's } \alpha = 1 - \frac{D_o}{D_e} = 1 - \frac{4}{4.4211} = 0.095\), which is quite low.
irr package: Krippendorff's alpha
Subjects = 10
Raters = 2
alpha = 0.0952
Unsupervised modelling:
Supervised modelling:
learning latent structure from unlabelled data
learning a relationship between inputs and labelled data
E.g. principal component analysis of a DTM
E.g. sentiment analysis using a training set of positive and negative reviews
\[ \begin{array}{c|cc} \textbf{Predicted / True} & \textbf{Positive} & \textbf{Negative} \\ \hline \textbf{Positive} & TP & FP \\ \textbf{Negative} & FN& TN \\ \end{array} \]
\[ P(A|B) = \frac{P(A, B)}{P(B)} \]
\[ \color{red}{P(A|B) = \frac{P(A)P(B|A)}{P(B)}} \]
\[ P(c_k | w_j) = \frac{P(c_k) P(w_j | c_k)}{P(w_j)} \]
\[ P(c_k | w_j) = \frac{P(c_k) P(w_j | c_k)}{P(c_k) P(w_j | c_k) + P(c_{\neg{k}}) P(w_j | c_{\neg{k}})} \]
where \(c_{\neg{k}}\) is the class alternative to class \(c_k\).
\[ P(c_k | w_j) = \frac{P(c_k) \color{red}{P(w_j | c_k)}}{P(c_k) \color{red}{P(w_j | c_k)} + P(c_{\neg{k}}) \color{red}{P(w_j | c_{\neg{k}})}} \]
\[ P(c_k | w_j) = \frac{\color{red}{P(c_k)} P(w_j | c_k)}{\color{red}{P(c_k)} P(w_j | c_k) + \color{red}{P(c_{\neg{k}})} P(w_j | c_{\neg{k}})} \]
\[ \color{red}{P(c_k | w_j)} = \frac{P(c_k) P(w_j | c_k)}{P(c_k) P(w_j | c_k) + P(c_{\neg{k}}) P(w_j | c_{\neg{k}})} \]
\[ P(c_k | d_i) = P(c_k) \prod_{j=1}^J \frac{P(w_j|c_k)}{P(w_j)} \]